Reader for seqFISH data #53

LLehner · 2023-06-19T15:32:33Z

Add reader for seqFISH data.

for more information, see https://pre-commit.ci

codecov-commenter · 2023-06-19T15:34:39Z

Codecov Report

Merging #53 (9f41cbe) into main (755d475) will increase coverage by 0.68%.
Report is 22 commits behind head on main.
The diff coverage is 44.66%.

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #53      +/-   ##
==========================================
+ Coverage   41.92%   42.60%   +0.68%     
==========================================
  Files          16       17       +1     
  Lines         854      946      +92     
==========================================
+ Hits          358      403      +45     
- Misses        496      543      +47

| Files Changed | Coverage Δ |
|---|---|
| src/spatialdata_io/readers/merscope.py | 25.00% <8.33%> (-2.03%) ⬇️ |
| src/spatialdata_io/readers/seqfish.py | 33.96% <33.96%> (ø) |
| src/spatialdata_io/__init__.py | 100.00% <100.00%> (ø) |
| src/spatialdata_io/_constants/_constants.py | 100.00% <100.00%> (ø) |
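
The headline percentages in the report follow from the hit and line counts in the coverage diff. As a quick check (a sketch only; `coverage_pct` is a hypothetical helper, and Codecov's own rounding may differ in edge cases):

```python
def coverage_pct(hits, lines):
    # Coverage is the share of tracked lines that the tests hit.
    return 100 * hits / lines


base = coverage_pct(358, 854)  # counts reported for main
head = coverage_pct(403, 946)  # counts reported for this PR
print(f"{base:.2f}% -> {head:.2f}% ({head - base:+.2f})")
```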

timtreis

Very minor changes

src/spatialdata_io/__init__.py

src/spatialdata_io/readers/seqfish.py

giovp · 2023-06-20T17:21:39Z

Remember to add it to docs/api.md, and also add codex there (I think it was missing from #34). Also, please delete the "coming soon" section in index.md. You can delete this whole thing.

EDIT: the same applies to the README.md of the repo. Please check that index.md and README.md always correspond, thank you!

timtreis · 2023-06-20T23:37:05Z

It seems impossible for me to download that 85 GB monster of seqFISH data to actually validate the reader. @giovp, could you maybe try it out?

LucaMarconato · 2023-06-21T17:11:51Z

@LLehner please test with napari before merging. If the performance is bad, I suggest writing the sdata object to Zarr and reading it back (the first lazy representation reads from non-performant disk storage, whereas after saving and reading again, Zarr and Parquet are used).

LLehner · 2023-06-22T10:48:19Z

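@LucaMarconato
It seems to work. Here is an example of points visualization:

from napari_spatialdata import Interactive
from spatialdata_io import seqfish
# path: directory containing the seqFISH data
sdata = seqfish(path=path)
interactive = Interactive(sdata)
interactive.run()

Then selecting global > transcripts_1 results in: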

LucaMarconato · 2023-07-15T13:50:44Z

Great work @LLehner!

A few comments:

As Liang Ding reported, there was a bug in the indexing, which I fixed (you were using the last value of the enumerate() index, which gave len - 1).
I noticed that ImageModel.parse() was not using scale_factors even though the images are big. This leads to poor performance because no intermediate resolutions are computed. I fixed this in my last commit.
Similarly, no chunking was used for the images (in this case the images are so big that it was not possible to save them to disk). You can see this in the screenshot below.
I will make this point clearer in the docs/notebooks, because it is not obvious that one should pay attention to the chunk shape/size of the data by using the chunks parameter in the parser. Please note that when using scale_factors, the chunking is computed automatically to get an efficient representation (this is also not obvious from the docs).
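
The indexing bug from the first point can be reproduced in isolation. The following is a minimal sketch, not the reader's actual code; `section_files`, `buggy_names` and `fixed_names` are hypothetical names chosen for illustration. It shows that the index left over after a `for ... in enumerate(...)` loop holds the last value, `len - 1`, so reusing it outside the loop gives every element the same number.

```python
section_files = ["section_1.tiff", "section_2.tiff", "section_3.tiff"]  # hypothetical inputs


def buggy_names(files):
    # Run a first loop, then reuse its leftover index afterwards.
    for i, _ in enumerate(files):
        pass
    # `i` is now stuck at len(files) - 1, so every name gets the same suffix.
    return [f"image_{i + 1}" for _ in files]


def fixed_names(files):
    # Take the index from the same iteration that produces the element.
    return [f"image_{i}" for i, _ in enumerate(files, start=1)]


print(buggy_names(section_files))
print(fixed_names(section_files))
```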

I think that in the napari screenshot you showed, you didn't save the data to disk first. To harness the chunked and multiscale representation of the data, one first needs to save to Zarr (sdata.write()) and then read it again. Otherwise the visualization of the images is very inefficient and napari hangs. I will update the docs/notebooks to make this clearer, as it is not straightforward.
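
To make the multiscale point concrete, here is a rough sketch of how a list of relative scale factors turns into a pyramid of resolutions. It is an illustration in plain Python, not the spatialdata implementation: `pyramid_shapes` is a hypothetical helper, each factor is assumed to downsample the previous level along y and x, and the rounding (ceiling division) is an assumption as well.

```python
def pyramid_shapes(shape, scale_factors):
    """Return the (y, x) shape of every level, full resolution first."""
    y, x = shape
    levels = [(y, x)]
    for factor in scale_factors:
        # Ceiling division so a level never collapses to zero pixels.
        y = -(-y // factor)
        x = -(-x // factor)
        levels.append((y, x))
    return levels


for level, level_shape in enumerate(pyramid_shapes((20000, 30000), [2, 2, 2])):
    print(level, level_shape)
```

A viewer can then show a coarse level when zoomed out and load full-resolution data only for the visible region, which presupposes that the levels have been stored in chunked form.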

I will now try visualizing the data with napari.

LucaMarconato · 2023-07-18T18:11:32Z

@LLehner I downloaded the data on my machine and used napari to view it. There are some bugs that I'd ask you to fix, but it's almost there; they are all very quick to address.

the key names for images and labels are swapped
the "cells" (shapes) should also be per section: cells_1, cells_2, cells_3, instead of one element with all the cells. Otherwise they overlap in space when plotted.
the cells should have a radius that is deduced from the area, not a constant radius. We also do this for xenium(); you can copy the code from there
finally, we should not have a single coordinate system called global here, but three coordinate systems, one per sample/fov. Please check the mibitof/to_zarr.py file in the sandbox, or cosmx() and I think also steinbock(), to see how to do that.
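
The requested changes for the cells can be sketched without any library. The snippet below is only an illustration under stated assumptions: the record fields (`section`, `x`, `y`, `area`), the helper names and the coordinate system names are all hypothetical, and the radius is taken as that of a circle with the same area, r = sqrt(area / pi), which is one plausible reading of "deduced from the area". The actual xenium() code may differ.

```python
import math
from collections import defaultdict

# Hypothetical per-cell records; the real table columns may be named differently.
cells = [
    {"section": 1, "x": 10.0, "y": 12.0, "area": 78.5},
    {"section": 1, "x": 40.0, "y": 15.0, "area": 314.0},
    {"section": 2, "x": 11.0, "y": 13.0, "area": 50.0},
]


def radius_from_area(area):
    # Radius of the circle that has the given area.
    return math.sqrt(area / math.pi)


def shapes_per_section(records):
    # One shapes element per section (cells_1, cells_2, ...) instead of a single
    # element holding all cells, so sections do not overlap when plotted.
    grouped = defaultdict(list)
    for rec in records:
        circle = {"x": rec["x"], "y": rec["y"], "radius": radius_from_area(rec["area"])}
        grouped[f"cells_{rec['section']}"].append(circle)
    return dict(grouped)


def coordinate_systems(element_names):
    # Map each element to a coordinate system named after its section,
    # rather than placing everything in one shared "global" system.
    return {name: f"section_{name.rsplit('_', 1)[1]}" for name in element_names}


shapes = shapes_per_section(cells)
print({name: len(circles) for name, circles in shapes.items()})
print(coordinate_systems(shapes))
```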

LucaMarconato · 2023-07-18T18:15:23Z

I also noticed that

the gene expression and obs from the table can't be plotted

Not sure why; I get two different weird errors, one for obs and one for expression. Once the rest is addressed we should look into this. To do this I usually run napari/vscode from PyCharm and go with breakpoints. I can also look into this if you want, please keep me posted.

giovp · 2024-02-05T17:30:06Z

Is this still planned to be included, @LucaMarconato @LLehner?

LucaMarconato · 2024-02-05T21:46:33Z

Yes, it's on the todo list; I still haven't had time to test it.

LucaMarconato · 2024-06-16T15:12:09Z

@LLehner I addressed all the task items from this conversation that were still open. I have also added a converter script in the sandbox and tested the reader with napari-spatialdata. Finally, I added extra arguments to be able to parse a subset of the elements and of the sections (useful when debugging).

Thanks again for the work on this PR, ready to merge now (the failing tests are due to the fact that we need to make a new release in spatialdata and are unrelated to this PR).

LLehner and others added 2 commits June 19, 2023 17:30

Add reader for seqFISH data

d477f46

[pre-commit.ci] auto fixes from pre-commit.com hooks

ad967c6

for more information, see https://pre-commit.ci

LLehner and others added 3 commits June 19, 2023 17:50

Fix list index

d5c37e8

Merge branch 'seqFISH_reader' of https://github.com/LLehner/spatialda…

73f16e2

…ta-io into seqFISH_reader

[pre-commit.ci] auto fixes from pre-commit.com hooks

42ed80e

timtreis requested changes Jun 19, 2023

src/spatialdata_io/__init__.py (outdated, resolved)

src/spatialdata_io/readers/seqfish.py (outdated, resolved; 3 threads)

LLehner and others added 2 commits June 19, 2023 22:53

Add changes

74c296c

[pre-commit.ci] auto fixes from pre-commit.com hooks

c6001e0

LLehner requested a review from timtreis June 20, 2023 12:31

Merge branch 'main' into seqFISH_reader

22f0d7e

LLehner and others added 2 commits June 21, 2023 23:28

Fix images, labels, points

4d36911

[pre-commit.ci] auto fixes from pre-commit.com hooks

8231c3b

LucaMarconato mentioned this pull request Jul 12, 2023

draft seqfish giovp/spatialdata-sandbox#26

Closed

LucaMarconato and others added 2 commits July 15, 2023 15:43

fix wrong index, using multiscale for images and labels

fb2444b

[pre-commit.ci] auto fixes from pre-commit.com hooks

2b6752e

fix index in element names

4bad71b

Fix key for images and labels

d56d5b3

LLehner changed the title ~~Reader for seqFISH files~~ Reader for seqFISH data Jul 27, 2023

LLehner and others added 4 commits July 27, 2023 16:24

Add shapes per section

c0b40a7

[pre-commit.ci] auto fixes from pre-commit.com hooks

d5a3e79

Merge branch 'scverse:main' into seqFISH_reader

3c9b933

Fix radius

c341044

LLehner added 2 commits August 4, 2023 14:18

Add instance_id to .obs before required in ShapesModel

7a8ab20

Fix instance key value

9f41cbe

LLehner marked this pull request as draft August 20, 2023 11:41

Merge branch 'scverse:main' into seqFISH_reader

740b9a4

LLehner added 2 commits February 21, 2024 16:02

Fix shapes

118c897

Update shapes

11661c3

LLehner mentioned this pull request Jun 12, 2024

seqFISH reader #49

Closed

2 tasks

LucaMarconato added 3 commits June 16, 2024 16:58

improvements seqfish reader

8f2a875

Merge branch 'main' into seqFISH_reader

3a147ee

fix pre-commit

ed06295

LucaMarconato marked this pull request as ready for review June 16, 2024 15:08

LucaMarconato merged commit af90e05 into scverse:main Jun 16, 2024
2 of 5 checks passed
