-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update getting_started & sce_file_contents with celltyping instructions #193
Update getting_started & sce_file_contents with celltyping instructions #193
Conversation
…n a subsection, not the bullet list of available celltypes
…ecide later which to use
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This mostly looks good, but I did have some comments on overall organization that I left. Something I didn't mention in my comments is that we should include a few resources on cell type annotation at the end of the section. This would mirror the other sections.
docs/getting_started.md
Outdated
- Automated annotations from [`SingleR`](https://bioconductor.org/packages/release/bioc/html/SingleR.html), a reference-based method ([Looney _et al._ 2019](https://doi.org/10.1038/s41590-018-0276-y)). | ||
- Automated annotations from [`CellAssign`](https://github.com/Irrationone/cellassign), a marker-gene based method ([Zhang _et al._ 2019](https://doi.org/10.1038/s41592-019-0529-1)). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- Automated annotations from [`SingleR`](https://bioconductor.org/packages/release/bioc/html/SingleR.html), a reference-based method ([Looney _et al._ 2019](https://doi.org/10.1038/s41590-018-0276-y)). | |
- Automated annotations from [`CellAssign`](https://github.com/Irrationone/cellassign), a marker-gene based method ([Zhang _et al._ 2019](https://doi.org/10.1038/s41592-019-0529-1)). | |
- Annotations determined by using [`SingleR`](https://bioconductor.org/packages/release/bioc/html/SingleR.html), a reference-based method ([Looney _et al._ 2019](https://doi.org/10.1038/s41590-018-0276-y)). | |
- Annotations determined by using [`CellAssign`](https://github.com/Irrationone/cellassign), a marker-gene based method ([Zhang _et al._ 2019](https://doi.org/10.1038/s41592-019-0529-1)). |
I don't know if this is much better, but we didn't take annotations from these methods, we performed annotation using those methods.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this could be a good middle ground - Annotations determined by <method>, an automated <thing-based> method
docs/getting_started.md
Outdated
|
||
#### Submitter-provided annotations | ||
|
||
To access submitter-provided annotations in the `SingleCellExperiment`, use the following command: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we should add a note that not all libraries will contain submitter annotations. Although everything should include CellAssign/SingleR, it's much more likely that these will be missing.
@@ -203,6 +203,167 @@ See these resources for more information on clustering: | |||
- [Quantifying clustering behavior in Orchestrating Single Cell Analysis](https://bioconductor.org/books/release/OSCA.advanced/clustering-redux.html#quantifying-clustering-behavior) | |||
|
|||
|
|||
### Cell type annotation |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Generally, I think we want to add a little bit more context to this section. I would describe the references used in this first section. I would also mention the supplemental cell type report and that it's a good resource for evaluating the provided cell type annotations.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I added 2 links for cell annotation, but we could add more! Perhaps from this list we have going here? https://github.com/AlexsLemonade/training-specific-template/blob/main/additional-resources/single-cell-resources.md#cell-type-annotation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@allyhawkins this comment may have fallen by the wayside, any other links here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think what you added is good!
Co-authored-by: Ally Hawkins <[email protected]>
…tonal rearrangement and add code to determine which cell types are present
…ith regular annotation code
Thanks for the feedback, ready for another look! Note that I also updated the "intro" section here to show how to determine which cell types are present. This could use some feedback about wording. Also...
For this, I also popped a bullet into the intro section to say that we dont always expect submitter. What do you think of that approach vs text in the submitter section directly? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Organization looks good! I just had a few comments about word choice. Also we may want to revisit the two additions to the metadata based on incoming scpca-nf changes.
docs/sce_file_contents.md
Outdated
@@ -136,8 +136,11 @@ metadata(sce) # experiment metadata | |||
| `singler_results` | If cell typing with `SingleR` was performed, the full result object returned by `SingleR` annotation. Only present for `processed` objects | | |||
| `singler_reference` | If cell typing with `SingleR` was performed, the name of the [`celldex`](http://bioconductor.org/packages/release/data/experiment/html/celldex.html) reference dataset used for annotation. Only present for `processed` objects | | |||
| `singler_reference_label` | If cell typing with `SingleR` was performed, the name of the label in the reference dataset used for annotation. Only present for `processed` objects | | |||
| `singler_reference_version` | If cell typing with `SingleR` was performed with ontology labels, the [`celldex`](http://bioconductor.org/packages/release/data/experiment/html/celldex.html) package version used for the reference dataset | |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just to note that I'm going to file a PR today with adding this into the SCE object. I am adding both package and version so it will be celldex_1-12-1
or whatever the version is. So we might update this once we make a decision regarding how we are storing this information.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ok, I'll circle back once that PR is set then (we can block this PR as needed, probably dont need to for this situation)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
And we're back! In the end, we did the following. Names here still need to be updated, but the components are not changed.
Co-authored-by: Ally Hawkins <[email protected]>
I have updated this with the final naming scheme and information we've used in
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for catching that! I made the update in AlexsLemonade/scpca-nf#569. And then also made some small suggestions here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM 👍
@@ -203,6 +203,167 @@ See these resources for more information on clustering: | |||
- [Quantifying clustering behavior in Orchestrating Single Cell Analysis](https://bioconductor.org/books/release/OSCA.advanced/clustering-redux.html#quantifying-clustering-behavior) | |||
|
|||
|
|||
### Cell type annotation |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think what you added is good!
Closes #165
Closes #192
This PR updates docs with more cell typing information. Note that this is more or less ready for review, but I'm opening as a Draft PR specifically because some of the changes (celldex/panglaodb versions) here depend on forthcoming updates as part of AlexsLemonade/scpca-nf#538. So, we should not move to final.final approval until that is done.
Most changes here are in
getting_started.md
. I have subsections for each possible annotation type, and I go show SCE then AnnData for each set of tasks one might do. Noting a different way to break this up could be to show all SCE things for a given annotation, and then all AnnData things.You'll also notice these sentences:
We currently don't need these sentences, but we could use them instead if we don't want to show how to pull out reference information from the metadata, and instead point folks to that table. Let me know which approach we prefer - showing the code here, or having this sentence instead!
As indicated, I went ahead and took care of #192 since it was so quick. I also removed some periods from the ends of table contents per overall style since I noticed them while scrolling through.