You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello,
I want to check if GCA accessions that I pulled from genbank have CDSs, before filtering further to see if I want to download. I used the commands --formats cds-fasta to only look at CDS and -n to check rather than download. However, -n it returns all the GCAs I input, not just ones with CDS files.
I want it to check if a CDS exists without downloading yet, is there a way to do this?
thanks,
Elinor
The text was updated successfully, but these errors were encountered:
I just had this same question! I ended up doing it outside of ncbi-genome-download. I'm 95% sure my solution is correct :D
The genbank or refseq annotation_hashes.txt file has a columns named "Features hash" and "Proteins name hash."
When the values of either of those columns are "D41D8CD98F00B204E9800998ECF8427E", that indicates the file does not exist.
Note the annotation_hashes.txt file only exists under subsets of refseq/genbank
Hello,
I want to check if GCA accessions that I pulled from genbank have CDSs, before filtering further to see if I want to download. I used the commands
--formats cds-fasta
to only look at CDS and-n
to check rather than download. However, -n it returns all the GCAs I input, not just ones with CDS files.I want it to check if a CDS exists without downloading yet, is there a way to do this?
thanks,
Elinor
The text was updated successfully, but these errors were encountered: