-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: create catalog build python package (#251) #258
Conversation
genomes_df = add_gene_model_url(genomes_df) | ||
report_missing_values("accessions", "matched with gene model URLs", genomes_df["accession"], genomes_df["geneModelUrl"].astype(bool)) | ||
else: | ||
genomes_df["geneModelUrl"] = "" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I've added an option to disable gene model URLs, because the process of getting them can take some time which makes it harder to test changes to the catalog build script -- not certain this will actually come in handy in the future but there were a couple times here where it would've been useful so I figure it's a nice thing to have available
if len(missing_values) > 0: | ||
if len(missing_values) > len(values_series)/2: | ||
present_values = values_series[present_values_mask] | ||
print(f"Only {len(present_values)} of {len(values_series)} {values_name} {message_predicate}: {", ".join(present_values)}") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If there are more missing values than present values, the present values are reported instead, which makes the message more meaningful in extreme cases
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thx @hunterckx
Closes #251
Drafted for now in case further updates are needed to support galaxyproject/ga2#23