Database build fails when results from multiple antiSMASH versions are present #346

Open
matinnuhamunada opened this issue May 1, 2024 · 0 comments

Comments

@matinnuhamunada
Collaborator

The command bgcflow build database fails when the data_warehouse contains results from multiple antiSMASH versions. This happens because dbt fetches the Parquet files from all versions, and the tests then fail because the ids (genome_id, region_id, etc.) are duplicated. The dirty fix is to clean up the processed folder so that it only contains results from one antiSMASH version. A better fix would be to improve the dbt schema so that it only uses the latest version of the results.
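To illustrate the "only use the latest version" idea, here is a minimal Python sketch of the selection logic. It is not the bgcflow or dbt implementation: the `antismash_version` field name, the row layout, and the choice to pick the latest version per genome are assumptions made for the example.

```python
from __future__ import annotations

from typing import Iterable


def version_key(version: str) -> tuple[int, ...]:
    """Turn '7.1.0' into (7, 1, 0) so versions compare numerically, not as text."""
    return tuple(int(part) for part in version.split("."))


def keep_latest_version(rows: Iterable[dict]) -> list[dict]:
    """Keep only the rows produced by the newest antiSMASH version of each genome.

    Assumes every row has 'genome_id' and a hypothetical 'antismash_version' field.
    """
    rows = list(rows)
    latest: dict[str, str] = {}
    # First pass: find the newest version seen for each genome.
    for row in rows:
        gid, ver = row["genome_id"], row["antismash_version"]
        if gid not in latest or version_key(ver) > version_key(latest[gid]):
            latest[gid] = ver
    # Second pass: drop rows that come from an older version.
    return [r for r in rows if r["antismash_version"] == latest[r["genome_id"]]]


rows = [
    {"genome_id": "G1", "region_id": "G1.r1", "antismash_version": "6.1.1"},
    {"genome_id": "G1", "region_id": "G1.r1", "antismash_version": "7.0.0"},
    {"genome_id": "G2", "region_id": "G2.r1", "antismash_version": "6.1.1"},
]
deduplicated = keep_latest_version(rows)
```

After this filter each region_id appears once, which is the condition the uniqueness tests check. The same rule could be expressed in the dbt models themselves, for example by ranking rows by version within each genome and keeping the top one.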
