Use skani for genome clustering #670
Replies: 1 comment
-
Copy of discussion from #672 Do you have some recommendations about the alignment fraction? My intuition was to use the max alignment fraction as a small genome fragment aligned to a large genome with high ANI would indicate that the two should be clustered together. Anyway, the larger genome would be chosen as the cluster representative. Maybe even better would be to remove the smaller genome if the difference between the two genomes is too large...
|
Beta Was this translation helpful? Give feedback.
-
Skani
The tool Skani claims to be better and faster than the combination of mash + FastANI as used by dRep
I implemented the skin for species clustering.
We now do the species clustering in the
atlas run binning
step.So you get information about the number of dereplicated species in the binning report. This allows you to run different binners before choosing the one to use for the genome annotation.
Also, the file storage was improved all important files are in
Binning/{binner}/
My custom species clustering does the following steps:
This ensures that not a bad genome evaluated on the general model is preferred over a better genome evaluated on the specific model.
See also https://silask.github.io/post/better_genomes/ Section 2.
New Contributors
Full Changelog: v2.16.3...v2.17.0
This discussion was created from the release Use skani for genome clustering.
Beta Was this translation helpful? Give feedback.
All reactions