-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Using recall for somatic samples #9
Comments
Daniel; I don't know of existing workflows for large numbers of tumor-only samples to have a great suggestions about setting parameters for that case. How many samples are you looking to call together? What caller were you targeting? The best approach to tweaking this is probably to add an additional caller target (say, freebayes-somatic) that has the specific tweaks for somatic calling instead of germline. Happy to help with this if you have more specifics about the command lines you're looking to run. Thanks again. |
Thanks Brad. Yes tweaking by adding somatic specific callers would probably
|
I don't know of ready to run approaches to this, or validations to demonstrate how much it helps versus standard tumor/normal analysis. @brentp and @arq5x mentioned they were hoping to work on this with FreeBayes so might have some advice. FreeBayes is a good first target for this since it already handles both tumor/normal and pooled germline cases, and is sensitive and precise on germline calls. For 48 samples, my suggestion would be to do a workflow like:
So I wouldn't try to do anything fancy like recalling, and then evaluate this versus standard tumor/normal with a caller like VarDict to see if you're getting improved resolution, especially of low frequency variants. I'd be very interested in hearing how the experiment turns out. Hope this helps and thanks for all the discussion. |
No problem, it seems like it is a bit of an under-looked at piece, so I'm happy to do some experimentation in this area. All of this is part of the pipeline construction/validation phases prepping for clinical work. In our case we don't have matched normals, for clinical sequencing this isn't typically being done due to cost constraints coupled with working with smaller targeted panels and only reporting on a subset of discovered variants. |
Daniel; https://github.com/chapmanb/bcbio-nextgen/blob/4c57c0666e77b442013cb658a750b40afc466ea6/bcbio/variation/freebayes.py#L92 I'd definitely have interest in hearing your results. |
Daniel -- it would also be worth following this FreeBayes thread: freebayes/freebayes#228 Erik and Brent are talking about more generalized approaches for handling multi-sample tumor calling. |
Thanks for the heads up, much appreciated. |
Hi Brad,
I am assuming the recall jar uses optimized parameter settings for the variant caller based on bcbio settings. I've been looking at exploring incremental join calling on tumour only samples. I use parameter settings largely similar to bcbio but of course with some specific tweaks and threshold values for acceptable allele frequencies. Is it possible to tweak recall to also pass command-line parameters to the caller?
The text was updated successfully, but these errors were encountered: