gnomAD: How are mutation rates calculated? #3709
Unanswered
iris-garden
asked this question in
Support Requests
Replies: 2 comments
-
Note The following post was exported from discuss.hail.is, a forum for asking questions about Hail which has since been deprecated. (Oct 17, 2023 at 17:24) iris-garden said:hi, this forum is primarily for support for the hail library, so i think you would likely have more luck reaching out to the gnomad team directly over email at [email protected]. hope that helps! |
Beta Was this translation helpful? Give feedback.
0 replies
-
Note The following post was exported from discuss.hail.is, a forum for asking questions about Hail which has since been deprecated. (Oct 19, 2023 at 05:18) agastya said:Thanks, I’ll do that. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Note
The following post was exported from discuss.hail.is, a forum for asking questions about Hail which has since been deprecated.
(Oct 16, 2023 at 06:29) agastya said:
Hi everyone, I apologize if this question does not fall in the purview of this forum.
I was going through the gnomAD constraint calculation code (https://github.com/broadinstitute/gnomad_lof/blob/master/constraint_utils/constraint_basics.py line 318) and had some questions regarding the calculation of the mutation probabilities.
I do not know the shape of the data inside genome_ht and context_ht as I have never used hail.
My question is: How are the probabilites of a base mutation calculated? I find supplementary documentation vague when they talk about its calculation.
There are 3 ways that I have narrowed down the calculation to, here they are:
A= no. of AAA>ATA mutations in the whole genome.
B= Total no. of AAA context mutations in the whole genome. This is basically AAA>ATA + AAA>ACA + AAA>AGA mutations.
C= No. of times AAA occurs in the whole genome. In the sequence AAAAA, AAA occurs 3 times.
Which of the following is the correct equation for the calculation of probability of AAA>ATA mutation?
Also, whats the logic behind the correction factor to calculate mu from these probabilities?
Thanks a lot
Beta Was this translation helpful? Give feedback.
All reactions