FDR calculation does not work when I lower the mass range #42

LiaSerrano · 2022-09-26T01:58:19Z

Hello,

I noticed a pattern in when the FDR calculation works and when it does not. When I lower the bottom of the MS2 m/z range from 300 to 150, I get the error shown below and the resulting FDR outputs are blank. I reproduced this with three different pairs of raw files where the only difference was the lower mass limit. Is there an obvious reason for this?

Thanks!
Lia

    data = group_nodes_with_same_edge(data)
  File "C:\Users\lrserrano\Anaconda3\envs\csod\lib\site-packages\csodiaq\idpicker.py", line 23, in group_nodes_with_same_edge
    if first: l1, l2 = map(list,zip(*data))
ValueError: not enough values to unpack (expected 2, got 0)

jessegmeyerlab · 2022-09-26T15:22:54Z

Thanks Lia,

It looks like this error keeps coming up in different scenarios. Based on the traceback, it appears to be a problem in protein inference. Without having investigated, I suspect it comes up when there are no significant proteins to group. For example, when you lower the m/z range you may get more decoy hits, and if those now land in the top 100 proteins, none would be significant below 1% FDR. Does that seem possible? How many protein hits did you have before expanding the fragment range?
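The `ValueError` in the traceback is what Python raises when the result of `zip(*data)` is unpacked into two names and `data` is empty. Here is a minimal sketch of that failure and one possible guard. It is independent of CsoDIAq's actual data structures: the `split_edges` helper and the sample pairs are hypothetical, and it is not a patch for `idpicker.py`.

```python
def split_edges(data):
    """Split (peptide, protein) pairs into two parallel lists.

    Hypothetical helper: returns two empty lists for empty input
    instead of failing on the two-name unpacking.
    """
    if not data:
        return [], []
    left, right = map(list, zip(*data))
    return left, right


# Reproduce the error from the traceback with an empty list of pairs.
try:
    l1, l2 = map(list, zip(*[]))
except ValueError as err:
    message = str(err)

# The guarded helper handles both the populated and the empty case.
peptides, proteins = split_edges([("PEPTIDEK", "P12345"), ("SAMPLER", "Q67890")])
empty_peptides, empty_proteins = split_edges([])
```

If the hypothesis above holds, the list reaching `group_nodes_with_same_edge` would be empty because nothing passed the protein-level filter, which would explain both the exception and the blank FDR outputs.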

@CCranney, would you have time to help us investigate this error? I think there is also a second issue open with this same error.

LiaSerrano · 2022-10-10T01:46:12Z

Hi @jgmeyerucsd

I just checked on that. Yes, the results with the lower m/z range have ~1K more decoys.

LiaSerrano · 2022-10-17T16:39:35Z

How can this be explained when there is only 1 decoy hit in the unfiltered output?

jessegmeyerlab · 2022-10-17T16:42:09Z

Is that one decoy within the top 100 proteins if you sort proteins by the best scoring peptide?
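One way to check this by hand is sketched below. It makes several assumptions: peptide hits are available as `(protein, score)` pairs, a higher score is better, and decoys carry a `DECOY` name prefix. The function name and the sample data are hypothetical and do not reflect CsoDIAq's output format.

```python
def decoys_in_top_proteins(peptide_hits, top_n=100, decoy_prefix="DECOY"):
    """Return the decoy proteins that rank within the top_n proteins.

    peptide_hits: iterable of (protein_name, score) pairs, where a
    higher score is assumed to be better. Each protein is ranked by
    its best-scoring peptide.
    """
    best = {}
    for protein, score in peptide_hits:
        if protein not in best or score > best[protein]:
            best[protein] = score
    ranked = sorted(best, key=best.get, reverse=True)
    return [p for p in ranked[:top_n] if p.startswith(decoy_prefix)]


# Made-up hits for illustration only.
hits = [
    ("P1", 0.95),
    ("DECOY_P2", 0.90),
    ("P3", 0.80),
    ("P1", 0.60),
    ("DECOY_P4", 0.10),
]
top3_decoys = decoys_in_top_proteins(hits, top_n=3)
top1_decoys = decoys_in_top_proteins(hits, top_n=1)
```

With real data, `top_n=100` matches the cutoff discussed in this thread. If the score is one where lower is better, the sort direction would need to be flipped.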

LiaSerrano · 2022-10-17T16:47:03Z

No, it is not.

LiaSerrano · 2022-10-17T16:49:15Z

Is there a way to see the most likely protein ID from idPicker without the protein FDR filter applied, using just the 1% peptide FDR list?

jessegmeyerlab · 2022-10-17T17:14:45Z

Since I don't think we require a FASTA input, I believe it looks back at your spectral library to get the protein assignment. Maybe the format of the protein names in your spectral library file differs from the names used in our example human.tsv TraML library, and that is confusing the protein grouping code? Worst case, you could do this manually with a script in R or Python by loading the spectral library and looking up the proteins for your peptide hits.
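A rough sketch of that manual lookup in Python follows. It assumes a tab-separated spectral library with one column holding the peptide sequence and another holding the protein name. The column names `PeptideSequence` and `ProteinName`, the function name, and the tiny in-memory library are assumptions for illustration, so adjust them to match your file. It only reports which proteins each peptide maps to, and does not perform the protein grouping that the idpicker step does.

```python
import csv
import io
from collections import defaultdict


def build_peptide_to_proteins(library_file,
                              peptide_col="PeptideSequence",
                              protein_col="ProteinName"):
    """Map each peptide in a tab-separated library to its protein names.

    Column names are assumptions; change them to match the library.
    """
    mapping = defaultdict(set)
    reader = csv.DictReader(library_file, delimiter="\t")
    for row in reader:
        mapping[row[peptide_col]].add(row[protein_col])
    return mapping


# Tiny made-up library; in practice, pass open("your_library.tsv", newline="").
library_text = (
    "PeptideSequence\tProteinName\tPrecursorMz\n"
    "PEPTIDEK\tP12345\t450.2\n"
    "PEPTIDEK\tP12345\t450.2\n"
    "SHAREDR\tP12345\t300.1\n"
    "SHAREDR\tQ67890\t300.1\n"
)
mapping = build_peptide_to_proteins(io.StringIO(library_text))

# Peptides passing the 1% peptide FDR filter (made-up list).
peptide_hits = ["PEPTIDEK", "SHAREDR", "MISSINGK"]
assignments = {pep: sorted(mapping.get(pep, ())) for pep in peptide_hits}
```

Peptides that map to more than one protein (like `SHAREDR` here) are the ambiguous cases that protein inference would normally resolve. Peptides with an empty list were not found in the library under that exact sequence string.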

@CCranney wrote the code for this and has since left the lab to start his MS degree. We are having trouble understanding his implementation because there are not many comments. If he does not have time to look at this, unfortunately it will likely be a few months before we have hired more people to help.
