
Fix for NaNs when training DASM with ambiguous sequences #93

Merged: 7 commits from wd-nan-ambig-fix into main, Dec 10, 2024

Conversation

willdumm (Contributor):

The fix is one line: applying a mask that should have been applied from the very beginning, but that previously didn't matter much because the data contained no ambiguities.
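
For intuition, here is a minimal PyTorch sketch of the failure mode (hypothetical shapes and variable names, not netam's actual code): ambiguous or padded sites carry meaningless logits, and if the substitution mask selects them, the cross-entropy loss goes to NaN.

    import torch
    import torch.nn.functional as F

    # Hypothetical setup: 5 sites, 20 amino-acid classes. Site 3 is
    # ambiguous/padded, so its logits are meaningless (-inf here).
    logits = torch.randn(5, 20)
    logits[3] = float("-inf")
    targets = torch.tensor([2, 7, 0, 4, 9])
    aa_subs_indicator = torch.tensor([1.0, 0.0, 1.0, 1.0, 0.0])
    mask = torch.tensor([True, True, True, False, True])  # False = ambiguous

    # Before the fix: ambiguous site 3 is selected and poisons the loss.
    bad = aa_subs_indicator == 1
    print(F.cross_entropy(logits[bad], targets[bad]))              # tensor(nan)

    # After the fix: '& mask' drops ambiguous sites before the loss.
    subs_mask = (aa_subs_indicator == 1) & mask
    print(F.cross_entropy(logits[subs_mask], targets[subs_mask]))  # finite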

This PR also adds parallelized neutral model application, moved to the CPU. This makes some notebooks run much faster and speeds up setup for training Snakemake runs.
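
As a rough sketch of the shape of that change (hypothetical function names, not the actual netam implementation): per-sequence neutral-model application is embarrassingly parallel, and keeping it on the CPU lets it fan out across worker processes without CUDA-context complications.

    from concurrent.futures import ProcessPoolExecutor

    import torch

    def apply_neutral_model(seq):
        # Hypothetical stand-in for the per-sequence neutral-model
        # computation (e.g., per-site mutation rates). Runs on CPU with
        # no gradient tracking, since this is a fixed preprocessing step.
        with torch.no_grad():
            return torch.zeros(len(seq))

    def apply_neutral_model_parallel(sequences, max_workers=None):
        # Each sequence is independent, so an order-preserving process-pool
        # map parallelizes cleanly; CPU tensors pickle across process
        # boundaries, which GPU-resident state does not do well.
        with ProcessPoolExecutor(max_workers=max_workers) as pool:
            return list(pool.map(apply_neutral_model, sequences))

    if __name__ == "__main__":
        rates = apply_neutral_model_parallel(["ACDEFG", "HIKLMNPQ"])
        print([r.shape for r in rates])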

@@ -195,11 +196,10 @@ def loss_of_batch(self, batch):
         # logit space, so we are set up for using the cross entropy loss.
         # However we have to mask out the sites that are not substituted, i.e.
         # the sites for which aa_subs_indicator is 0.
-        subs_mask = aa_subs_indicator == 1
+        subs_mask = (aa_subs_indicator == 1) & mask
willdumm (Contributor, Author):

This is it, the whole NaN/inf fix! Could you have a look around here and make sure this looks reasonable to you, too?

matsen (Contributor):

Nice sleuthing! It seems good to me... am I missing something?

willdumm (Contributor, Author):

I don't think so, just wanted extra eyes on it! Thanks!

willdumm requested a review from matsen on December 10, 2024, 19:06
matsen (Contributor) left a comment:

Awesome!


willdumm merged commit 22c8873 into main on Dec 10, 2024 (2 checks passed).
willdumm deleted the wd-nan-ambig-fix branch on December 10, 2024, 19:46.