adding a nan sanitizer similar to our int sanitizer #6009

adamomainz · 2025-02-24T20:06:38Z

Hey All - We have had a few debugging issues with nan values popping up that are extremely difficult to find. Adding a nan sanitizer to make everyones life a bit easier.

New contributor declaration

[x ] I am not making a trivial change, such as fixing a typo in a comment.
[x ] I have written a PR description following these
rules.
[x ] I have run pre-commit run --from-ref origin/main --to-ref HEAD.
Select one of the following.
- [ x] I have added tests.
  - /test for lit tests
  - /unittest for C++ tests
  - /python/test for end-to-end tests
- [x ] This PR does not need a test because ``.
Select one of the following.
- [ x] I have not added any lit tests.
- The lit tests I have added follow these best practices,
  including the "tests should be minimal" section. (Usually running Python code
  and using the instructions it generates is not minimal.)

Jokeren · 2025-02-24T20:39:05Z

python/triton/runtime/interpreter.py

@@ -195,7 +208,10 @@ def _convert_float(input, input_dtype, output_dtype, rounding_mode):
        significand[subnormal_index] = (significand[subnormal_index] << bit_pos[subnormal_index]) & (
            (1 << input_dtype.fp_mantissa_width) - 1)
    # Prevent overflow and underflow
-    exponent_output = np.maximum(0, np.minimum((exponent - bias_input + bias_output), (1 << output_exponent_width) - 1))


Too many diffs. Maybe you should run pre-commit

Apologies for the extra changes in here as well pre commit decided to do more linting than I like

That's not expected actually

agree let me remove pre-commit and try again

@Jokeren apologies for the delay! This is finally cleaned up

embg · 2025-02-24T23:48:08Z

python/triton/language/semantic.py

+    """
+    if not builder.options.sanitize_nan:
+        return
+    input_cond = and_(equal(lhs,lhs,builder), equal(rhs,rhs,builder), builder)


Neat trick to detect NaN values!

pawelszczerbuk · 2025-02-26T16:09:12Z

python/triton/language/semantic.py

        return tl.tensor(builder.create_fsub(input.handle, other.handle), input.type)
    # int - int
    elif scalar_ty.is_int():
+        if sanitize_nan:


Probably not needed for ints?

yeah good point :)

pawelszczerbuk · 2025-02-26T16:11:15Z

python/triton/language/semantic.py

    input, other = binary_op_type_checking_impl(input, other, builder)
    scalar_ty = input.type.scalar
+    if sanitize_nan:


Is there a reason why this check is in common path while in other functions it is after checking for int/float type?

this is the only path where we dont have a ptr check so I thought it would be cleaner to just do the check here seeing as though in both cases below we are currently checking

Would it make sense to put it everywhere under if float for consistency?

agreed especially after your comment about not needing this for ints. Also considering moving this to some ops that have a higher chance of NaN ie exponentials, sqrt, pow etc. what do you think?

Agree, the set of ops that may require this is most likely different than for overflow.

adding a nan sanitizer similar to our int sanitizer

0e56f58

adamomainz requested review from antiagainst, zhanglx13 and ptillet as code owners February 24, 2025 20:06

adding test

aebb7b5

Jokeren reviewed Feb 24, 2025

View reviewed changes

adamomainz and others added 6 commits February 24, 2025 13:17

cleaning up from precommit

be87023

cleaning up from precommit

d358a5a

cleaning up from precommit

db8d014

Merge branch 'main' into main

29217b1

cleaning up from precommit

7d805bf

all clean!

45ccc9c

embg reviewed Feb 24, 2025

View reviewed changes

pawelszczerbuk reviewed Feb 26, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

adding a nan sanitizer similar to our int sanitizer #6009

adding a nan sanitizer similar to our int sanitizer #6009

adamomainz commented Feb 24, 2025 •

edited

Loading

Jokeren Feb 24, 2025

Jokeren Feb 24, 2025

adamomainz Feb 24, 2025

adamomainz Feb 24, 2025

embg Feb 24, 2025

pawelszczerbuk Feb 26, 2025

adamomainz Feb 26, 2025

pawelszczerbuk Feb 26, 2025

adamomainz Feb 26, 2025

pawelszczerbuk Feb 26, 2025

adamomainz Feb 26, 2025 •

edited

Loading

pawelszczerbuk Feb 26, 2025

adding a nan sanitizer similar to our int sanitizer #6009

Are you sure you want to change the base?

adding a nan sanitizer similar to our int sanitizer #6009

Conversation

adamomainz commented Feb 24, 2025 • edited Loading

New contributor declaration

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

adamomainz Feb 26, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

adamomainz commented Feb 24, 2025 •

edited

Loading

adamomainz Feb 26, 2025 •

edited

Loading