Сomparing large numbers doesn't work #181

partizanes · 2023-12-11T13:25:54Z

python3.8, python3.9

Test code:

from dictdiffer import diff

dict1 = dict({'test': {'serial_number': 22570409781991170591038650551}})
dict2 = dict({'test': {'serial_number': 22570409781991170591038650552}})
print(list(diff(dict1, dict2)))

Return empty list.

The problem was using math.isclose which produces false results:

def are_different(first, second, tolerance, absolute_tolerance=None):
    """Check if 2 values are different.

    In case of numerical values, the tolerance is used to check if the values
    are different.
    In all other cases, the difference is straight forward.
    """
    if first == second:
        # values are same - simple case
        return False

    first_is_nan, second_is_nan = bool(first != first), bool(second != second)

    if first_is_nan or second_is_nan:
        # two 'NaN' values are not different (see issue #114)
        return not (first_is_nan and second_is_nan)
    elif isinstance(first, num_types) and isinstance(second, num_types):
        # two numerical values are compared with tolerance
        return not math.isclose(
            first,
            second,
            rel_tol=tolerance or 0,
            abs_tol=absolute_tolerance or 0,
        )
    # we got different values
    return True

Test example:

import math  

print(math.isclose(22570409781991170591038650551, 22570409781991170591038650552, rel_tol=0.0, abs_tol=0.0)) # TRUE
print(22570409781991170591038650551 == 22570409781991170591038650552)                                       # FALSE

The text was updated successfully, but these errors were encountered:

dbrakman · 2024-03-30T02:54:21Z

The standard library says here that rel_tol must be greater than 0.

In your specific example, dictdiffer can produce the desired output with the following:

from decimal import Decimal
from dictdiffer import diff

dict1 = dict({'test': {'serial_number': 22570409781991170591038650551}})
dict2 = dict({'test': {'serial_number': 22570409781991170591038650552}})
print(list(diff(dict1, dict2, tolerance=Decimal(f"1e-{sys.get_int_max_str_digits()}"))))

However, since python3 has no maximum integer, I think a general solution for allowing requiring exact equality between integers would require a change to the library. I was looking at the same code in wondering how to dictdiffer should compare Decimal(1.99) to Decimal("1.99"), and after looking further into the subclasses of number.Number came to the conclusion that tolerating floating-point imprecision is appropriate only when both operands are floats.

This is a judgement call for the maintainers that is arguably a breaking change, but I think it's consistent with the docstring and all the existing tests and that the alternative is a can of worms with too many edge cases. Proposed #183

partizanes added the bug label Dec 11, 2023

dbrakman mentioned this issue Mar 30, 2024

diff: allow tolerance only between floats #183

Open

10 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Сomparing large numbers doesn't work #181

Сomparing large numbers doesn't work #181

partizanes commented Dec 11, 2023

dbrakman commented Mar 30, 2024

Сomparing large numbers doesn't work #181

Сomparing large numbers doesn't work #181

Comments

partizanes commented Dec 11, 2023

dbrakman commented Mar 30, 2024