IOByte computation in benchmarks #3721

Open
Priya2698 opened this issue Jan 16, 2025 · 3 comments

Comments

@Priya2698
Collaborator

Priya2698 commented Jan 16, 2025

We currently use the inputs and outputs consumed by the nvFuser definitions as the reference for the IOBytes computation for all executors (a sketch of this accounting follows the list).
This has certain limitations:

  1. It requires manual effort to identify the reference IOBytes from nvFuser definitions when adding Thunder-nvfuser benchmarks (PR rope_benchmark #3550).
  2. There is also the possibility of the IOBytes being different between executors (torch.compile, eager and Thunder).
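
For concreteness, here is a minimal sketch of that accounting, assuming IOBytes is simply `numel() * element_size()` summed over the definition's input and output tensors. The helper name and the deduplication detail are hypothetical, not the actual benchmark code:

```python
import torch

def reference_iobytes(inputs, outputs):
    # Hypothetical helper: bytes read (inputs) plus bytes written (outputs),
    # counting each distinct tensor buffer once. This mirrors taking the
    # numel * element_size total from the nvFuser definition's I/O and
    # reusing it as the reference for every executor.
    seen, total = set(), 0
    for t in list(inputs) + list(outputs):
        if isinstance(t, torch.Tensor) and t.data_ptr() not in seen:
            seen.add(t.data_ptr())
            total += t.numel() * t.element_size()
    return total
```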
@naoyam
Collaborator

naoyam commented Jan 21, 2025

> There is also the possibility of the IOBytes being different between executors (torch.compile, eager and Thunder).

Why is this?

@Priya2698
Collaborator Author

> > There is also the possibility of the IOBytes being different between executors (torch.compile, eager and Thunder).
>
> Why is this?

If the executors save different variables for the backward pass, or choose to rematerialize some intermediate variables, that strategy may differ across executors, particularly for larger fusions. I don't think we see this in our current benchmarks, though.
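
As an illustration, the set of tensors an executor saves for backward can be observed in plain PyTorch with `torch.autograd.graph.saved_tensors_hooks`. This is a minimal sketch, not benchmark code, and the measured function is an arbitrary example:

```python
import torch

def saved_for_backward_bytes(fn, *args):
    # Sketch: count the bytes of distinct tensors saved for the backward
    # pass while running fn. Executors that save different intermediates
    # (or rematerialize them instead) would report different totals, which
    # is what can make the effective IOBytes diverge between executors.
    seen, total = set(), 0

    def pack(t):
        nonlocal total
        if t.data_ptr() not in seen:
            seen.add(t.data_ptr())
            total += t.numel() * t.element_size()
        return t

    with torch.autograd.graph.saved_tensors_hooks(pack, lambda t: t):
        out = fn(*args)
    return out, total

x = torch.randn(1024, 1024, requires_grad=True)
_, nbytes = saved_for_backward_bytes(lambda t: torch.softmax(t @ t, dim=-1), x)
print(f"saved for backward: {nbytes} bytes")
```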

@naoyam
Collaborator

naoyam commented Jan 22, 2025

> > > There is also the possibility of the IOBytes being different between executors (torch.compile, eager and Thunder).
> >
> > Why is this?
>
> If the executors save different variables for the backward pass, or choose to rematerialize some intermediate variables, that strategy may differ across executors, particularly for larger fusions. I don't think we see this in our current benchmarks, though.

If that's the case, I wonder whether it still makes sense to compare performance between the backends, since it wouldn't be an apples-to-apples comparison.

Can this happen only between the torch.compile and thunder backends, but not between the thunder-torch.compile and thunder backends?
