IOByte computation in benchmarks #3721
Comments
Why is this?
If the executors save different variables for backward pass or choose to rematerialize any intermediate variables, that strategy may differ across executors, particularly for larger fusions. I don't think we see this in our current benchmarks though.
If that's the case, I wonder whether it still makes sense to compare performance between the backends, because the comparison doesn't seem to be apples to apples. Can this happen only between the
We currently use the inputs and outputs of the nvfuser definitions as the reference for the IOBytes computation for all executors.
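As a rough illustration of what this reference computation amounts to, here is a minimal sketch in plain Python. It is not the benchmark suite's actual code: `TensorSpec`, `DTYPE_SIZE`, `iobytes`, and the tensor names are all hypothetical, and the backward-pass example with a rematerialized mask is an assumed scenario chosen only to show how the reference count can diverge from what another executor really reads and writes.

```python
from dataclasses import dataclass
from math import prod

# Bytes per element for a few common dtypes (illustrative subset, not exhaustive).
DTYPE_SIZE = {"float32": 4, "float16": 2, "bfloat16": 2, "int64": 8}


@dataclass(frozen=True)
class TensorSpec:
    """Hypothetical stand-in for a tensor: only shape and dtype matter here."""
    shape: tuple
    dtype: str

    def nbytes(self) -> int:
        # Number of elements times bytes per element.
        return prod(self.shape) * DTYPE_SIZE[self.dtype]


def iobytes(inputs, outputs) -> int:
    """Total bytes read (inputs) plus bytes written (outputs)."""
    return sum(t.nbytes() for t in inputs) + sum(t.nbytes() for t in outputs)


# Assumed backward pass: one incoming gradient, two saved tensors, one output.
grad_out = TensorSpec((1024, 1024), "float16")
saved_x = TensorSpec((1024, 1024), "float16")
saved_mask = TensorSpec((1024, 1024), "float16")
grad_in = TensorSpec((1024, 1024), "float16")

# Reference count: every tensor the reference definition touches.
reference_bytes = iobytes([grad_out, saved_x, saved_mask], [grad_in])

# An executor that recomputes the mask instead of saving it reads less data.
remat_bytes = iobytes([grad_out, saved_x], [grad_in])

print(reference_bytes, remat_bytes)
```

If the reference figure is used for both executors, the second one is credited with traffic it never performs, so any bandwidth number derived from it would be overstated for that executor.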
This has certain limitations: