Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[XLA:GPU] Fix RaggedAllToAllDecomposer when input and output buffers have different sizes. #23256

Merged
merged 1 commit into from
Feb 28, 2025

Conversation

copybara-service[bot]
Copy link

[XLA:GPU] Fix RaggedAllToAllDecomposer when input and output buffers have different sizes.

We were doubling the size of the buffer to be able to use dynamic-update-slice, because by HLO semantics, if the update goes out of bound of the result, the update is not applied at all. The correct solution is to pad to input_size + output_size.

@copybara-service copybara-service bot force-pushed the test_732101740 branch 4 times, most recently from eb189af to 666faa5 Compare February 28, 2025 20:45
…have different sizes.

We were doubling the size of the buffer to be able to use dynamic-update-slice, because by HLO semantics, if the update goes out of bound of the result, the update is not applied at all. The correct solution is to pad to `input_size + output_size`.

PiperOrigin-RevId: 732239941
@copybara-service copybara-service bot merged commit 0c04616 into main Feb 28, 2025
@copybara-service copybara-service bot deleted the test_732101740 branch February 28, 2025 21:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant