Fix permute_multi_embedding kernel #3227

xw285cornell · 2024-10-05T18:06:39Z

Summary:
X-link: https://github.com/facebookresearch/FBGEMM/pull/325

Looks like a typo to use permute_id = threadIdx.y + blockIdx.x * blockDim.x which should be blockDim.y. This doesn't affect Nvidia because blockDim.x and y are both 32 (32 threads per warp + 32 warps). For AMD GPU, blockDim.x is 64 and blockDim.y is 16, causing numerical issues.

Reviewed By: leitian, jianyuh, joebos

Differential Revision: D63936776

Summary: X-link: facebookresearch/FBGEMM#325 Looks like a typo to use `permute_id = threadIdx.y + blockIdx.x * blockDim.x` which should be `blockDim.y`. This doesn't affect Nvidia because blockDim.x and y are both 32 (32 threads per warp + 32 warps). For AMD GPU, blockDim.x is 64 and blockDim.y is 16, causing numerical issues. Reviewed By: leitian, jianyuh, joebos Differential Revision: D63936776

facebook-github-bot · 2024-10-05T18:06:52Z

This pull request was exported from Phabricator. Differential Revision: D63936776

netlify · 2024-10-05T18:06:57Z

✅ Deploy Preview for pytorch-fbgemm-docs ready!

Name	Link
🔨 Latest commit	`657e566`
🔍 Latest deploy log	https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/67018032f7187300083b9bab
😎 Deploy Preview	https://deploy-preview-3227--pytorch-fbgemm-docs.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

facebook-github-bot · 2024-10-05T22:55:54Z

This pull request has been merged in 1815f89.

facebook-github-bot added the cla signed label Oct 5, 2024

facebook-github-bot added the fb-exported label Oct 5, 2024

facebook-github-bot closed this in 1815f89 Oct 5, 2024

facebook-github-bot added the Merged label Oct 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix permute_multi_embedding kernel #3227

Fix permute_multi_embedding kernel #3227

xw285cornell commented Oct 5, 2024

facebook-github-bot commented Oct 5, 2024

netlify bot commented Oct 5, 2024 •

edited

Loading

facebook-github-bot commented Oct 5, 2024

Fix permute_multi_embedding kernel #3227

Fix permute_multi_embedding kernel #3227

Conversation

xw285cornell commented Oct 5, 2024

facebook-github-bot commented Oct 5, 2024

netlify bot commented Oct 5, 2024 • edited Loading

✅ Deploy Preview for pytorch-fbgemm-docs ready!

facebook-github-bot commented Oct 5, 2024

netlify bot commented Oct 5, 2024 •

edited

Loading