Support FP8 grouped GEMM with rowwise scailing #3560

jiawenliu64 · 2025-01-10T18:15:03Z

Summary: This Diff supports FP8 grouped GEMM with rowwise scaling for MoE, and replaces the existing tensorwise with rowwise scaling to achieve better accuracy with similar performance

Differential Revision: D67806685

facebook-github-bot · 2025-01-10T18:15:21Z

This pull request was exported from Phabricator. Differential Revision: D67806685

netlify · 2025-01-10T18:15:21Z

✅ Deploy Preview for pytorch-fbgemm-docs ready!

Name	Link
🔨 Latest commit	`17b656e`
🔍 Latest deploy log	https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/6781660fde43a60008b20d5c
😎 Deploy Preview	https://deploy-preview-3560--pytorch-fbgemm-docs.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

Summary: X-link: facebookresearch/FBGEMM#646 This Diff supports FP8 grouped GEMM with rowwise scaling for MoE, and replaces the existing tensorwise with rowwise scaling to achieve better accuracy with similar performance Reviewed By: jwfromm Differential Revision: D67806685

facebook-github-bot · 2025-01-10T18:25:24Z

This pull request was exported from Phabricator. Differential Revision: D67806685

facebook-github-bot added the cla signed label Jan 10, 2025

facebook-github-bot added the fb-exported label Jan 10, 2025

jiawenliu64 force-pushed the export-D67806685 branch from fc5fd15 to 17b656e Compare January 10, 2025 18:25

jiawenliu64 mentioned this pull request Jan 10, 2025

[EVT] Add support for Row/Col broadcast PtrArray NVIDIA/cutlass#2033

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support FP8 grouped GEMM with rowwise scailing #3560

Support FP8 grouped GEMM with rowwise scailing #3560

jiawenliu64 commented Jan 10, 2025

facebook-github-bot commented Jan 10, 2025

netlify bot commented Jan 10, 2025 •

edited

Loading

facebook-github-bot commented Jan 10, 2025

Support FP8 grouped GEMM with rowwise scailing #3560

Are you sure you want to change the base?

Support FP8 grouped GEMM with rowwise scailing #3560

Conversation

jiawenliu64 commented Jan 10, 2025

facebook-github-bot commented Jan 10, 2025

netlify bot commented Jan 10, 2025 • edited Loading

✅ Deploy Preview for pytorch-fbgemm-docs ready!

facebook-github-bot commented Jan 10, 2025

netlify bot commented Jan 10, 2025 •

edited

Loading