【LLM Inference-0】Add Split MoE Op && Add Group MoE #69687
base: develop
Conversation
Your PR was submitted successfully. Thank you for your contribution to this open source project!
PR Category
Inference
PR Types
Others
Description
card-71500
1. Add split MoE operators, which make precision alignment easier; the grouped softmax operation is merged into a single moe_dispatch operator, and precision has been verified to align.
2. Add support for the group_moe feature and a new group_moe interface (see the gating sketch after this list).
3. Add precision unit tests for the split operators under group MoE.
4. Add unit tests comparing the split operators against the previous baseline under different precisions.
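
For context, grouped-softmax gating normalizes router scores within each expert group before experts are selected and tokens are dispatched. The snippet below is a minimal NumPy sketch of that general technique only; it assumes experts are split evenly into `num_groups` groups and each token is routed to `top_k` experts. It is not the implementation of the moe_dispatch or group_moe operators added in this PR, whose actual signatures are not shown here.

```python
# Minimal NumPy sketch of grouped-softmax gating for group MoE routing.
# NOTE: illustrative only; `num_groups` and `top_k` are assumed parameters,
# not the real operator signature from this PR.
import numpy as np

def group_softmax_topk(gate_logits, num_groups, top_k):
    """Softmax within each expert group, then pick top-k experts per token.

    gate_logits: [num_tokens, num_experts] raw router scores.
    num_groups:  experts are split evenly into this many groups.
    top_k:       number of experts each token is routed to.
    """
    num_tokens, num_experts = gate_logits.shape
    assert num_experts % num_groups == 0
    group_size = num_experts // num_groups

    # Softmax normalized independently inside each group of experts.
    grouped = gate_logits.reshape(num_tokens, num_groups, group_size)
    grouped = grouped - grouped.max(axis=-1, keepdims=True)
    probs = np.exp(grouped)
    probs = probs / probs.sum(axis=-1, keepdims=True)
    probs = probs.reshape(num_tokens, num_experts)

    # Top-k expert ids and their group-normalized weights per token.
    topk_ids = np.argsort(-probs, axis=-1)[:, :top_k]
    topk_weights = np.take_along_axis(probs, topk_ids, axis=-1)
    return topk_ids, topk_weights

# Example: 4 tokens, 8 experts in 2 groups, each token routed to 2 experts.
logits = np.random.randn(4, 8).astype(np.float32)
ids, weights = group_softmax_topk(logits, num_groups=2, top_k=2)
print(ids.shape, weights.shape)  # (4, 2) (4, 2)
```

Folding this normalization into the dispatch step (as the PR's merged moe_dispatch operator reportedly does) avoids a separate grouped-softmax kernel launch and gives a single point against which precision can be compared with the baseline.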