[ONNX][TORCH] Add Onnx->Linalg lowering for RotaryEmbedding Op #4002

vivekkhandelwal1 · 2025-02-05T14:10:28Z

This commit adds the Onnx->Linalg lowering for Onnx's RotaryEmbedding op (ref: https://github.com/microsoft/onnxruntime/blob/main/docs/ContribOperators.md#commicrosoftrotaryembedding) by registering a customized torch op named OnnxVariantAtenRotaryEmbeddingOp. This is done so that the Onnx's RotaryEmbedding op can be lowered to this op and this op can be lowered from Torch->Linalg.

The lowering has been adopted from the OnnxRuntime. Files for references:
1.) https://github.com/microsoft/onnxruntime/blob/e1e3f623f61816008e79dddc91a51ffe7f0ff5cf/onnxruntime/contrib_ops/cpu/bert/rotary_embedding.cc#L47-L93
2.) https://github.com/microsoft/onnxruntime/blob/94c69f55d480cb4a8dcbc161d29ef3acca9392a7/onnxruntime/contrib_ops/cpu/bert/rotary_embedding_helper.h

Signed-off-by: Vivek Khandelwal [email protected]

AmosLewis · 2025-02-07T03:01:37Z

We need a test in https://github.com/nod-ai/SHARK-TestSuite/tree/main/alt_e2eshark/onnx_tests/operators to verify the numeric before merge

lib/Conversion/TorchToLinalg/Uncategorized.cpp

vivekkhandelwal1 · 2025-02-10T10:12:16Z

We need a test in https://github.com/nod-ai/SHARK-TestSuite/tree/main/alt_e2eshark/onnx_tests/operators to verify the numeric before merge

Actually, the test in SHARK-Testsuite is not working since the op comes from "com.microsoft" domain. Alhtough, I have verified the e2e correctness of lowering by manually generating the IR and then compiling and executing it with the IREE.

zjgarvey

A few small comments. I haven't double checked that the implementation is correct.

include/torch-mlir/Dialect/Torch/IR/TorchOps.td

lib/Conversion/TorchToLinalg/Uncategorized.cpp

test/Conversion/TorchToLinalg/basic.mlir

lib/Conversion/TorchOnnxToTorch/DefaultDomainQtoZ.cpp

Add custom parser and printer for the op Move the op lowering to a seperate code file for com.microsoft domain ops

vivekkhandelwal1 added 5 commits February 5, 2025 14:12

Initial commit for rotary embedding lowering

f551360

Add Torch->Linalg for rotary embedding

7b548c5

Add lowering for rotary embedding - rough patch

c96d05c

Add check inputs to rot_emb lowering

5d8d8d6

Add test for rotary embedding

846fbea

vivekkhandelwal1 force-pushed the rotary-embedding branch from 20809ca to 846fbea Compare February 5, 2025 14:13

Rebase fix

6b40a15

vivekkhandelwal1 force-pushed the rotary-embedding branch from a788784 to 6b40a15 Compare February 5, 2025 14:16

vivekkhandelwal1 requested review from zjgarvey, rsuderman and AmosLewis February 5, 2025 14:17

AmosLewis reviewed Feb 7, 2025

View reviewed changes

lib/Conversion/TorchToLinalg/Uncategorized.cpp Outdated Show resolved Hide resolved

AmosLewis reviewed Feb 7, 2025

View reviewed changes

vivekkhandelwal1 mentioned this pull request Feb 7, 2025

[ONNX] Add Onnx->Torch lowering for GroupQueryAttention op #4006

Open

Update PR comments

caa9622

vivekkhandelwal1 force-pushed the rotary-embedding branch 2 times, most recently from 16f397a to caa9622 Compare February 10, 2025 10:10

vivekkhandelwal1 requested a review from AmosLewis February 10, 2025 10:11

zjgarvey reviewed Feb 10, 2025

View reviewed changes

Address PR comments

7c72b59

vivekkhandelwal1 force-pushed the rotary-embedding branch from 02705a1 to 7c72b59 Compare February 11, 2025 07:31

vivekkhandelwal1 requested a review from zjgarvey February 11, 2025 07:31

Update onnx lit test

497f565

zjgarvey reviewed Feb 11, 2025

View reviewed changes

lib/Conversion/TorchOnnxToTorch/DefaultDomainQtoZ.cpp Outdated Show resolved Hide resolved

Address PR comments

8275639

Add custom parser and printer for the op Move the op lowering to a seperate code file for com.microsoft domain ops

vivekkhandelwal1 requested a review from zjgarvey February 17, 2025 08:17

Fix parser code

04032da

vivekkhandelwal1 force-pushed the rotary-embedding branch from 35a43fc to 04032da Compare February 17, 2025 09:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ONNX][TORCH] Add Onnx->Linalg lowering for RotaryEmbedding Op #4002

[ONNX][TORCH] Add Onnx->Linalg lowering for RotaryEmbedding Op #4002

vivekkhandelwal1 commented Feb 5, 2025

AmosLewis commented Feb 7, 2025

vivekkhandelwal1 commented Feb 10, 2025

zjgarvey left a comment

[ONNX][TORCH] Add Onnx->Linalg lowering for RotaryEmbedding Op #4002

Are you sure you want to change the base?

[ONNX][TORCH] Add Onnx->Linalg lowering for RotaryEmbedding Op #4002

Conversation

vivekkhandelwal1 commented Feb 5, 2025

AmosLewis commented Feb 7, 2025

vivekkhandelwal1 commented Feb 10, 2025

zjgarvey left a comment

Choose a reason for hiding this comment