cutlass 3.x vs cute: which one to use? #1710

mgrabban · 2024-08-13T18:55:34Z

I'm new to cutlass and am currently exploring both cutlass and cute.

My question is how one is supposed to use cute. Say I want to write an efficient GEMM for the latest GPU(s): should I use cutlass 3.x, cute, or both? What is the preferred way of writing brand-new GEMM and related kernels in cutlass?

My understanding is that cutlass 3.x provides higher-level convenience APIs, while cute, judging by its tutorial examples, seems lower level than the cutlass 3.x GEMM APIs.

Do I use cutlass 3.x? If so, what is the purpose of cute? If cute is the preferred way, can I just ignore cutlass 3.x completely? cute does seem low level. I also notice that all the cute examples are pure GEMMs, meaning I do not see any examples that have meaningful epilogues, such as adding a vector bias or applying an activation (relu, gelu, etc.).
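To make "meaningful epilogue" concrete, here is a minimal CPU reference sketch of the computation I have in mind: a GEMM followed by a per-column bias add and a relu. It is plain C++, not cutlass or cute code. The function name `gemm_bias_relu`, the row-major layouts, and the choice of a per-column (rather than per-row) bias are assumptions made only for illustration.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical CPU reference: D = relu(A * B + bias).
// A is M x K, B is K x N, D is M x N, all row-major (assumed layout).
// bias has N entries, one per output column (assumed convention).
std::vector<float> gemm_bias_relu(const std::vector<float>& A,
                                  const std::vector<float>& B,
                                  const std::vector<float>& bias,
                                  std::size_t M, std::size_t N, std::size_t K) {
    std::vector<float> D(M * N);
    for (std::size_t m = 0; m < M; ++m) {
        for (std::size_t n = 0; n < N; ++n) {
            // "Mainloop": accumulate the dot product over K.
            float acc = 0.0f;
            for (std::size_t k = 0; k < K; ++k) {
                acc += A[m * K + k] * B[k * N + n];
            }
            // "Epilogue": add the bias, apply relu, then store.
            D[m * N + n] = std::max(acc + bias[n], 0.0f);
        }
    }
    return D;
}
```

In a fused GPU kernel the epilogue step would be applied to the accumulators before they are written to global memory, which is the part I do not see covered in the cute examples.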

Are there plans to add more cute examples?

Thank you.

thakkarV (Collaborator) · 2024-08-13T19:18:45Z

Use whatever floats your boat and gets you to the solution in an expedient manner. There is no one-size-fits-all guideline here.
