-
Hello everyone. cutlass::epilogue::NoSmemWarpSpecialized and put it as 4th template parameter. But after I changed OperatorClass to be arch::OpClassSimt everything is broken. Documentation is very lacking in some list of possible values for parameters to be chosen. I've been trying really hard to wrap my head around gemm_api_3x.md, but I'm stuck. I've I wasn't able to tinker parameters so that template magic turn out to be correct. Is it even possible to use new APIs with old cards like Volta?
|
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 4 replies
-
We only support Hopper with the 3.x API in full, as documented in our readme file. Support for Volta and Ampere is via the 2.x API. That said the CuTe atoms for every architecture starting with Pascal are provided, so you can still build custom kernels using it if you want. You can use the CuTe tutorials as a reference to build those on top of. |
Beta Was this translation helpful? Give feedback.
Yes. Please see https://github.com/NVIDIA/cutlass/blob/main/test/unit/gemm/device/default_gemm_configuration.hpp for inspiration. A similar template config can be used for Volta/Turing and they should just work OOTB. We have some of these kernels internally that maybe @ccecka and I can work on upstreaming as single file examples in the future