Replies: 2 comments
-
Congrats to everyone involved! This work is super cool |
Beta Was this translation helpful? Give feedback.
0 replies
-
@hwu36 , Did we have an explanation of why I remember and understand why |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
In today's GTC talk, we announced 3xTF32 as a new feature to be released with upcoming CUTLASS 2.8. Using Ampere tensor cores to emulate FP32 operations, 3xTF32 matches the accuracy of FP32 instruction with at least 2x throughput.
Beta Was this translation helpful? Give feedback.
All reactions