
initial support blackwell #487

Merged · 1 commit · Jan 25, 2025

Conversation

@johnnynunez (Contributor)

- 10.0: Blackwell B100/B200
- 12.0: Blackwell RTX 50 series
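The gist of the PR is adding the two new Blackwell compute capabilities, gated on the CUDA toolkit version that introduces them. A minimal sketch of that idea in a setup.py-style build script (the helper name and baseline list below are illustrative assumptions, not tiny-cuda-nn's actual code):

```python
# Hypothetical sketch: extend the supported compute capabilities when the
# installed CUDA toolkit is new enough to know about Blackwell.
def supported_compute_capabilities(cuda_major: int, cuda_minor: int) -> list[int]:
    # Baseline list of pre-Blackwell architectures (illustrative values).
    caps = [70, 72, 75, 80, 86, 87, 89, 90]
    if (cuda_major, cuda_minor) >= (12, 8):
        # CUDA 12.8 adds sm_100 (B100/B200) and sm_120 (RTX 50 series).
        caps += [100, 120]
    return caps
```

Tuple comparison keeps the version check correct across major bumps, e.g. `(12, 10) >= (12, 8)` and `(13, 0) >= (12, 8)` are both true.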

@Tom94 (Collaborator) commented Jan 22, 2025

Hi, thanks a bunch for the PR!

Any chance you could also update the list of supported compute capabilities at the top of this file as well as the min/max compute capability conditions here?
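The min/max compute capability conditions Tom94 refers to would, in spirit, clamp or filter requested targets against the range the kernels support. A small sketch under that assumption (constants and names are illustrative, not tiny-cuda-nn's actual values):

```python
# Hypothetical min/max compute-capability bounds after adding Blackwell.
MIN_COMPUTE_CAPABILITY = 70   # oldest architecture the kernels support
MAX_COMPUTE_CAPABILITY = 120  # assumed new ceiling once sm_120 lands

def filter_targets(requested: list[int]) -> list[int]:
    """Keep only compute capabilities inside the supported range."""
    return [
        cc for cc in requested
        if MIN_COMPUTE_CAPABILITY <= cc <= MAX_COMPUTE_CAPABILITY
    ]
```

With bounds like these, a request for `[60, 75, 120, 130]` would build only for 75 and 120.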

@johnnynunez (Contributor, Author)

> Hi, thanks a bunch for the PR!
>
> Any chance you could also update the list of supported compute capabilities at the top of this file as well as the min/max compute capability conditions here?

Yeah for sure, I will add it

@johnnynunez (Contributor, Author)

Done @Tom94

@johnnynunez
Copy link
Contributor Author

It's failing as expected because CUDA 12.8 isn't public yet...

[Review comment on bindings/torch/setup.py (outdated, resolved)]
@Tom94 (Collaborator) commented Jan 23, 2025

Wait, I thought this PR was based on already available new CUDA versions. Are you saying the 12.8 version is pure speculation?

I tried searching for it just now and couldn't find anything. Also what's your source for the version 13.0 condition you added to the minimum compute capability?

@johnnynunez (Contributor, Author)

> I tried searching for it just now and couldn't find anything. Also what's your source for the version 13.0 condition you added to the minimum compute capability?

13.0 is speculation, but the rest is true. I attach a reference here:
Dao-AILab/flash-attention#1436

@Tom94 (Collaborator) commented Jan 23, 2025

I'll hold off on updating tiny-cuda-nn until there's something official then.

@johnnynunez (Contributor, Author) commented Jan 23, 2025

> I'll hold off on updating tiny-cuda-nn until there's something official then.

Perfect, the PR will be ready when CUDA 12.8 comes out! Thanks for this library :)

@johnnynunez (Contributor, Author) commented Jan 23, 2025

CUDA 12.8 is out.
PTX ISA 8.7 documentation: https://docs.nvidia.com/cuda/pdf/ptx_isa_8.7.pdf
[image attachment]
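Once CUDA 12.8 is actually installed, a build script can detect it by parsing `nvcc --version` output. A minimal sketch of that parse (the helper name and sample string are assumptions for illustration; the `release X.Y` phrasing does appear in real nvcc output):

```python
# Hypothetical helper: extract the CUDA release from `nvcc --version` output
# so a build script can decide whether Blackwell targets are available.
import re

def cuda_release(nvcc_version_output: str) -> tuple[int, int]:
    m = re.search(r"release (\d+)\.(\d+)", nvcc_version_output)
    if m is None:
        raise ValueError("could not parse nvcc version output")
    return int(m.group(1)), int(m.group(2))

sample = "Cuda compilation tools, release 12.8, V12.8.61"
print(cuda_release(sample))  # -> (12, 8)
```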

@johnnynunez (Contributor, Author)

@Tom94 merge :)

@johnnynunez (Contributor, Author)

@Tom94 nvcc warning : Support for offline compilation for architectures prior to '<compute/sm/lto>_75' will be removed in a future release (Use -Wno-deprecated-gpu-targets to suppress warning).

@johnnynunez (Contributor, Author)

> @Tom94 nvcc warning : Support for offline compilation for architectures prior to '&lt;compute/sm/lto&gt;_75' will be removed in a future release (Use -Wno-deprecated-gpu-targets to suppress warning).

Oh yes... only GPUs with tensor cores (Turing and newer) will be supported.
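The warning above is about pre-Turing (below sm_75) offline compilation being deprecated. A build script can either drop those targets or pass the real nvcc flag `-Wno-deprecated-gpu-targets` to silence the warning; a sketch of both options (the helper is hypothetical, the flag names are real nvcc options):

```python
# Illustrative sketch: emit nvcc architecture flags, skipping deprecated
# pre-Turing targets and suppressing the deprecation warning as a fallback.
def nvcc_arch_flags(compute_capabilities: list[int]) -> list[str]:
    flags = []
    for cc in sorted(compute_capabilities):
        if cc < 75:
            # Offline compilation below sm_75 is deprecated; skip the target.
            continue
        flags.append(f"-gencode=arch=compute_{cc},code=sm_{cc}")
    # Keep the build log clean if a deprecated target slips through elsewhere.
    flags.append("-Wno-deprecated-gpu-targets")
    return flags
```

For example, requesting `[70, 75, 120]` would compile only for sm_75 and sm_120.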

@Tom94 (Collaborator) commented Jan 24, 2025

Please still remove the 13.0 part. We've got no clear info which precise version will actually drop support for <75, even if it is deprecated. Thanks.

@johnnynunez (Contributor, Author)

> Please still remove the 13.0 part. We've got no clear info which precise version will actually drop support for <75, even if it is deprecated. Thanks.

Done!

@johnnynunez (Contributor, Author)

@Tom94 fixed typo!

Tom94 merged commit 394bfd2 into NVlabs:master on Jan 25, 2025 (21 checks passed).
@johnnynunez (Contributor, Author)

[image attachment]
