[QST] Build for sm100 Blackwell GPUs #2072

Open
phantaurus opened this issue Jan 30, 2025 · 4 comments

Comments

@phantaurus

phantaurus commented Jan 30, 2025

What is your question?
Hello!

I noticed this:
"Note: The NVIDIA Blackwell SM100 architecture used in the datacenter products has a different compute capability than the one underpinning NVIDIA Blackwell GeForce RTX 50 series GPUs. As a result, kernels compiled for Blackwell SM100 architecture with arch conditional features (using sm100a) are not compatible with RTX 50 series GPUs."

When building the CUTLASS examples, I tried both -DCUTLASS_NVCC_ARCHS="100a" and -DCUTLASS_NVCC_ARCHS="100".
With "100", examples such as 70_blackwell_gemm disappeared from the generated Makefile.

Does this mean that non-datacenter sm_100 Blackwell GPUs do not have the new Tensor Core features? If so, do they fall back to Hopper? Can I use the Hopper Tensor Core examples to get max TFLOPS on sm_100 GPUs? Or does this mean CUTLASS currently only supports sm_100a Tensor Core operations?

Thank you so much!
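
One way to read the observation above is that the Blackwell examples are gated on an arch-specific target being present in the arch list. The sketch below is a hypothetical Python model of such a gate, not CUTLASS's actual CMake logic. The names `parse_archs` and `example_enabled`, and the semicolon-separated list format, are assumptions made for illustration.

```python
def parse_archs(value: str) -> list[str]:
    """Split a semicolon-separated arch list such as "90a;100a" (assumed format)."""
    return [a.strip() for a in value.split(";") if a.strip()]


def example_enabled(archs_value: str, required: str = "100a") -> bool:
    """Hypothetical gate: enable an example only if `required` is listed exactly.

    "100" and "100a" are treated as different targets, so "100" alone
    does not satisfy a requirement for "100a".
    """
    return required in parse_archs(archs_value)


print(example_enabled("100a"))      # True
print(example_enabled("100"))       # False
print(example_enabled("90a;100a"))  # True
```

Under this assumption, dropping the `a` suffix removes the arch-specific target from the list, which would match the examples vanishing from the build.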

@phantaurus phantaurus changed the title [QST] Build for Blackwell GPUs [QST] Build for sm100 Blackwell GPUs Jan 30, 2025
@thakkarV
Collaborator

CUTLASS 3.8 only supports sm100a aka datacenter Blackwell chips for now. Support for sm120a will land in a future release.

@phantaurus
Author

@thakkarV Thank you so much for your reply! What about DRIVE Thor which has a compute capability of 10.1? Will they be supported along with sm120a?

@hwu36
Collaborator

hwu36 commented Feb 6, 2025

> What about DRIVE Thor which has a compute capability of 10.1? Will they be supported along with sm120a?

No, kernels compiled for sm101a cannot run on sm120a GPUs.
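
The compatibility rule behind this answer can be sketched in a few lines of Python. This is a simplified model, not an NVIDIA or CUTLASS API: `parse_target` and `can_run` are hypothetical helpers. Code built for an arch-specific target (the `a` suffix) is treated as runnable only on the exact compute capability it names. The branch for targets without the suffix applies the general same-major rule for native binaries and ignores PTX JIT and platform-specific exceptions, so treat it as an assumption.

```python
import re


def parse_target(target: str) -> tuple[int, int, bool]:
    """Parse "sm101a", "sm_100", or "120a" into (major, minor, arch_specific)."""
    m = re.fullmatch(r"(?:sm_?)?(\d+)(\d)(a?)", target.strip().lower())
    if m is None:
        raise ValueError(f"unrecognized target: {target!r}")
    return int(m.group(1)), int(m.group(2)), m.group(3) == "a"


def can_run(compiled_for: str, device_cc: tuple[int, int]) -> bool:
    """Simplified check: can a native binary for `compiled_for` run on `device_cc`?"""
    major, minor, arch_specific = parse_target(compiled_for)
    if arch_specific:
        # Arch-specific targets require an exact compute capability match.
        return (major, minor) == device_cc
    # Assumed general rule: same major version, device minor >= compiled minor.
    return major == device_cc[0] and minor <= device_cc[1]


print(can_run("sm101a", (12, 0)))  # False
print(can_run("sm100a", (10, 0)))  # True
print(can_run("sm100a", (12, 0)))  # False
```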

@thakkarV
Collaborator

thakkarV commented Feb 7, 2025

All kernels we have released in 3.8 can be run on 100 or 101. Just compile with the right CUTLASS_NVCC_ARCHS set. SM120a support will come in a future release.
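
As a rough illustration of "compile with the right arch set", the Python sketch below maps a device's compute capability to a CMake flag. The mapping of 10.0 to `100a` and 10.1 to `101a` is inferred from this thread rather than taken from CUTLASS documentation, and `cmake_arch_flag` is a hypothetical helper.

```python
# Blackwell targets discussed in this thread for CUTLASS 3.8 (assumed mapping).
BLACKWELL_ARCHS_3_8 = {
    (10, 0): "100a",  # datacenter Blackwell
    (10, 1): "101a",  # DRIVE Thor
}


def cmake_arch_flag(device_cc: tuple[int, int]) -> str:
    """Return the -DCUTLASS_NVCC_ARCHS flag for a compute capability.

    Raises ValueError for capabilities not covered by the mapping above,
    such as 12.0, which the thread says is planned for a future release.
    """
    try:
        arch = BLACKWELL_ARCHS_3_8[device_cc]
    except KeyError:
        raise ValueError(
            f"compute capability {device_cc[0]}.{device_cc[1]} is not in the assumed 3.8 mapping"
        ) from None
    return f"-DCUTLASS_NVCC_ARCHS={arch}"


print("cmake ..", cmake_arch_flag((10, 0)))  # cmake .. -DCUTLASS_NVCC_ARCHS=100a
print("cmake ..", cmake_arch_flag((10, 1)))  # cmake .. -DCUTLASS_NVCC_ARCHS=101a
```

The helper raises instead of falling back to a nearby target, because code built for one arch-specific target is not expected to run on another.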


3 participants