Support for setting the PTX version #1

Open
maleadt opened this issue Jun 14, 2023 · 3 comments

Comments

Member

maleadt commented Jun 14, 2023

We currently can't realistically use NVVM, because it always sets the PTX version to the latest one available. For example, the current libNVVM_jll comes from CUDA 12.1, so the PTX version is set to 8.1, which only the very latest drivers support. I've filed a feature request with NVIDIA to make the PTX version configurable, which should just be a matter of setting a TargetMachine option.

The alternative, using the libNVVM from the user's toolkit, is unwanted because we'd then need to support multiple versions of the NVVM IR.
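
To make the version mismatch concrete, here is a minimal Python sketch of the check involved: read the `.version` directive from a PTX module and compare it against what a given driver can load. The helper names and the table are illustrative assumptions, not part of any NVIDIA or Julia API; the table follows the PTX ISA release notes, and the integer driver version is assumed to use the `1000*major + 10*minor` encoding that `cuDriverGetVersion` reports.

```python
import re
from typing import Optional, Tuple

# Illustrative table (hypothetical name): PTX ISA version -> CUDA release
# that introduced it, per the PTX ISA release notes. Not exhaustive.
PTX_TO_CUDA = {
    (7, 0): (11, 0), (7, 1): (11, 1), (7, 2): (11, 2),
    (7, 3): (11, 3), (7, 4): (11, 4), (7, 5): (11, 5),
    (7, 6): (11, 6), (7, 7): (11, 7), (7, 8): (11, 8),
    (8, 0): (12, 0), (8, 1): (12, 1),
}


def parse_ptx_version(ptx: str) -> Optional[Tuple[int, int]]:
    """Return (major, minor) from the module's `.version` directive, if any."""
    match = re.search(r"^\s*\.version\s+(\d+)\.(\d+)", ptx, re.MULTILINE)
    return (int(match.group(1)), int(match.group(2))) if match else None


def decode_driver_version(raw: int) -> Tuple[int, int]:
    """Decode an integer encoded as 1000*major + 10*minor (e.g. 12010 -> 12.1)."""
    return raw // 1000, (raw % 1000) // 10


def driver_supports_ptx(ptx_version: Tuple[int, int], driver_raw: int) -> bool:
    """True if the driver is at least as new as the CUDA release that
    introduced this PTX ISA version. Unknown versions are rejected."""
    required = PTX_TO_CUDA.get(ptx_version)
    if required is None:
        return False
    return decode_driver_version(driver_raw) >= required


def highest_supported_ptx(driver_raw: int) -> Optional[Tuple[int, int]]:
    """Newest PTX ISA version in the table that the driver can load."""
    cuda = decode_driver_version(driver_raw)
    candidates = [ptx for ptx, rel in PTX_TO_CUDA.items() if rel <= cuda]
    return max(candidates) if candidates else None


# Example: PTX as emitted by a CUDA 12.1 libNVVM, checked against two drivers.
sample = ".version 8.1\n.target sm_80\n.address_size 64\n"
version = parse_ptx_version(sample)
ok_on_12_1 = driver_supports_ptx(version, 12010)  # driver from CUDA 12.1
ok_on_11_8 = driver_supports_ptx(version, 11080)  # driver from CUDA 11.8
```

With this table, the PTX 8.1 module is accepted by the 12.1 driver and rejected by the 11.8 driver, which is the situation described above: the PTX version would need to be lowered to 7.8 or below to load on that older driver.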

Member Author

maleadt commented Apr 29, 2024

Worked around by #1, but let's keep this open until there's an officially supported solution.

Member Author

maleadt commented Apr 29, 2024

It's been suggested that a better solution is to ship ptxas (or libnvptxcompiler once it's distributed as a dynamic library) and use that to ensure the PTX ISA emitted is supported.

EDIT: I'm not actually sure this is going to work; is GPU machine code generated by ptxas from CUDA 12.x compatible with an 11.x driver? Things like cooperative groups rely on special allocations made by the driver, so this seems questionable.

Member Author

maleadt commented May 15, 2024

> EDIT: I'm not actually sure this is going to work; is GPU machine code generated by ptxas from CUDA 12.x compatible with an 11.x driver? Things like cooperative groups rely on special allocations made by the driver, so this seems questionable.

Turns out this incompatibility is real, and we won't be able to use ptxas from CUDA 12.x to target an 11.x-era driver. Guess we'll have to wait for a proper feature (#4 is too much of a hack).
