Distribute wheels with cuBLAS support for all supported NVIDIA GPU architectures #400
Labels: build, duplicate, enhancement, hardware
I recently discovered that llama-cpp-python can be compiled with cuBLAS support for all supported GPU architectures by setting the `CUDAFLAGS` environment variable to `-arch=all`. On Windows, I can use these commands in CMD:

Note that, due to an issue with the current llama.cpp version in this repo, `-lcublas` has to be added as well in order to link the needed cuBLAS library. Setting the `VERBOSE` environment variable to `1` lets you see the full output of the build process. Doing this, I can confirm that it is indeed building for all architectures, as it shows a warning about the deprecated Kepler architectures.

The resulting wheel works on my own system, but that is to be expected. I have not been able to test whether a wheel works on a different system.
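For reference, a build along these lines can be sketched as the following CMD commands. This is a reconstruction under assumptions, not the exact commands from this issue: the `CMAKE_ARGS`/`FORCE_CMAKE` variables are taken from the project's documented cuBLAS install flow, and the `CUDAFLAGS` value combines the `-arch=all` and `-lcublas` settings described above.

```shell
:: Hypothetical reconstruction -- assumes the documented cuBLAS build flow
:: of llama-cpp-python; CUDAFLAGS value is the one described in this issue.
set CMAKE_ARGS=-DLLAMA_CUBLAS=on
set FORCE_CMAKE=1
set CUDAFLAGS=-arch=all -lcublas
set VERBOSE=1
pip install llama-cpp-python --no-cache-dir --force-reinstall
```

With `VERBOSE=1` set, the pip build log should show nvcc being invoked once per target architecture, including the deprecation warning for the Kepler (sm_3x) targets mentioned above.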
This will greatly improve the user experience for text-generation-webui. Especially for Windows users due to eliminating the need for Visual Studio.