-
Notifications
You must be signed in to change notification settings - Fork 11.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error on make LLAMA_CUBLAS=1 #1470
Comments
I came across the same thing for Windows and the immediate issue was that your path (the "NVIDIA GPU Computing Toolkit" part) has spaces in it and apparently Make does not like spaces, so I think you can put quotes around it to make the path literal. However I kept running into other issues in Windows that I gave up and dual installed Ubuntu along with my Windows installation and it ran great. Never got CUBLAS to work on Windows though, I'd say before an official solution comes out just running it in Linux probably saves you a lot of headache in the coming days. |
As @technicolor-twelve says, this seems to happen because your CUDA_PATH env variable has spaces. I guess that you are trying to build with mingw, but as far as I know CUDA is not supported with mingw anyway. You have to use cmake and MSVC to build with CUDA under windows, there are instructions in the README. Alternatively, use one of the pre-built binaries available at https://github.com/ggerganov/llama.cpp/tags |
You have to escape the spaces with What compiler is that? Is it MingW? It can't use the Linux CUDA SDK anyway, since it is a Windows compiler. |
@SlyEcho If it's anything like my experience it's probably w64devkit with MinGW32, probably because make works on Windows with OpenBLAS and without the BLAS build, naturally I thought the cuBLAS build worked out of the box for windows as well in the Readme. |
i must escape directly in makefile? im using w64devkit-1.19.0 as project suggests. |
After adding double quotation marks " " to cuda PATH it works, but now fail in this step:
|
nvcc is calling cl (MSVC). They use different flags than GCC. You can't mix compilers like this. You have to rewrite a lot of the Makefile probably. Or use CMake that should configure everything automatically and find the paths and compilers and stuff. |
Can confirm, using CMake is the way to go, didn't face any issues!
|
CMake for cublas, w64devkit for openblas. Just makes things work. |
Fixed using cMake instead w64devkit, thanks to all! |
adjusting the ENV variable works well - below or shortened copy fix - add to PATH
solving
|
TORCH_CUDA_ARCH_LIST=8.6+PTX |
Trying to compile with CUDA support and get this:
The text was updated successfully, but these errors were encountered: