Anyone got CLBLAST working on Intel macOS with AMD GPU? Is it meant to work? #1429
Comments
Personally I just change the argument in
And it should compile and run fine! |
Thanks for the fast reply! That certainly got it running. But..
Hmm :) Have you got it running OK on macOS with AMD GPU? |
I’ll try to fix it. |
Thank you very much! |
I'll try to rewrite the CL platform and device selection, too. |
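For readers following along, one way the device choice could work is to prefer a GPU over the CPU instead of hard-coding a device index. The sketch below is only an illustration under stated assumptions, not the code in ggml-opencl.c or in any pull request: `sketch_device`, `pick_device`, and the `SKETCH_DEVICE_TYPE_*` macros are hypothetical names, and the macros merely copy the bit values of `CL_DEVICE_TYPE_CPU` and `CL_DEVICE_TYPE_GPU` from the OpenCL headers so the example compiles without OpenCL installed. In real host code the type would come from `clGetDeviceInfo` queried with `CL_DEVICE_TYPE`.

```c
#include <stddef.h>

/* Assumption: local copies of the cl.h bitfield values, so this sketch
   builds without the OpenCL headers. */
#define SKETCH_DEVICE_TYPE_CPU (1u << 1) /* same value as CL_DEVICE_TYPE_CPU */
#define SKETCH_DEVICE_TYPE_GPU (1u << 2) /* same value as CL_DEVICE_TYPE_GPU */

/* Hypothetical stand-in for what clGetDeviceInfo would report per device. */
struct sketch_device {
    const char *name;
    unsigned type; /* bitfield, as with cl_device_type */
};

/* Return the index of the first device whose type includes the GPU bit.
   If no GPU is present, return the caller-supplied fallback index. */
int pick_device(const struct sketch_device *devs, size_t count, int fallback) {
    for (size_t i = 0; i < count; i++) {
        if (devs[i].type & SKETCH_DEVICE_TYPE_GPU) {
            return (int)i;
        }
    }
    return fallback;
}
```

With a made-up list of a CPU at index 0 and a GPU at index 1, `pick_device(devs, 2, 0)` returns 1, which matches the manual "device 1" edit described in the original post; with only the CPU in the list it returns the fallback.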
Hey, a friend of mine has the same issue on AMD, but with error -6 instead of -30. The latest pull does not work! |
@skidd-level-100 -6 error is for
@TheBloke I haven't tried it on my Intel Mac, sorry; I didn't think Apple's drivers would have different API support between architectures. I might also try SlyEcho's fix later on a Hackintosh to see if my Clover & OpenCore loaders crash too. |
On Linux (Fedora), by the way, CLBlast was working fine on the integrated GPU, but then we recompiled with the latest pull and it broke with error -6.
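The bare numbers -6 and -30 in this thread are OpenCL status codes. As a hedged aid for reading them, here is a small lookup sketch. The numeric values are the ones defined in the Khronos `cl.h` header; `cl_error_entry` and `cl_error_name` are hypothetical names made up for this example (they are not part of OpenCL, CLBlast, or llama.cpp), and the table covers only a handful of codes.

```c
#include <stddef.h>

/* One row of the lookup table: status code and its cl.h macro name. */
struct cl_error_entry {
    int code;
    const char *name;
};

/* Partial table; values match the definitions in the Khronos cl.h header. */
static const struct cl_error_entry CL_ERROR_TABLE[] = {
    {   0, "CL_SUCCESS" },
    {  -1, "CL_DEVICE_NOT_FOUND" },
    {  -5, "CL_OUT_OF_RESOURCES" },
    {  -6, "CL_OUT_OF_HOST_MEMORY" },
    { -11, "CL_BUILD_PROGRAM_FAILURE" },
    { -30, "CL_INVALID_VALUE" },
    { -33, "CL_INVALID_DEVICE" },
};

/* Hypothetical helper: map a status code to its name, or to a
   placeholder string when the code is not in the partial table. */
const char *cl_error_name(int code) {
    size_t n = sizeof(CL_ERROR_TABLE) / sizeof(CL_ERROR_TABLE[0]);
    for (size_t i = 0; i < n; i++) {
        if (CL_ERROR_TABLE[i].code == code) {
            return CL_ERROR_TABLE[i].name;
        }
    }
    return "UNKNOWN_CL_ERROR";
}
```

Under this table, -6 reads as `CL_OUT_OF_HOST_MEMORY` and -30 as `CL_INVALID_VALUE`. The name alone does not say which call failed, so printing it next to the failing call site is usually the more useful step.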
OK thanks. FYI I'm on a Hackintosh too. No sign of any crashes, but I tried Sly's fix and got exactly the same issue with the long error message about quantisation formats. Maybe I'll test with an fp16 model later |
Please see: https://github.com/ggerganov/llama.cpp/pull/1435/files#r1193090108
clinfo:
|
Yeah I'm still getting that long message about 'automatic variable qualified with an address space' as well
|
idk if it helps, but I tried slicing the error message string, and the string from character
Edit: I changed the declarations from
Edit 2: The change from `constant` to `const` worked :) |
That should be the problem. It is now fixed in #1435, please test it and give feedback. |
Any perf improvements over CPU? How does it compare against cuBLAS? I'm also on Hackintosh. |
Yeah, I saw improvements in the prompt generation time; I think it was about half. I didn't do any proper benchmarks and I've not compared against cuBLAS. Right now it doesn't support the full GPU offloading that's now available with cuBLAS, so it's not going to be the same huge boost that provides. But it's definitely nice to have in situations where you have long context, e.g. ongoing chats, I would say. And just really nice to see the GPU being used in macOS! :) (Well, TBF not the first time, as WebLLM/mlc can do that too, but that has very limited model support right now.) |
"nice to see the GPU being used in macOS" -> definitely... tell me about it! Main use case is 1% usage when moving windows around. 😂 Thanks for the feedback, I'll give it a go. I tried to compile it a few days ago but also miserably failed. |
I'm still getting the error:
@swittk do you know how I can apply this workaround to the new ggml-opencl.cpp file?
Seems like it's on line 490 now but not sure what to change. Thank you. |
Hi all
I just learned about CLBLAST so wanted to try it at home on my Intel macOS system with AMD 6900XT GPU.
I have no idea if it's meant to work on this system or with AMD GPUs? Maybe it's only designed for NV on Linux or Windows at the moment? But I figured as it's using OpenCL, it should work with any GPU? Maybe? :)
Installing CLBLAST is easy:
Compiling went fine:
First attempt got this problem - it's using the wrong device:
My CPU, not GPU.
So I edited ggml-opencl.c and changed this line to device 1:
Now it tries to use my GPU, but still fails with exactly the same error:
I've never used CLBLAST before so no clue what this error means or what might be wrong.
Any help or advice would be appreciated!