-
Notifications
You must be signed in to change notification settings - Fork 10.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Share a simpler Cmake methd to compile and run GPU accelerated version(openBLAS and CLBlast) on android Qualcomm Adreno device. #2169
Comments
can you provide some performance info? thx |
It's an improvement over the current instructions. It's more a generic guide than an Adreno/Snapdagon guide. "ocl-icd" doesn't work with Adreno in the same way as other Android devices, so export, & setting platform/device may be unnecessary. This works for my device: Ensure to move your model to the correct directory for better performance: For example: |
just a quick update,it works,and I saw the GPU be used(-ngl + big num),but the performance is very pool(qualcomm 8Gen1 less than Cpu openblas). maybe there are something wrong? |
I am an android noob, can you tell me why I can get better performance via moving model? |
The reason moving the model from downloads to |
This issue was closed because it has been inactive for 14 days since being marked as stale. |
I browse all issues and the official setup tutorial of compiling llama.cpp to GPU. But I found it is really confused by using MAKE tool and copy file from a src path to a dest path(Especially the official setup tutorial is little weird)
Here is the method I summarized (which I though much simpler and more elegant)
0 Install NDK and CMake tools
please refer the basic step of how to compile llama.cpp on CPU on android device
1 install openblas
2 install openCL
3 Install CLBLast
4 Build llama
5 run llama
The text was updated successfully, but these errors were encountered: