cuBLAS doc + error if -ngl > 0 and no cuBLAS #1466

JohannesGaessler · 2023-05-15T09:33:50Z

This PR adds documentation for how to use the GPU accelerated token generation that I implemented. Also, if llama.cpp was compiled without cuBLAS and the user then tries to run llama.cpp with 1 or more gpu layers an arror is raised and informs the user that they need to compile with cuBLAS (this was a point of failure for multiple people).

README.md

llama.cpp

JohannesGaessler · 2023-05-15T16:26:15Z

The log for the failed job says:

Error: buildx failed with: ERROR: failed to solve: process "/bin/sh -c apt-get update &&     apt-get install -y build-essential python3 python3-pip" did not complete successfully: exit code: 100

So it looks like it has nothing to do with my changes?

Green-Sky · 2023-05-15T16:56:52Z

Looks like a temporary issue. I'm rerunning the failed jobs, should fix it.

JohannesGaessler · 2023-05-15T17:28:19Z

Which button would I have had to press to re-run the job myself?

Green-Sky · 2023-05-15T17:42:49Z

click on the jobs details and then in the upper right there is a re-run button. If something fails, you can also select re-run failed. :)

JohannesGaessler · 2023-07-31T12:34:02Z

Long since outdated.

JohannesGaessler added the documentation Improvements or additions to documentation label May 15, 2023

Green-Sky reviewed May 15, 2023

View reviewed changes

README.md Show resolved Hide resolved

ggerganov approved these changes May 15, 2023

View reviewed changes

llama.cpp Outdated Show resolved Hide resolved

cuBLAS doc + error if -ngl > 0 and no cuBLAS

9b1f955

JohannesGaessler force-pushed the cublas-documentation branch from e4c0147 to 9b1f955 Compare May 15, 2023 15:55

JohannesGaessler closed this Jul 31, 2023

Bearsaerker mentioned this pull request Mar 12, 2025

Eval bug: Gemma 3 extremly slow prompt processing when using quantized kv cache. #12352

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cuBLAS doc + error if -ngl > 0 and no cuBLAS #1466

cuBLAS doc + error if -ngl > 0 and no cuBLAS #1466

JohannesGaessler commented May 15, 2023

JohannesGaessler commented May 15, 2023

Green-Sky commented May 15, 2023

JohannesGaessler commented May 15, 2023

Green-Sky commented May 15, 2023

JohannesGaessler commented Jul 31, 2023

cuBLAS doc + error if -ngl > 0 and no cuBLAS #1466

cuBLAS doc + error if -ngl > 0 and no cuBLAS #1466

Conversation

JohannesGaessler commented May 15, 2023

JohannesGaessler commented May 15, 2023

Green-Sky commented May 15, 2023

JohannesGaessler commented May 15, 2023

Green-Sky commented May 15, 2023

JohannesGaessler commented Jul 31, 2023