cuBLAS doc + error if -ngl > 0 and no cuBLAS #1466

JohannesGaessler
Collaborator

This PR adds documentation on how to use the GPU-accelerated token generation that I implemented. In addition, if llama.cpp was compiled without cuBLAS and the user tries to run it with one or more GPU layers, an error is now raised telling them that they need to compile with cuBLAS (this was a point of failure for multiple people).
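
For readers skimming the thread, the check described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the code from this PR: `check_gpu_layers` and `compiled_with_cublas` are hypothetical helper names, the message wording is invented, and `GGML_USE_CUBLAS` is assumed to be the compile-time macro that marks a cuBLAS build.

```cpp
#include <string>

// Hypothetical helper (not from the PR): reports whether this binary was
// built with cuBLAS. Assumes GGML_USE_CUBLAS is defined for cuBLAS builds.
static bool compiled_with_cublas() {
#ifdef GGML_USE_CUBLAS
    return true;
#else
    return false;
#endif
}

// Hypothetical helper (not from the PR): returns an empty string when the
// requested number of GPU layers is usable, otherwise an error message.
static std::string check_gpu_layers(int n_gpu_layers, bool built_with_cublas) {
    if (n_gpu_layers > 0 && !built_with_cublas) {
        return "error: " + std::to_string(n_gpu_layers) +
               " GPU layer(s) requested via -ngl, but this binary was "
               "compiled without cuBLAS; recompile with cuBLAS enabled";
    }
    return "";
}
```

A caller would pass the parsed `-ngl` value together with `compiled_with_cublas()` and abort with the returned message if it is non-empty. Returning the message instead of exiting inside the helper keeps the check easy to test.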

JohannesGaessler added the "documentation" label (Improvements or additions to documentation) on May 15, 2023
@JohannesGaessler
Collaborator Author

The log for the failed job says:

Error: buildx failed with: ERROR: failed to solve: process "/bin/sh -c apt-get update &&     apt-get install -y build-essential python3 python3-pip" did not complete successfully: exit code: 100

So it looks like it has nothing to do with my changes?

@Green-Sky
Collaborator

Looks like a temporary issue. I'm rerunning the failed jobs, should fix it.

@JohannesGaessler
Collaborator Author

Which button would I have had to press to re-run the job myself?

@Green-Sky
Collaborator

Click on the job's details; in the upper right there is a re-run button. If something fails, you can also select "re-run failed". :)

@JohannesGaessler
Collaborator Author

Long since outdated.
