llama.cpp server in a Python wheel.
git clone --recurse-submodules https://github.com/oobabooga/llama-cpp-binaries
cd llama-cpp-binaries
CMAKE_ARGS="-DGGML_CUDA=ON" pip install -v .
Replace -DGGML_CUDA=ON with the appropriate flag for your GPU, or remove it if you don't have a GPU.
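The backend flag mirrors llama.cpp's own CMake options. A non-exhaustive sketch of common choices follows; the flag names come from upstream llama.cpp and may change between versions, so check the llama.cpp build documentation for the set that matches your checkout:

```shell
# NVIDIA GPUs (CUDA)
CMAKE_ARGS="-DGGML_CUDA=ON" pip install -v .

# AMD GPUs (ROCm/HIP)
CMAKE_ARGS="-DGGML_HIP=ON" pip install -v .

# Cross-vendor Vulkan backend
CMAKE_ARGS="-DGGML_VULKAN=ON" pip install -v .

# CPU only: omit CMAKE_ARGS entirely
pip install -v .
```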
import subprocess

import llama_cpp_binaries

# Locate the bundled llama-server executable inside the installed wheel
server_path = llama_cpp_binaries.get_binary_path()
process = subprocess.Popen([server_path, "--help"])
process.wait()

For a more detailed example, consult: https://github.com/oobabooga/text-generation-webui/blob/main/modules/llama_cpp_server.py
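In practice you will want to start the server with a model, wait until it is ready, and shut it down cleanly. A minimal sketch follows, assuming the llama.cpp server's /health endpoint (which returns 200 once the server is up); the model path and port are placeholders to adjust for your setup:

```python
import subprocess
import time
import urllib.error
import urllib.request


def wait_for_server(port, timeout=60.0):
    """Poll the server's /health endpoint until it answers 200 OK or we time out."""
    url = f"http://127.0.0.1:{port}/health"
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            with urllib.request.urlopen(url, timeout=1) as resp:
                if resp.status == 200:
                    return True
        except (urllib.error.URLError, OSError):
            # Server not accepting connections yet; retry shortly
            time.sleep(0.25)
    return False


if __name__ == "__main__":
    import llama_cpp_binaries

    # Placeholder model path and port; --model and --port are standard
    # llama.cpp server flags
    server = subprocess.Popen([
        llama_cpp_binaries.get_binary_path(),
        "--model", "path/to/model.gguf",
        "--port", "8850",
    ])
    try:
        if wait_for_server(8850):
            print("Server is ready")
    finally:
        server.terminate()
        server.wait()
```

The polling loop avoids a fixed sleep: the server's startup time depends on model size, so checking /health is more robust than guessing a delay.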