Closed
Description
A lot of people would like to run their own server but don't have the DevOps skills needed to configure and build a llama-cpp-python + Python + llama.cpp environment.
I'm working on some Dockerfiles, run via a GitHub Action that publishes to Docker Hub (similar to llama.cpp's workflows/docker.yml), covering both OpenBLAS (CPU-only, no NVIDIA GPU) and cuBLAS (NVIDIA GPU via Docker) support.
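For the OpenBLAS (CPU-only) case, the image could look roughly like the sketch below. The Debian package names, the `FORCE_CMAKE`/`CMAKE_ARGS` build switches, and the `[server]` extra are assumptions based on common llama-cpp-python usage; check the project README for the exact flags in the current release.

```dockerfile
# Sketch of a CPU-only (OpenBLAS) image -- package names and build
# flags are assumptions and may need adjusting for current releases.
FROM python:3.11-slim

# Build dependencies plus OpenBLAS headers/libs (Debian names assumed)
RUN apt-get update && apt-get install -y --no-install-recommends \
        build-essential cmake libopenblas-dev \
    && rm -rf /var/lib/apt/lists/*

# Build-from-source switches for BLAS support (values assumed; see the
# llama-cpp-python README for the exact CMake arguments)
ENV FORCE_CMAKE=1 \
    CMAKE_ARGS="-DLLAMA_BLAS=ON -DLLAMA_BLAS_VENDOR=OpenBLAS"

# The [server] extra (assumed) pulls in the OpenAI-compatible API server
RUN pip install --no-cache-dir "llama-cpp-python[server]"

EXPOSE 8000
CMD ["python", "-m", "llama_cpp.server"]
```

A cuBLAS variant would follow the same shape but start from an `nvidia/cuda` base image and swap the BLAS build switches for the CUDA ones.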
Which CC-licensed models are currently available that are compatible with llama.cpp's new quantized format? Ideally we want to start with small models to keep the Docker image sizes manageable.