Implement support for CUDA #944
base: main
Hi guys! First of all, thanks for this amazing project! Any news on the release of the CUDA support feature?
Please merge this pull request. It would really make a difference if we could finally use CUDA :)
* Update dev.sh
* Update deploy.sh
* Update serge.env

Co-authored-by: Juan Calderon-Perez <835733+gaby@users.noreply.github.com>

* Update serge.env
* Update deploy.sh (update path)
* Update dev.sh (update path)
* Update serge.env
* Update serge.env (bump version of Llama cpp python to v0.2.44)
@coderabbitai review
Walkthrough
The project has been enhanced to support GPU acceleration, specifically targeting NVIDIA and AMD GPUs, for Dockerized large language models. This includes configuring the build process to create Docker images with CUDA support, adjusting deployment scripts to dynamically select package sources based on GPU architecture, and setting up environment variables for GPU support. These changes aim to improve performance by leveraging GPU capabilities for computational tasks.
Review Status
Actionable comments generated: 2
Configuration used: CodeRabbit UI
Files ignored due to path filters (1)
- docker-compose.dev.yml is excluded by: `!**/*.yml`
Files selected for processing (7)
- .dockerignore (1 hunks)
- .github/workflows/docker-gpu.yml (1 hunks)
- .github/workflows/docker.yml (2 hunks)
- docker/Dockerfile.gpu (1 hunks)
- scripts/deploy.sh (1 hunks)
- scripts/dev.sh (1 hunks)
- scripts/serge.env (1 hunks)
Additional comments: 5
scripts/serge.env (1)
- 1-2: The addition of `SERGE_GPU_NVIDIA_SUPPORT` and `SERGE_GPU_AMD_SUPPORT` configuration flags with default values set to `false` is a clear and effective way to introduce optional GPU support. This approach allows for easy configuration and ensures backward compatibility by making GPU support opt-in.

.dockerignore (1)
- 12-12: Updating the `.dockerignore` file to ignore the `docker/` directory is a good practice for optimizing Docker builds by excluding unnecessary files. This change supports a cleaner project structure by centralizing Docker-related files.

.github/workflows/docker.yml (1)
- 63-63: The update to the Dockerfile path in the `.github/workflows/docker.yml` workflow file correctly reflects the project's restructuring to support GPU-enabled Docker images. This change ensures that the CI/CD pipeline builds and publishes the correct Docker images.

scripts/deploy.sh (1)
- 28-35: The updated logic in `deploy.sh` for dynamically setting the `pip_command` based on NVIDIA or AMD GPU support is well-implemented, aligning with the PR's objectives to introduce flexible GPU support. Ensure that the `serge.env` file is correctly sourced in deployment environments to properly set the `SERGE_GPU_NVIDIA_SUPPORT` and `SERGE_GPU_AMD_SUPPORT` variables.

scripts/dev.sh (1)
- 29-36: The updates to `dev.sh` for dynamically setting the `pip_command` based on GPU support are consistent with similar changes in `deploy.sh`, promoting flexibility and alignment with the PR's objectives. Ensure that the `serge.env` file is correctly sourced in development environments to properly set the GPU support variables.
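
For readers who have not opened the diff, the flag-based selection that the review comments describe might look roughly like the sketch below. This is not the actual content of `deploy.sh` or `dev.sh`: the function name `select_pip_command`, the three index URLs, and the exact `pip install` arguments are placeholders assumed for illustration. Only the two flag names, the `pip_command` variable name, and the opt-in default of `false` come from the review above.

```shell
#!/bin/sh
# Hypothetical sketch only; the real scripts in this PR may differ.

# Placeholder package indexes (assumed, not the project's real sources).
CPU_INDEX="https://example.invalid/cpu"
NVIDIA_INDEX="https://example.invalid/cuda"
AMD_INDEX="https://example.invalid/rocm"

# Print a pip command for the given flag values.
#   $1: value of SERGE_GPU_NVIDIA_SUPPORT ("true" or "false")
#   $2: value of SERGE_GPU_AMD_SUPPORT    ("true" or "false")
# NVIDIA is checked first, so it wins if both flags are "true".
select_pip_command() {
    sel_nvidia="${1:-false}"
    sel_amd="${2:-false}"
    if [ "$sel_nvidia" = "true" ]; then
        sel_index="$NVIDIA_INDEX"
    elif [ "$sel_amd" = "true" ]; then
        sel_index="$AMD_INDEX"
    else
        sel_index="$CPU_INDEX"
    fi
    echo "pip install llama-cpp-python --extra-index-url $sel_index"
}

# Unset flags fall back to "false", mirroring the opt-in defaults in serge.env.
pip_command="$(select_pip_command "${SERGE_GPU_NVIDIA_SUPPORT:-false}" "${SERGE_GPU_AMD_SUPPORT:-false}")"
echo "$pip_command"
```

Defaulting each flag to `false` when it is unset means a missing or unsourced `serge.env` degrades to the CPU path rather than failing, which is one way to address the reviewer's concern about sourcing.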
@coderabbitai pause
Done:
- `-cuda` docker images

TODO:
- `nvidia-runtime` image to `Dockerfile.gpu`
- `README.md`
Fixes #43
Fixes #128
Summary by CodeRabbit
- `.dockerignore` to optimize Docker builds.