This repository provides an end-to-end example of applying LLMOps practices to large language models (LLMs) on Amazon SageMaker. It demonstrates a sample LLMOps pipeline for training, optimizing, deploying, monitoring, and managing LLMs on SageMaker using infrastructure-as-code principles.
Currently implemented:
End-to-End:
Inference:
- Deploy Llama3 on Amazon SageMaker
- Deploy Mixtral 8x7B on Amazon SageMaker
- Scale LLM Inference on Amazon SageMaker with Multi-Replica Endpoints
- Optimizing LLMs with Quantization (coming soon)
- Monitoring and managing LLMs with CloudWatch (coming soon)
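Once one of the models above is deployed, it can be queried through the SageMaker runtime API. The sketch below shows the request format the Hugging Face LLM (TGI) container expects: an `inputs` string plus a `parameters` object. The endpoint name `llama3-8b-instruct` and the generation parameters are illustrative placeholders, not values fixed by this repository.

```python
import json

# Illustrative endpoint name -- replace with the name of your deployed endpoint.
ENDPOINT_NAME = "llama3-8b-instruct"


def build_request(prompt: str, max_new_tokens: int = 256, temperature: float = 0.6) -> bytes:
    """Build a JSON request body in the shape the Hugging Face LLM
    container on SageMaker expects: an `inputs` string plus `parameters`."""
    body = {
        "inputs": prompt,
        "parameters": {
            "max_new_tokens": max_new_tokens,
            "temperature": temperature,
            "do_sample": True,
        },
    }
    return json.dumps(body).encode("utf-8")


def invoke(prompt: str) -> dict:
    """Send the prompt to the endpoint. Requires AWS credentials and a
    deployed endpoint, so it is not exercised here."""
    import boto3  # imported lazily so build_request() works without AWS

    runtime = boto3.client("sagemaker-runtime")
    response = runtime.invoke_endpoint(
        EndpointName=ENDPOINT_NAME,
        ContentType="application/json",
        Body=build_request(prompt),
    )
    return json.loads(response["Body"].read())
```

The same payload format works for the Mixtral 8x7B and multi-replica endpoints, since all of them sit behind the standard `invoke_endpoint` API.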
Training:
The repository currently contains:
scripts/
: Scripts for training and deploying LLMs on SageMaker
notebooks/
: Examples and tutorials for using the pipeline
Before we can start, make sure you have met the following requirements:
- AWS account with sufficient service quota for the SageMaker instance types you plan to use
- AWS CLI installed
- AWS IAM user configured in the CLI with permission to create and manage EC2 instances
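With the IAM user created, the CLI setup can be verified as follows (a minimal sketch using standard AWS CLI commands):

```shell
# Store the IAM user's access keys and default region
# (prompts interactively for each value)
aws configure

# Confirm the CLI is authenticated as the expected account and user
aws sts get-caller-identity
```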
Contributions are welcome! Please open issues and pull requests.
This repository is licensed under the MIT License.