modal-outlines-examples

Examples of code that use Outlines to enable structured text generation for LLMs running on Modal.

Is your computer only capable of running small, quantized LLMs? Would you love to have an Nvidia H100 at home? Keep reading, because we're about to make your dream come true.

Level 1: The Basics

The basic idea: We deploy a function to Modal, an easy-to-use cloud platform, then we call that function locally from a Jupyter notebook.

The unbelievable part: The function will run on a super powerful GPU in the cloud. Plus, we get to use free processing credits ($30 monthly).

Gated repo: https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2

Level 2: Different LLMs

Yeah, I know Mistral is good, but what if I want to test different models?

Level 3: vLLM

Coming soon.

Level 4: FastAPI

How to serve an inference endpoint for structured text generation. GPU-powered. Serverless. Automatically scales. Pay as you go.

Installation

conda create --name modal-outlines-examples -c conda-forge python=3.11
conda activate modal-outlines-examples
pip install -r requirements.txt
python -m modal setup

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
modal		modal
notebooks		notebooks
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

modal-outlines-examples

Level 1: The Basics

Level 2: Different LLMs

Level 3: vLLM

Level 4: FastAPI

Installation

About

Releases

Packages

Languages

License

aastroza/modal-outlines-examples

Folders and files

Latest commit

History

Repository files navigation

modal-outlines-examples

Level 1: The Basics

Level 2: Different LLMs

Level 3: vLLM

Level 4: FastAPI

Installation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages