These are end-to-end pipelines that demonstrate the power of MAX for accelerating common AI workloads, and more. The umbrella `pipelines` Mojo module contains each of these pipelines as its own module, along with shared modules hosting common functionality.
The pipelines include:
- Llama 3: A text completion demo using the Llama 3 model, implemented in Mojo using the MAX Graph API. This pipeline contains everything needed to run a self-hosted large language model.
- Llama 2: Similar to the Llama 3 text completion pipeline, but using the Llama 2 model. The Llama 2 pipeline also shows how to use a custom kernel in MAX Graphs.
- Replit Code: Code generation via the Replit Code V1.5 3B model, implemented in Mojo using the MAX Graph API.
- Quantize TinyStories: A demonstration of quantizing a full-precision model, originally trained on the TinyStories dataset, using the MAX Graph API.
Instructions for how to run each pipeline can be found in its respective subdirectory. A shared `run_pipeline.🔥` Mojo driver is used to execute the pipelines.
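For example, a pipeline is launched by passing its name to the driver. The following is a hypothetical invocation: the `llama3` name matches the pipeline listed above, but the `--prompt` flag and its value are assumptions, so see each pipeline's subdirectory for the exact arguments it accepts:

```sh
mojo run_pipeline.🔥 llama3 --prompt "I believe the meaning of life is"
```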
In addition to the pipelines, common modules contain types and functions shared between the various pipelines. These modules currently include:
- nn: Abstractions for common layers in neural network architectures.
- tokenizer: Shared tokenizers used across text pipelines.
- weights: A module containing code for loading common weight formats, such as GGUF.
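To give a rough sense of how a pipeline pulls these shared pieces together, here is a minimal Mojo sketch. The module paths, type names, and signatures used (`Linear`, `BPETokenizer`, `GGUFFile`, `encode`) are hypothetical stand-ins rather than the actual API; consult the module sources for the real names:

```mojo
# Hypothetical sketch: the module paths, type names, and signatures below
# are illustrative assumptions, not the actual pipelines API.
from pipelines.nn import Linear               # common layer abstractions
from pipelines.tokenizer import BPETokenizer  # shared text tokenizer
from pipelines.weights.gguf import GGUFFile   # GGUF weight-format loading

fn prepare_inputs(checkpoint_path: String, prompt: String) raises:
    var weights = GGUFFile(checkpoint_path)  # parse a GGUF checkpoint
    var tokenizer = BPETokenizer()           # construct the shared tokenizer
    var tokens = tokenizer.encode(prompt)    # text -> token ids for the model
```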
Unit tests for the pipelines and their shared components are provided in a directory parallel to this one. They can be run using the Mojo testing framework via an invocation similar to the following (if running from this directory):

```sh
mojo test ../ -I ../
```

To select a single test case to run, use an invocation like the following:

```sh
mojo test -I ../ "../test/llama3/test_heap.mojo::test_simple()"
```