
AIBrix

Welcome to AIBrix, an open-source initiative designed to provide essential building blocks to construct scalable GenAI inference infrastructure. AIBrix delivers a cloud-native solution optimized for deploying, managing, and scaling large language model (LLM) inference, tailored specifically to enterprise needs.

| Documentation | Blog | White Paper | Twitter/X | Developer Slack |

Key Features

The initial release includes the following key features:

  • High-Density LoRA Management: Streamlined support for lightweight, low-rank adaptations of models (see the adapter sketch after this list).
  • LLM Gateway and Routing: Efficiently manage and direct traffic across multiple models and replicas.
  • LLM App-Tailored Autoscaler: Dynamically scale inference resources based on real-time demand.
  • Unified AI Runtime: A versatile sidecar enabling metric standardization, model downloading, and management.
  • Distributed Inference: Scalable architecture to handle large workloads across multiple nodes.
  • Distributed KV Cache: Enables high-capacity, cross-engine KV reuse.
  • Cost-efficient Heterogeneous Serving: Enables mixed GPU inference to reduce costs with SLO guarantees.
  • GPU Hardware Failure Detection: Proactive detection of GPU hardware issues.
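
As a concrete illustration of the LoRA management feature above, the sketch below registers a LoRA adapter against a running base-model deployment using AIBrix's ModelAdapter custom resource. The resource kind comes from AIBrix, but the apiVersion, names, labels, and artifact URL shown here are placeholders for illustration and may differ across releases; consult the documentation for the exact schema in your version.

# Hypothetical ModelAdapter manifest: the adapter name, base-model labels, and
# artifact URL below are placeholders; see the AIBrix docs for the exact schema.
cat <<EOF | kubectl apply -f -
apiVersion: model.aibrix.ai/v1alpha1
kind: ModelAdapter
metadata:
  name: my-lora-adapter                       # placeholder adapter name, used for routing
  labels:
    model.aibrix.ai/name: "my-lora-adapter"
spec:
  baseModel: my-base-model                    # placeholder name of the base model deployment
  podSelector:
    matchLabels:
      model.aibrix.ai/name: my-base-model     # selects the pods serving the base model
  artifactURL: huggingface://my-org/my-lora   # placeholder location of the LoRA artifact
EOF

Once the adapter is registered, the gateway can route requests to it by model name, so multiple adapters can share the same base-model replicas.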

Architecture

(Architecture diagram: aibrix-architecture-v1)

Quick Start

To get started with AIBrix, clone this repository and follow the setup instructions in the documentation. Our comprehensive guide will help you configure and deploy your first LLM infrastructure seamlessly.

# Local Testing
git clone https://github.com/vllm-project/aibrix.git
cd aibrix

# Install nightly aibrix dependencies
kubectl create -k config/dependency

# Install nightly aibrix components
kubectl create -k config/default

Install stable distribution

# Install component dependencies
kubectl create -k "github.com/vllm-project/aibrix/config/dependency?ref=v0.2.0"

# Install aibrix components
kubectl create -k "github.com/vllm-project/aibrix/config/overlays/release?ref=v0.2.0"

Documentation

For detailed documentation on installation, configuration, and usage, please visit our documentation page.

Contributing

We welcome contributions from the community! Check out our contributing guidelines to see how you can make a difference.

Slack Channel: #aibrix

License

AIBrix is licensed under the Apache 2.0 License.

Support

If you have any questions or encounter any issues, please submit an issue on our GitHub issues page.

Thank you for choosing AIBrix for your GenAI infrastructure needs!
