Cortex

Cortex is built to deploy, manage, and scale machine learning models in production on AWS. It provides features such as:

Serverless workloads
Automated cluster management
CI/CD and observability integrations

Cortex supports four ways to build scalable APIs:

Realtime: create APIs that respond to requests in real-time.
Async: create APIs that respond to requests asynchronously.
Batch: create APIs that run distributed batch jobs.
Task: create APIs that run jobs on-demand.
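
The choice between the four workload types above can be sketched as a small decision helper. This is only an illustration: the function `choose_kind` and its parameters are hypothetical and not part of Cortex, and the kind strings are assumed to match the `kind` values used in Cortex API configuration files, which should be checked against the Cortex version in use.

```python
from enum import Enum


class ApiKind(Enum):
    # Assumed to mirror the `kind` values in a Cortex API configuration.
    REALTIME = "RealtimeAPI"
    ASYNC = "AsyncAPI"
    BATCH = "BatchAPI"
    TASK = "TaskAPI"


def choose_kind(request_driven: bool,
                caller_waits: bool = True,
                distributed: bool = False) -> ApiKind:
    """Hypothetical helper: map workload traits to one of the four API kinds."""
    if request_driven:
        # Request/response workloads: answer immediately or asynchronously.
        return ApiKind.REALTIME if caller_waits else ApiKind.ASYNC
    # Job workloads: distributed batch jobs or single on-demand tasks.
    return ApiKind.BATCH if distributed else ApiKind.TASK


if __name__ == "__main__":
    print(choose_kind(request_driven=True).value)
    print(choose_kind(request_driven=False, distributed=True).value)
```

A sentiment classifier that must answer each request immediately would fall under the realtime case, while the same model behind a queue of longer-running requests would fall under the async case.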

Cortex requires only two configuration files to deploy an application. Cortex creates a cluster from the cluster.yaml file, along with an S3 bucket and a CloudWatch log group. The Cortex cluster runs on an EKS (Kubernetes) cluster in a dedicated VPC in your AWS account. Each individual API has its own cortex.yaml file, which defines the type of workload to deploy.
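
To make the split between the two files concrete, here is a rough sketch that builds both configurations as Python dictionaries and renders them as JSON (which most YAML parsers also accept). Everything in it is an assumption for illustration: the helper names are hypothetical, and the field names (`cluster_name`, `node_groups`, `kind`, `pod`, and so on) are a guess at the general shape of cluster.yaml and cortex.yaml, not a copy of this repository's files. The schema differs between Cortex versions, so consult the Cortex documentation before reusing any of it.

```python
import json


def cluster_config(name: str, region: str, instance_type: str,
                   min_instances: int = 1, max_instances: int = 2) -> dict:
    """Hypothetical cluster-level settings (the role played by cluster.yaml)."""
    if min_instances > max_instances:
        raise ValueError("min_instances must not exceed max_instances")
    return {
        "cluster_name": name,          # assumed field name
        "region": region,              # assumed field name
        "node_groups": [{              # assumed field name
            "name": "default",
            "instance_type": instance_type,
            "min_instances": min_instances,
            "max_instances": max_instances,
        }],
    }


def api_config(name: str, kind: str, image: str) -> list:
    """Hypothetical per-API settings (the role played by cortex.yaml)."""
    return [{
        "name": name,
        "kind": kind,                  # e.g. "RealtimeAPI" or "AsyncAPI"
        "pod": {"containers": [{"name": "api", "image": image}]},
    }]


def render(config) -> str:
    """Serialize a config as indented JSON text."""
    return json.dumps(config, indent=2)


if __name__ == "__main__":
    print(render(cluster_config("demo", "us-east-1", "t3.medium")))
    print(render(api_config("sentiment", "RealtimeAPI", "example/image:latest")))
```

The point of the sketch is the division of responsibility: one file describes the infrastructure once, and each API carries its own small file describing the workload.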

In this exercise, a transformers sentiment classifier application is deployed with Cortex as two different APIs: realtime and async.
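
Once such an API is deployed, a client could call it over HTTP. The following is a minimal sketch using only the Python standard library. The endpoint URL and the `{"text": ...}` payload shape are assumptions made for illustration; the actual request format depends on how the application in this repository parses its input.

```python
import json
from urllib import request


def build_sentiment_request(endpoint: str, text: str) -> request.Request:
    """Build a JSON POST request; the payload shape is an assumption."""
    body = json.dumps({"text": text}).encode("utf-8")
    return request.Request(
        endpoint,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def classify(endpoint: str, text: str, timeout: float = 30.0):
    """Send the request and decode the JSON response body."""
    req = build_sentiment_request(endpoint, text)
    with request.urlopen(req, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Placeholder endpoint: replace with the URL reported for the deployed API.
    req = build_sentiment_request("http://example.invalid/sentiment", "Cortex is great")
    print(req.get_method(), req.full_url)
```

For an async API the first response would typically be an identifier to poll rather than the prediction itself, so `classify` as written fits the realtime case only.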

Cortex is super 🚀 With just 2 commands, 2 configuration files, and the right amount of patience, the application is deployed seamlessly, without any modifications to the application itself.

dudeperf3ct/11-cortex-deploy

Folders and files

Latest commit

History

Repository files navigation

Cortex

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages