KServe

KServe provides a Kubernetes Custom Resource Definition for serving machine learning (ML) models on arbitrary frameworks. It aims to solve production model serving use cases by providing performant, high abstraction interfaces for common ML frameworks like Tensorflow, XGBoost, ScikitLearn, PyTorch, and ONNX.

It encapsulates the complexity of autoscaling, networking, health checking, and server configuration to bring cutting edge serving features like GPU Autoscaling, Scale to Zero, and Canary Rollouts to your ML deployments. It enables a simple, pluggable, and complete story for Production ML Serving including prediction, pre-processing, post-processing and explainability. KServe is being used across various organizations.

For more details, visit KServe website

Since 0.7 KFServing is rebranded to KServe, we still support previous KFServing 0.5.x and 0.6.x releases, please refer to corresponding release branch for docs.

Learn More

To learn more about KServe, how to deploy it as part of Kubeflow, how to use various supported features, and how to participate in the KServe community, please follow the KServe website documentation. Additionally, we have compiled a list of presentations and demoes to dive through various details.

Installation

Standalone Installation

KServe by default installs Knative for serverless deployment, please follow Serverless installation guide to install KServe. If you are looking to install KServe without Knative(this feature is still alpha), please follow Raw Kubernetes Deployment installation guide.

Quick Install

Please follow quick install to install KServe on your local machine.

Create test inference service

Please follow getting started to create your first InferenceService.

Roadmap

API Reference

InferenceService v1beta1 API Docs

Developer Guide

Developer Guide.

Contributor Guide

Adopters

Name		Name	Last commit message	Last commit date
Latest commit History 879 Commits
.github		.github
cmd		cmd
config		config
docs		docs
hack		hack
install		install
manifests/charts		manifests/charts
pkg		pkg
python		python
release		release
test		test
third_party/library		third_party/library
tools/tf2openapi		tools/tf2openapi
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
OWNERS		OWNERS
PROJECT		PROJECT
README.md		README.md
ROADMAP.md		ROADMAP.md
agent.Dockerfile		agent.Dockerfile
go.mod		go.mod
go.sum		go.sum
prow_config.yaml		prow_config.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

KServe

Learn More

Installation

Standalone Installation

Quick Install

Create test inference service

Roadmap

API Reference

Developer Guide

Contributor Guide

Adopters

About

Releases

Packages

Languages

License

shrinath-suresh/kserve

Folders and files

Latest commit

History

Repository files navigation

KServe

Learn More

Installation

Standalone Installation

Quick Install

Create test inference service

Roadmap

API Reference

Developer Guide

Contributor Guide

Adopters

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages