You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
These Inference Extensions are being formally specified as part of the Kubernetes Gateway API and will provide a standard interface for inference optimized load balancing algorithms such as the production stack Router.
Why do you need this feature?
Provides a (soon-to-be) standardized integration with Kubernetes Gateways.