| Model name | Model source | Sample workspace | Kubernetes workload | Distributed inference |
|---|---|---|---|---|
| falcon-7b-instruct | tiiuae | link | Deployment | false |
| falcon-7b | tiiuae | link | Deployment | false |
| falcon-40b-instruct | tiiuae | link | Deployment | false |
| falcon-40b | tiiuae | link | Deployment | false |

- Public: Kaito maintainers manage the lifecycle of the inference service images that contain model weights. The images are available in Microsoft Container Registry (MCR).

See document.
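
As a rough illustration of how one of the presets in the table might be selected, the following is a minimal sketch of a Kaito `Workspace` manifest for `falcon-7b`. The `apiVersion`, the GPU `instanceType`, the workspace name, and the `apps` label are assumptions made for illustration and may differ from the linked sample workspaces or from the Kaito release in use; treat the linked samples as the authoritative versions.

```yaml
# Sketch only: field values below are illustrative assumptions.
apiVersion: kaito.sh/v1alpha1        # assumed API version; check your installed CRD
kind: Workspace
metadata:
  name: workspace-falcon-7b          # hypothetical workspace name
resource:
  instanceType: "Standard_NC12s_v3"  # assumed GPU VM size; pick one that fits the model
  labelSelector:
    matchLabels:
      apps: falcon-7b                # hypothetical label used to select nodes
inference:
  preset:
    name: "falcon-7b"                # preset name taken from the table above
```

Because the table lists every Falcon preset as a `Deployment` with distributed inference set to `false`, a single workspace of this shape would be expected to run the model on one node rather than sharding it across several.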