planning: API for Active models #1688

gabrielle-ong · 2024-11-15T05:58:37Z

Goal

We currently have the CLI version of cortex ps to show active models and consumed resources (Ram/vram usage and how long its been running)
we have an unofficial API GET http://localhost:39281/inferences/server/models
we should make this an official /v1 API

Discussion / Success Criteria: @louis-jan

Should it 1 API to show all running models & consumed resources? ie == CLI cortex ps
Or separate APIs to show active status for /models (per model or all models?)
And separate API to show resources consumed /system

Tasklist

add unofficial API to docs (interim) - @gabrielle-ong
implement official API endpoint(s)

Current

GET http://localhost:39281/inferences/server/models

Response:

{
    "data": [
        {
              "engine": "cortex.llamacpp",
              "id": "llama3.2-1b-instruct",
              "model_size": 123,
              "object": "model",
              "ram": 123,
              "start_time": 123,
              "vram": 123,              
        }
    ],
    "object": "list"
}

Future: API / CLI

API

1. Feature

GET /v1/endpoint

Body:

{
    "key": "value"
}

Response

200
{
}
Error
{
}

User request

The text was updated successfully, but these errors were encountered:

gabrielle-ong added the type: epic A major feature or initiative label Nov 15, 2024

github-project-automation bot added this to Menlo Nov 15, 2024

github-project-automation bot moved this to Investigating in Menlo Nov 15, 2024

gabrielle-ong added type: planning Opening up a discussion category: model management Model pull, yaml, model state and removed type: epic A major feature or initiative labels Nov 15, 2024

gabrielle-ong modified the milestones: v1.0.3, v1.0.4 Nov 15, 2024

gabrielle-ong moved this from Investigating to Icebox in Menlo Nov 27, 2024

dan-menlo assigned vansangpfiev Dec 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

planning: API for Active models #1688

planning: API for Active models #1688

gabrielle-ong commented Nov 15, 2024

planning: API for Active models #1688

planning: API for Active models #1688

Comments

gabrielle-ong commented Nov 15, 2024

Goal

Discussion / Success Criteria: @louis-jan

Tasklist

Current

Future: API / CLI

API

1. Feature

User request