epic: Cortex Model Structures and simplified `cortex run` #1512

gabrielle-ong · 2024-10-17T08:45:26Z

Problem

Model names are complex because they capture repo, source, version and alias
Model names should be simplified and compact, so they do not overwhelm users
We need a clearer way to handle model names for cortex models list, cortex pull and cortex run
This affects the core logic of how we handle models in cortex.db

Success Criteria

cortex pull (huggingface, cortexso) successfully pulls the model
cortex.db saves the right fields for the pulled model
cortex models list shows a simplified table
cortex run uses regex to ask the user which model ID they want to run

repo, version, source, id, alias

Tasklist / Sub-issues

to be added

Eng Specs

from #1410

Concepts

Model Repo: i.e. tinyllama, or bartowski/...
Model Source: huggingface, cortex
Model Version: i.e. specific quant, that belongs to a Model Repo
Model ID: should be :
Model Alias: user-defined shortname - Deprecated in favour of regex
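
To make the concepts above concrete, here is a minimal sketch of splitting a Model ID into its parts. The spec does not spell out the ID format at this point, so the `[source/]repo:version` layout is an assumption inferred from the example IDs used later in this issue (such as `tinyllama:1b-gguf`). `ModelId` and `parse_model_id` are hypothetical names, not Cortex APIs.

```python
from typing import NamedTuple, Optional


class ModelId(NamedTuple):
    source: Optional[str]  # e.g. "huggingface.co"; None when no host prefix is present
    repo: str              # e.g. "tinyllama" or "bartowski/Mistral-8b-instruct-gguf"
    version: str           # e.g. "1b-gguf" or a specific quant


def parse_model_id(model_id: str) -> ModelId:
    """Split an ID of the assumed form [source/]repo:version."""
    # The version is taken to be everything after the last colon.
    repo_part, sep, version = model_id.rpartition(":")
    if not sep or not repo_part or not version:
        raise ValueError(f"expected '<repo>:<version>', got {model_id!r}")

    # Assumption: a first path segment containing a dot is a source host.
    source = None
    first, slash, rest = repo_part.partition("/")
    if slash and "." in first:
        source, repo_part = first, rest
    return ModelId(source, repo_part, version)
```

With this layout, `parse_model_id("tinyllama:1b-gguf")` yields a repo of `tinyllama` and a version of `1b-gguf`, with no source.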

1. Models Table & `cortex models list`

The Models table in cortex.db remains the same as before

| Column               | Description                         |
|----------------------|-------------------------------------|
| `model`              | Unique identifier for the model     |
| `author_repo_id`     | Author or repository identifier     |
| `branch_name`        | Doesn't exist                       |
| `path_to_model_yaml` | Path to the model's YAML file       |
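
As an illustration only, the table above could be modelled in SQLite roughly as follows. The table name `models`, the column types, and the helper names `open_db`, `add_model` and `list_model_ids` are assumptions made for this sketch; only the four column names come from the spec.

```python
import sqlite3

# Hypothetical schema: only the column names are taken from the spec.
# branch_name is marked "Doesn't exist" in the spec; keeping it as a
# nullable column here is an assumption.
SCHEMA = """
CREATE TABLE IF NOT EXISTS models (
    model              TEXT PRIMARY KEY,
    author_repo_id     TEXT,
    branch_name        TEXT,
    path_to_model_yaml TEXT
)
"""


def open_db(path=":memory:"):
    """Open (or create) the database and make sure the table exists."""
    conn = sqlite3.connect(path)
    conn.execute(SCHEMA)
    return conn


def add_model(conn, model, author_repo_id, branch_name, path_to_model_yaml):
    """Insert one pulled model; the primary key rejects duplicate Model IDs."""
    conn.execute(
        "INSERT INTO models VALUES (?, ?, ?, ?)",
        (model, author_repo_id, branch_name, path_to_model_yaml),
    )
    conn.commit()


def list_model_ids(conn):
    """Return every Model ID, i.e. the `model` column, in sorted order."""
    return [row[0] for row in conn.execute("SELECT model FROM models ORDER BY model")]
```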

The result of cortex models list will be simplified as follows:

$ cortex models list
| Index | Model ID                                                           |
|-------|---------------------------------------------------------------------|
| 1     | tinyllama:1b-gguf                                                   |
| 2     | tinyllama:1b-gguf                                                   |
| 3     | bartowski/Mistral-8b-instruct-gguf:Mistral-8b-instruct-8b.q4k_m     |
| 4     | mistral:7b                                                          |
| 5     | nvidia-cloud/Mistral-Nemo-12b:int4                                  |
| 6     | huggingface.co/bartowski/Mistral-8b-instruct-gguf:quant |

The engine will be inferred from each model's model.yml
We will read model.yml through path_to_model_yaml
Model ID in the cortex models list output = model field in the Models table of cortex.db
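
A rough sketch of inferring the engine from the contents of model.yml is shown below. It assumes the file has a top-level `engine:` key, which this issue does not confirm, and it scans lines by hand only to stay dependency-free; a real implementation would use a proper YAML parser. `infer_engine` is a hypothetical helper.

```python
def infer_engine(yaml_text, default=None):
    """Return the value of a top-level `engine:` key, or `default` if absent.

    Assumption: the key is named `engine` and sits at the top level
    (no indentation). Trailing comments and quotes are stripped.
    """
    for line in yaml_text.splitlines():
        if line.startswith("engine:"):
            value = line.split(":", 1)[1]
            value = value.split("#", 1)[0]  # drop a trailing comment
            value = value.strip().strip("'\"")
            return value or default
    return default


def infer_engine_from_path(path_to_model_yaml, default=None):
    """Read the file referenced by path_to_model_yaml and infer its engine."""
    with open(path_to_model_yaml, encoding="utf-8") as fh:
        return infer_engine(fh.read(), default)
```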

When running, we find the matching ID in the database. cortex models list could also accept an optional argument to filter the models shown:

$ cortex models list mis
| Index | Model ID                                                           |
|-------|---------------------------------------------------------------------|
| 1     | bartowski/Mistral-8b-instruct-gguf:Mistral-8b-instruct-8b.q4k_m     |
| 2     | mistral:7b                                                          |
| 3     | nvidia-cloud/Mistral-Nemo-12b:int4                                  |
| 4     | huggingface.co/bartowski/Mistral-8b-instruct-gguf:quant |
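
The filtering shown above can be sketched as follows. The query is treated as a regular expression and matched case-insensitively, which is an assumption drawn from the example (`mis` matches both `mistral` and `Mistral`); `filter_model_ids` and `render_table` are hypothetical names, and the table layout simply imitates the output in this issue.

```python
import re


def filter_model_ids(model_ids, query=None):
    """Keep the IDs that match `query`; with no query, keep everything."""
    if not query:
        return list(model_ids)
    # An invalid pattern raises re.error; how the CLI should report
    # that is left open here.
    pattern = re.compile(query, re.IGNORECASE)
    return [m for m in model_ids if pattern.search(m)]


def render_table(model_ids):
    """Render a two-column table, numbering rows from 1."""
    width = max([len("Model ID")] + [len(m) for m in model_ids])
    lines = [
        f"| Index | {'Model ID'.ljust(width)} |",
        f"|-------|{'-' * (width + 2)}|",
    ]
    for index, model_id in enumerate(model_ids, 1):
        lines.append(f"| {str(index).ljust(5)} | {model_id.ljust(width)} |")
    return "\n".join(lines)
```

Calling `render_table(filter_model_ids(ids, "mis"))` on the six IDs listed earlier would produce the four-row table above.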

2. `cortex run` with regex search (Deprecate model aliases)

If only 1 model matches, we run that model
If multiple models match, we show a menu for the user to choose from
If no arg is given, we show all models and let the user choose via the menu
Update the API to match this change
We no longer need the model alias field

Logic:

$ cortex models list
| Index | Model ID                                                           |
|-------|---------------------------------------------------------------------|
| 1     | tinyllama:1b-gguf                                                   |
| 2     | tinyllama:1b-gguf                                                   |
| 3     | bartowski/Mistral-8b-instruct-gguf:Mistral-8b-instruct-8b.q4k_m     |
| 4     | mistral:7b                                                          |
| 5     | nvidia-cloud/Mistral-Nemo-12b:int4                                  |
| 6     | huggingface.co/bartowski/Mistral-8b-instruct-gguf:quant |

$ cortex run mis
Please select an option:
1. bartowski/Mistral-8b-instruct-gguf:Mistral-8b-instruct-8b.q4k_m
2. huggingface.co/bartowski/Mistral-8b-instruct-gguf:quant
3. mistral:7b
4. nvidia-cloud/Mistral-Nemo-12b:int4
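
The selection rules in this section can be sketched as a small resolver: one match runs directly, several matches (or no argument) go to a menu. Everything here is illustrative: `resolve_run_target` and `prompt_menu` are hypothetical names, matching is assumed to be a case-insensitive regex search, and raising `LookupError` when nothing matches is an assumption because the spec does not cover that case.

```python
import re


def prompt_menu(options):
    """Print a numbered menu and return the option the user picks."""
    print("Please select an option:")
    for index, option in enumerate(options, 1):
        print(f"{index}. {option}")
    while True:
        answer = input("> ").strip()
        if answer.isdigit() and 1 <= int(answer) <= len(options):
            return options[int(answer) - 1]
        print(f"Enter a number between 1 and {len(options)}")


def resolve_run_target(model_ids, query=None, choose=prompt_menu):
    """Decide which model ID `cortex run [query]` should start."""
    if query:
        pattern = re.compile(query, re.IGNORECASE)
        matches = [m for m in model_ids if pattern.search(m)]
    else:
        # No argument: every model is a candidate.
        matches = list(model_ids)

    if not matches:
        # Assumption: the spec does not say what happens with no match.
        raise LookupError(f"no model matches {query!r}")
    if query and len(matches) == 1:
        return matches[0]  # exactly one match: run it without asking
    return choose(matches)  # several matches, or no argument: show the menu
```

Passing a custom `choose` callable keeps the decision logic testable without a terminal, for example `resolve_run_target(ids, "mis", choose=lambda options: options[0])`.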

Bug tracking

bug: Could not able to run models since 172 #1505 (wontfix; should be resolved by this)


gabrielle-ong · 2024-10-17T08:54:06Z

Questions (may be out of scope)

Is there any implication for model import?
e.g. importing a bartowski model gguf and naming it model_id = gabbyllama
What will the output of models list and the expected input of cortex run be?

$ cortex models import  --model_id gabbyllama --model_path /Users/gab/cortexcpp-nightly/models/cortex.so/gabllama/gguf/model.gguf

gabrielle-ong · 2024-10-17T08:55:40Z

@0xSage
X Large
Spec Status - Finalized
Sprint 23

namchuai · 2024-10-21T01:53:47Z

> Questions (may be out of scope)
>
> is there any implication on model import?
> eg importing a bartowski model gguf, and naming it model_id = gabbyllama
> what will the output of models list and expected input of cortex run be?
> $ cortex models import  --model_id gabbyllama --model_path /Users/gab/cortexcpp-nightly/models/cortex.so/gabllama/gguf/model.gguf

This should not affect model import. The model ID will still be the same as the user's input.

feat(#1512): simplify cortex run

gabrielle-ong · 2024-11-08T06:30:05Z

QA - nicely done, thanks @namchuai!

cortex models list shows all model IDs
cortex models list <substring> filters models by substring
cortex run <substring> filters models by substring

gabrielle-ong added the type: epic A major feature or initiative label Oct 17, 2024

gabrielle-ong assigned namchuai Oct 17, 2024

gabrielle-ong added this to Jan & Cortex Oct 17, 2024

github-project-automation bot moved this to Investigating in Jan & Cortex Oct 17, 2024

gabrielle-ong moved this from Investigating to Scheduled in Jan & Cortex Oct 17, 2024

gabrielle-ong added category: model management Model pull, yaml, model state category: model running Inference ux, handling context/parameters, runtime labels Oct 17, 2024

gabrielle-ong mentioned this issue Oct 17, 2024

discussion: Simplify Model Structures for Cortex #1410

Closed

gabrielle-ong mentioned this issue Oct 18, 2024

bug: Could not able to run models since 172 #1505

Closed

6 tasks

namchuai added a commit that referenced this issue Oct 20, 2024

feat(#1512): simplify cortex run

635162c

namchuai added a commit that referenced this issue Oct 20, 2024

feat(#1512): simplify cortex run

ec8511e

namchuai added a commit that referenced this issue Oct 20, 2024

feat(#1512): simplify cortex run

ed601d2

namchuai added a commit that referenced this issue Oct 20, 2024

feat(#1512): simplify cortex run

9265840

namchuai added a commit that referenced this issue Oct 21, 2024

feat(#1512): simplify cortex run

e2b58cf

namchuai mentioned this issue Oct 21, 2024

feat(#1512): simplify cortex run #1521

Merged

3 tasks

namchuai added a commit that referenced this issue Oct 21, 2024

feat(#1512): simplify cortex run

0aad82c

namchuai closed this as completed in #1521 Oct 21, 2024

namchuai added a commit that referenced this issue Oct 21, 2024

Merge pull request #1521 from janhq/j/simplified-cortex-run

ed7bcef

feat(#1512): simplify cortex run

github-project-automation bot moved this from In Review to Review + QA in Jan & Cortex Oct 21, 2024

namchuai added a commit that referenced this issue Oct 21, 2024

feat(#1512): simplify cortex run

06876be

gabrielle-ong added this to the v1.0.2 milestone Nov 5, 2024

gabrielle-ong mentioned this issue Nov 8, 2024

epic: QA Cortex v1.0.3 #1604

Open

gabrielle-ong moved this from Review + QA to Completed in Jan & Cortex Nov 8, 2024

gabrielle-ong modified the milestones: v1.0.2, v1.0.3 Nov 12, 2024
