
bug: cortex run flakiness #1272

Closed
0xSage opened this issue Sep 19, 2024 · 2 comments
Labels: category: model running (Inference ux, handling context/parameters, runtime) · type: bug (Something isn't working)
0xSage commented Sep 19, 2024

Problem Statement

1. `cortex run mistral`

   ```
   Model loaded!
   Inorder to exit, type `exit()`
   ```

2. Chat a bit and quit the interactive shell.
3. Run it again: `sudo cortex run mistral`

   ```
   Starting server ...
   20240919 15:54:32.723277 UTC 33913 INFO  Host: 127.0.0.1 Port: 3928 - main.cc:32
   Server started
   Model loaded!
   Inorder to exit, type `exit()`
   ```

4. Model loads & chats just fine.

Expected

Expected: the same stdout experience as the first time I invoked `cortex run`.
But I got: "Starting server ..." on every subsequent invocation.
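One way a CLI could avoid re-printing "Starting server ..." is to probe the server port before launching it. This is purely an illustrative sketch, not Cortex's actual implementation; the host and port come from the log line above (127.0.0.1:3928).

```python
import socket

def server_running(host: str = "127.0.0.1", port: int = 3928,
                   timeout: float = 0.5) -> bool:
    """Return True if something is already listening on (host, port).

    A CLI could call this before spawning the server process, and skip
    the "Starting server ..." step when it returns True.
    """
    try:
        # create_connection raises OSError if nothing accepts the connection
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

A TCP connect check like this is cheap, but a real CLI would more likely hit a health endpoint so it can distinguish "port taken by something else" from "our server is up".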

Questions

What could be causing this inconsistency in the run step?

OS: Mac

@0xSage 0xSage added good first issue Good for newcomers category: model running Inference ux, handling context/parameters, runtime P3: nice to have Nice to have feature labels Sep 19, 2024
@0xSage 0xSage changed the title idea: when loading models, stdout "loading model into v/ram" bug: cortex run flakiness Sep 19, 2024
@0xSage 0xSage added type: bug Something isn't working P0: critical Mission critical and removed P3: nice to have Nice to have feature good first issue Good for newcomers labels Sep 19, 2024
@0xSage 0xSage moved this to Need Investigation in Jan & Cortex Sep 19, 2024
@0xSage 0xSage removed the P0: critical Mission critical label Sep 19, 2024
@vansangpfiev vansangpfiev self-assigned this Sep 20, 2024
@vansangpfiev
Contributor

`cortex run` is a command chain that includes:

  • cortex engines install
  • cortex pull
  • cortex start
  • cortex models start
  • cortex chat

In this situation, `cortex models start` doesn't work correctly: the CLI doesn't check whether the model is already loaded before starting it, and the engine doesn't ignore a start request for a model that is already running. The issue will be fixed on the CLI side first, and then on the engine side.
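The CLI-side fix described above amounts to making the model-start step idempotent. A minimal sketch of that guard logic, with hypothetical names (`start_model`, `loaded_models`, `start_fn` are illustrative, not Cortex APIs):

```python
def start_model(model_id: str, loaded_models: set, start_fn) -> str:
    """Start a model only if it is not already loaded.

    loaded_models: set of model ids the CLI believes are running
                   (e.g. populated from a models-list query).
    start_fn:      callable that actually asks the engine to load the model.
    Returns "already_loaded" or "started" so the caller can decide
    what to print (and skip the "Starting server ..." banner).
    """
    if model_id in loaded_models:
        # Second invocation: nothing to do, engine is left untouched.
        return "already_loaded"
    start_fn(model_id)
    loaded_models.add(model_id)
    return "started"
```

With a guard like this, a repeated `cortex run mistral` would take the "already_loaded" path and reproduce the first-run stdout experience.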

0xSage commented Sep 21, 2024

seems fixed on v75, nice job!

@0xSage 0xSage closed this as completed Sep 21, 2024
@github-project-automation github-project-automation bot moved this from Need Investigation to Completed in Jan & Cortex Sep 21, 2024
@gabrielle-ong gabrielle-ong added this to the v1.0.0 milestone Oct 3, 2024