
bug: Missing NGL Setting for Vision Models #1763

Closed
3 tasks
imtuyethan opened this issue Dec 3, 2024 · 2 comments
Assignees
Labels
category: tools RAG, function calling, etc type: bug Something isn't working

Comments

@imtuyethan
Contributor

Jan version

0.5.10

Describe the Bug

https://discord.com/channels/1107178041848909847/1307915350406467664/1313079041821249556
The vision model configuration in model.json lacks an NGL (number of GPU layers) setting, which is needed to properly manage GPU resource allocation when running multiple models simultaneously (text model + vision/projector model). This causes uncertainty about proper GPU layer allocation between models.

Steps to Reproduce

  1. Load a vision model with the following settings:
"settings": {
    "vision_model": true,
    "text_model": false,
    "ctx_len": 4096,
    "prompt_template": "\n### Instruction:\n{prompt}\n### Response:\n",
    "llama_model_path": "llava-v1.6-mistral-7b.Q4_K_M.gguf",
    "mmproj": "mmproj-model-f16.gguf"
}
  2. Notice that no NGL setting is present, so it is unclear how GPU layers are allocated between the two models.
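For illustration, the settings block above could carry an explicit NGL value. This is a hypothetical sketch: the field name `ngl` is assumed from llama.cpp's `-ngl`/`n_gpu_layers` option, and Jan's actual model.json schema may differ.

```json
"settings": {
    "vision_model": true,
    "text_model": false,
    "ctx_len": 4096,
    "ngl": 33,
    "prompt_template": "\n### Instruction:\n{prompt}\n### Response:\n",
    "llama_model_path": "llava-v1.6-mistral-7b.Q4_K_M.gguf",
    "mmproj": "mmproj-model-f16.gguf"
}
```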

Screenshots / Logs

[Four screenshots attached, taken 2024-12-03 at 9:48:08, 9:48:13, 9:48:18, and 9:48:24 AM]

What is your OS?

  • macOS
  • Windows
  • Linux
@imtuyethan imtuyethan added the type: bug Something isn't working label Dec 3, 2024
@imtuyethan imtuyethan added the category: tools RAG, function calling, etc label Dec 3, 2024
@imtuyethan imtuyethan transferred this issue from janhq/jan Dec 3, 2024
@louis-jan
Contributor

We can check this after the vision model support PR lands.

@louis-jan
Contributor

This one is really about vision model support, which is tracked in #1493. There will be an action item there to pull the correct NGL from the chat model (not the projector model). I tried it, and it noticeably sped things up. For now we'll close this issue and defer to the original epic as the action item, since we've temporarily fixed the listed models on the model hub.
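The resolution described above, taking NGL from the chat model rather than the projector, could be sketched as follows. This is a minimal, hypothetical helper (`resolveNgl` and the `ngl` field are assumptions; Jan's actual loader code differs):

```typescript
// Shape of the relevant model.json settings (subset, assumed).
interface ModelSettings {
  llama_model_path?: string;
  mmproj?: string;
  ngl?: number;
  ctx_len?: number;
}

// The projector (mmproj) has no meaningful NGL of its own; GPU layer
// offload is governed by the chat model, so read NGL from its settings
// and fall back to a default when the field is absent.
function resolveNgl(chatModel: ModelSettings, fallback: number = 100): number {
  return chatModel.ngl ?? fallback;
}

// Example: explicit NGL on the chat model wins; projector settings are ignored.
const chat: ModelSettings = { llama_model_path: "chat-model.gguf", ngl: 33 };
const projector: ModelSettings = { mmproj: "mmproj-model-f16.gguf" };
console.log(resolveNgl(chat));      // 33
console.log(resolveNgl(projector)); // 100 (projector carries no ngl)
```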

Labels
category: tools RAG, function calling, etc type: bug Something isn't working
Projects
Status: QA
Development

No branches or pull requests

4 participants