You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
https://discord.com/channels/1107178041848909847/1307915350406467664/1313079041821249556
The vision model configuration in model.json lacks an NGL (number of GPU layers) setting, which is needed to properly manage GPU resource allocation when running multiple models simultaneously (text model + vision/projector model). This causes uncertainty about proper GPU layer allocation between models.
On this one, it's more about supporting vision models which actually here #1493. There will be an action item to pull correct NGL from Chat model (not projector model). I tried it, and it really sped up. For now we will close this one and refer to the original epic as an action item, since now we temporarily fixed the listed models on the model hub.
Jan version
0.5.10
Describe the Bug
https://discord.com/channels/1107178041848909847/1307915350406467664/1313079041821249556
The vision model configuration in model.json lacks an NGL (number of GPU layers) setting, which is needed to properly manage GPU resource allocation when running multiple models simultaneously (text model + vision/projector model). This causes uncertainty about proper GPU layer allocation between models.
Steps to Reproduce
Screenshots / Logs
What is your OS?
The text was updated successfully, but these errors were encountered: