MacOS Metal --gpulayers setting question #598

wilsonics · 2024-01-04T17:18:17Z

wilsonics
Jan 4, 2024

Hi everyone,

I have a question about the --gpulayers number and how to determine what the proper number is. I have been using --gpulayers 1 and it works fine for now, but I would like to know if there is a proper way to determine what it actually SHOULD be.

I'm using this as my command to start koboldcpp right now
python3 koboldcpp.py --noblas --smartcontext --contextsize 4096 --threads 10 --gpulayers 1 --model (modelhere)

it works great, no problems, just looking to see if I can grind out any more performance on it.
I have a MacMini M2 Pro (10 cores) with 16gb RAM.

I've found out at this time I can only load up 7B files with acceptable response speeds. The 13B files will load, but take forever (and a day) to respond to any chats. I'm sure that's more of a RAM problem though.

Thanks in advance.

Answered by LostRuins

Jan 5, 2024

--gpulayers 1 literally just uses a single layer from the model, which isn't going to be very much.

Each model has a different number of layers. A 7B model typically has about 32 layers, a 13B model has about 43, and a 70B model may have almost 80 layers. The best way to determine how many layers you can offload is by trial and error, specifically picking a value and seeing how much VRAM you have used once its loaded. If you exceed, the program will close and you can try again with a lower value.

View full answer

LostRuins · 2024-01-05T09:43:24Z

LostRuins
Jan 5, 2024
Maintainer

--gpulayers 1 literally just uses a single layer from the model, which isn't going to be very much.

Each model has a different number of layers. A 7B model typically has about 32 layers, a 13B model has about 43, and a 70B model may have almost 80 layers. The best way to determine how many layers you can offload is by trial and error, specifically picking a value and seeing how much VRAM you have used once its loaded. If you exceed, the program will close and you can try again with a lower value.

1 reply

wilsonics Jan 5, 2024
Author

Thanks, that's kind of what I figured. I'll play around on my end until to see what works. I appreciate your help.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MacOS Metal --gpulayers setting question #598

{{title}}

Replies: 1 comment 1 reply

{{title}}

{{title}}

Select a reply

MacOS Metal --gpulayers setting question #598

wilsonics Jan 4, 2024

Replies: 1 comment · 1 reply

LostRuins Jan 5, 2024 Maintainer

wilsonics Jan 5, 2024 Author

wilsonics
Jan 4, 2024

Replies: 1 comment 1 reply

LostRuins
Jan 5, 2024
Maintainer

wilsonics Jan 5, 2024
Author