Skip to content

MacOS Metal --gpulayers setting question #598

Closed Answered by LostRuins
wilsonics asked this question in Q&A
Discussion options

You must be logged in to vote

--gpulayers 1 literally just uses a single layer from the model, which isn't going to be very much.

Each model has a different number of layers. A 7B model typically has about 32 layers, a 13B model has about 43, and a 70B model may have almost 80 layers. The best way to determine how many layers you can offload is by trial and error, specifically picking a value and seeing how much VRAM you have used once its loaded. If you exceed, the program will close and you can try again with a lower value.

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@wilsonics
Comment options

Answer selected by wilsonics
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants