Llama.cpp 30B runs with only 6GB of RAM now #337

Open
1Mark opened this issue Apr 1, 2023 · 1 comment
Comments


1Mark commented Apr 1, 2023

See this link for further details: https://news.ycombinator.com/item?id=35393284

Is there anything that needs to be done in order to benefit from this?


ghost commented Apr 3, 2023

Just FYI: you won't actually be able to run 30B LLaMA with only 6GB of RAM. With mmap, the model's memory usage is simply counted as the kernel's file cache instead of the process's own memory. See https://news.ycombinator.com/item?id=35395739 and the answers below it.
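
For context, here is a minimal C sketch of the mechanism being described. It is not llama.cpp's actual loader, just an illustration on a POSIX system (the model file path is whatever you pass on the command line): the file is mapped read-only, no physical RAM is committed up front, and pages are faulted in on first access into the kernel's page cache, where they count as file cache and can be evicted under memory pressure.

```c
/* Minimal sketch (not llama.cpp's actual loader) of why mmap'd model
 * weights show up as kernel file cache rather than process memory. */
#include <fcntl.h>
#include <stdio.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <unistd.h>

int main(int argc, char **argv) {
    if (argc != 2) {
        fprintf(stderr, "usage: %s <model-file>\n", argv[0]);
        return 1;
    }

    int fd = open(argv[1], O_RDONLY);
    if (fd < 0) { perror("open"); return 1; }

    struct stat st;
    if (fstat(fd, &st) != 0) { perror("fstat"); return 1; }

    /* Map the whole file read-only. Nothing is read from disk yet:
     * pages are demand-paged on first access and live in the kernel's
     * page cache, so they are evictable under memory pressure. */
    void *weights = mmap(NULL, st.st_size, PROT_READ, MAP_SHARED, fd, 0);
    if (weights == MAP_FAILED) { perror("mmap"); return 1; }
    close(fd); /* the mapping stays valid after closing the fd */

    /* Touch one byte per page to simulate inference reading the
     * weights; only the pages actually touched get faulted in. */
    long page = sysconf(_SC_PAGESIZE);
    volatile unsigned char sum = 0;
    for (off_t off = 0; off < st.st_size; off += page)
        sum += ((const unsigned char *)weights)[off];

    printf("mapped %lld bytes, checksum byte: %u\n",
           (long long)st.st_size, (unsigned)sum);

    munmap(weights, st.st_size);
    return 0;
}
```

Because the mapping is read-only and file-backed, those pages are "clean" and the kernel can drop them at any time instead of swapping, which is why tools report low process memory. The model's working set still has to fit in RAM for inference to run at a reasonable speed; otherwise the kernel keeps re-reading evicted pages from disk.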
