Replies: 6 comments 21 replies
-
It runs on the Steam Deck, so why not.
-
AFAIK, a maximum of 4GB of system RAM can be shared with an AMD APU's integrated GPU. If 16GB were possible, I would immediately order a Framework Ryzen laptop. Please update if you learn anything different. EDIT: 16GB of VRAM (UMA Frame Buffer Size in the BIOS) is possible on some manufacturers' BIOSes with at least a 680M/780M (Zen 3/Zen 4).
-
I spent some time using setup_var to change my iGPU memory size to 8GB. Sadly, the speed is not as good as I expected, especially considering the OpenBLAS vs. CLBlast speed difference: the 7B eval time on the GPU with the CLBlast version and -ngl 1000 is almost the same as on the CPU, and yes, the max memory used is around 5.6. 😂
-
If there is a lot of interest in iGPUs, it might be worth creating a zero-copy GPU implementation. That's only possible on iGPUs, since they share main memory with the CPU.
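To make the idea concrete, here is a rough CPU-only analogy in Python. It is not llama.cpp or GPU code, and the function names are made up for illustration. It only shows the difference between handing the "device" its own copy of the weights and handing it a view of the same bytes, which is what shared memory would allow.

```python
def upload_with_copy(host_weights: bytearray) -> bytearray:
    """Discrete-GPU style (hypothetical helper): the device gets its own copy."""
    return bytearray(host_weights)


def upload_zero_copy(host_weights: bytearray) -> memoryview:
    """Shared-memory style (hypothetical helper): the device sees the same bytes."""
    return memoryview(host_weights)


# Pretend these four bytes are model weights sitting in main memory.
host = bytearray(b"\x01\x02\x03\x04")

copied = upload_with_copy(host)   # costs extra memory and a transfer
shared = upload_zero_copy(host)   # no extra memory, no transfer

# A later change on the host side is visible only through the shared view.
host[0] = 0xFF
```

With the copy, the weights exist twice and the two buffers can drift apart. With the view, there is a single buffer, so nothing has to be transferred or duplicated.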
-
Today I ran some tests on Windows, and I thought the result would also apply on Linux: just reducing the number of CPU threads increases the output speed. Update: it is not the same on Linux, which is still too slow to use. 13B (interactive mode). Update 2: I failed to reproduce the Windows speed. I assume it's the same problem as with Intel P/E cores... 30B: q4_0 with 60 layers only takes 19GB of VRAM (plus BLAS).
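As a rough sanity check on the 19GB figure, here is a back-of-the-envelope estimate in Python. It rests on two assumptions that are not from this thread: that a q4_0 block stores 32 weights as 4-bit values plus one 16-bit scale (18 bytes per block), and that the "30B" model has roughly 32.5 billion parameters. It counts weights only and ignores the KV cache, scratch buffers, and any tensors kept at higher precision.

```python
# Assumed q4_0 block layout: 32 weights at 4 bits each, plus one 16-bit scale.
WEIGHTS_PER_BLOCK = 32
BLOCK_BYTES = WEIGHTS_PER_BLOCK * 4 // 8 + 2  # 16 bytes of nibbles + 2 bytes of scale


def q4_0_weight_bytes(n_params: int) -> int:
    """Bytes needed to store n_params weights under the assumed q4_0 layout."""
    n_blocks = -(-n_params // WEIGHTS_PER_BLOCK)  # ceiling division
    return n_blocks * BLOCK_BYTES


# Assumption: approximate parameter count of the "30B" model.
N_PARAMS_30B = 32_500_000_000

estimate_gb = q4_0_weight_bytes(N_PARAMS_30B) / 1e9
print(f"estimated q4_0 weights: {estimate_gb:.1f} GB")
```

Under these assumptions the weights alone come to a little over 18 GB, which is in the same ballpark as the 19GB reported above once the extra buffers are added.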
-
May I know whether there is currently an iGPU zero-copy implementation in llama.cpp?
-
Allocate a large amount of dedicated VRAM to the AMD GPU.
As we know, the 680M in the 6700H is close to a 2050.
This may be the cheapest way to do anything 😅😂