Bug: RPC server doesn't load GPU if I use Vulkan #8536

Closed
metal3d opened this issue Jul 17, 2024 · 5 comments · Fixed by #9714
Assignees: rgerganov
Labels: bug-unconfirmed, low severity (used to report low severity bugs in llama.cpp, e.g. cosmetic issues, non-critical UI glitches)

Comments

@metal3d (Contributor) commented Jul 17, 2024

What happened?

I compiled llama.cpp with the Vulkan backend. The rpc-server binary is linked against libvulkan, but it never uses my GPUs, whereas llama-cli works fine.

Name and Version

version: 3384 (4e24cff)
built with cc (GCC) 14.1.1 20240701 (Red Hat 14.1.1-7) for x86_64-redhat-linux

What operating system are you seeing the problem on?

Linux

Relevant log output

./rpc-server
create_backend: using CPU backend
Starting RPC server on 0.0.0.0:50052, backend memory: 23967 MB


ldd ./rpc-server
        linux-vdso.so.1 (0x00007f18759f2000)
        libllama.so => /home/metal3d/Projects/ML/llama.cpp/build-rpc/src/libllama.so (0x00007f1875879000)
        libggml.so => /home/metal3d/Projects/ML/llama.cpp/build-rpc/ggml/src/libggml.so (0x00007f1875400000)
        libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f1875000000)
        libm.so.6 => /lib64/libm.so.6 (0x00007f187531c000)
        libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f187582b000)
        libc.so.6 => /lib64/libc.so.6 (0x00007f1874e0f000)
        /lib64/ld-linux-x86-64.so.2 (0x00007f18759f4000)
        libvulkan.so.1 => /lib64/libvulkan.so.1 (0x00007f18757af000)
        libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f18752c6000)
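
The log above shows why the report is confusing at first glance: rpc-server links libvulkan only transitively through libggml, but the server still has to initialize one backend at startup. A minimal sketch of that selection pattern, assuming the usual preprocessor-guarded structure (ggml_backend_cuda_init, ggml_backend_vk_init and ggml_backend_cpu_init are real ggml entry points; the surrounding logic is an illustration, not the actual rpc-server source):

// Illustrative sketch only (assumption, not the actual rpc-server code):
// the server initializes exactly one backend at startup and falls back to
// the CPU backend when no GPU backend is compiled in or selected.
#include <cstdio>
#include "ggml-backend.h"
#ifdef GGML_USE_CUDA
#include "ggml-cuda.h"
#endif
#ifdef GGML_USE_VULKAN
#include "ggml-vulkan.h"
#endif

static ggml_backend_t create_backend() {
#ifdef GGML_USE_CUDA
    fprintf(stderr, "%s: using CUDA backend\n", __func__);
    return ggml_backend_cuda_init(0);   // device 0
#elif defined(GGML_USE_VULKAN)
    fprintf(stderr, "%s: using Vulkan backend\n", __func__);
    return ggml_backend_vk_init(0);     // what a Vulkan path could look like once supported
#else
    fprintf(stderr, "%s: using CPU backend\n", __func__);
    return ggml_backend_cpu_init();
#endif
}
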
metal3d added the bug-unconfirmed and low severity labels on Jul 17, 2024
@rgerganov (Collaborator) commented:

The Vulkan backend uses the tensor->extra property, which is not supported by the RPC backend. The same issue exists with the SYCL backend (PR #7682).
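
To make the limitation concrete: the RPC backend marshals tensors over a socket, so only plain, position-independent fields can be sent to the remote server. A rough sketch of what such a wire representation looks like (a simplified illustration, not the actual ggml-rpc serialization format):

#include <cstdint>

// Simplified illustration of an RPC tensor descriptor (the field set is an
// assumption, not the real ggml-rpc struct). Only plain data survives serialization.
struct rpc_tensor_sketch {
    uint64_t id;           // remote handle identifying the tensor
    uint32_t type;         // ggml data type
    uint64_t ne[4];        // number of elements per dimension
    uint64_t nb[4];        // strides in bytes
    uint64_t buffer;       // remote buffer handle
    uint64_t data_offset;  // offset of the tensor data inside that buffer
    // There is no way to carry tensor->extra: it is a host pointer to
    // backend-private state and is meaningless in the remote process.
};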

@xvim commented Sep 4, 2024

Is there any plan to support Vulkan when using the RPC backend?

@rgerganov (Collaborator) commented:

I will try to find a way to avoid using tensor->extra in Vulkan, perhaps by adding a global map from ggml_tensor to ggml_tensor_extra_gpu.
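
A rough sketch of that suggestion (hypothetical; as the next comment explains, the merged fix removed the extras instead): keep the per-tensor GPU state in a map owned by the Vulkan backend rather than hanging it off tensor->extra, so the tensor struct itself stays free of backend-private pointers.

#include <unordered_map>

struct ggml_tensor;            // from ggml.h
struct ggml_tensor_extra_gpu;  // Vulkan-backend-private per-tensor state

// Hypothetical global map replacing tensor->extra (sketch of the idea only).
static std::unordered_map<const ggml_tensor *, ggml_tensor_extra_gpu *> g_vk_extras;

static ggml_tensor_extra_gpu * vk_get_extra(const ggml_tensor * t) {
    auto it = g_vk_extras.find(t);
    return it == g_vk_extras.end() ? nullptr : it->second;
}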

rgerganov self-assigned this on Sep 5, 2024
@slaren (Collaborator) commented Sep 5, 2024

The extras in the Vulkan backend are not really necessary: all the data they contain is already present (directly or indirectly) in other fields of the tensor. At this point I think they are only there for legacy reasons, but they could be removed with a refactor.
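
In other words, whatever the extra stored can be recomputed from fields the tensor already carries. A minimal sketch of the idea, assuming the extra mainly held the device buffer and the byte offset into it (this illustrates the reasoning, not the code merged in #9714):

#include <cstdint>
#include "ggml-backend.h"

// The byte offset of a tensor inside its backend buffer can be derived from
// tensor->data and the buffer's base address, so no backend-private extra is needed.
static size_t vk_tensor_offset(const struct ggml_tensor * t) {
    uint8_t * base = (uint8_t *) ggml_backend_buffer_get_base(t->buffer);
    return (uint8_t *) t->data - base;
}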

rgerganov added a commit to rgerganov/llama.cpp that referenced this issue Sep 10, 2024
This patch allows using the Vulkan backend with the RPC backend as
tensor->extra is no longer used.

Ref: ggerganov#8536
ggerganov pushed a commit that referenced this issue Oct 2, 2024
* vulkan : do not use tensor->extra

This patch allows using the Vulkan backend with the RPC backend as
tensor->extra is no longer used.

Ref: #8536

* Adapt GGML_VULKAN_CHECK_RESULTS to extra removal (#2)

---------

Co-authored-by: 0cc4m <picard12@live.de>
rgerganov added a commit to rgerganov/llama.cpp that referenced this issue Oct 2, 2024
rgerganov mentioned this issue on Oct 2, 2024
@metal3d (Contributor, Author) commented Oct 3, 2024 via email

dsx1986 pushed a commit to dsx1986/llama.cpp that referenced this issue Oct 29, 2024
dsx1986 pushed a commit to dsx1986/llama.cpp that referenced this issue Oct 29, 2024