Use GPU / system onnxruntime for inference on arch linux #131

Open
luc-caspar opened this issue Jul 9, 2024 · 3 comments

@luc-caspar

Following the instructions in the README, I have compiled the plugin from source and installed the necessary files in the required locations.
The plugin is recognized by OBS, and I can add it as a filter for video or audio sources. However, whenever I do so, the logs indicate that both models (transcription and translation) are using the CPU for inference. Given the limited compute capacity of my computer, this means I get a new line of captions roughly every 15 seconds.
Therefore, I was wondering if there is a way to force the plugin to use the GPU instead.
I have tried to use the system's onnxruntime as a workaround, but the CMake configuration step keeps failing, even when I manually provide the path to the onnxruntime include/lib folders.
Any help with this issue would be greatly appreciated.
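
For reference, the failing configure attempt looked roughly like the sketch below; the Onnxruntime_* cache variable names are my guesses at placeholders rather than options documented by the plugin:

```bash
# Hypothetical invocation: the Onnxruntime_INCLUDE_DIR / Onnxruntime_LIBRARY
# cache variables are placeholders, not flags documented by this plugin.
cmake -B build -DCMAKE_BUILD_TYPE=Release \
  -DOnnxruntime_INCLUDE_DIR=/usr/include/onnxruntime \
  -DOnnxruntime_LIBRARY=/usr/lib/libonnxruntime.so
cmake --build build
```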

@royshil (Collaborator) commented Jul 9, 2024

Which OS are you on?
Do you have a GPU?

@luc-caspar (Author)

I am using Arch Linux.
Although it is not a powerful one, I do have a GPU, with the NVIDIA drivers installed.

@jitingcn

Regarding the compilation options for Linux: the GGML_CUDA=1 build flag is missing, which leads to whisper.cpp not being built with GPU support. I encountered a linking error after attempting to add the flag and recompiling:

```
/usr/bin/ld: Whispercpp_Build-prefix/lib/static/libwhisper.a(ggml-cuda.cu.o): warning: relocation against `_ZNSt3mapISt5arrayIfLm16EE24ggml_backend_buffer_typeSt4lessIS1_ESaISt4pairIKS1_S2_EEED1Ev' in read-only section `.text'
/usr/bin/ld: Whispercpp_Build-prefix/lib/static/libwhisper.a(ggml-cuda.cu.o): relocation R_X86_64_PC32 against symbol `stdout@@GLIBC_2.2.5' can not be used when making a shared object; recompile with -fPIC
```
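
One thing that might help with the relocation errors above is rebuilding whisper.cpp with CUDA enabled and position-independent code, so the static archive can be linked into the plugin's shared object. This is only a sketch using generic CMake options, not anything specific to this plugin's build:

```bash
# Sketch: rebuild whisper.cpp with CUDA and PIC objects to address the
# "recompile with -fPIC" relocation error. GGML_CUDA is whisper.cpp's
# CUDA switch; CMAKE_POSITION_INDEPENDENT_CODE and the -Xcompiler=-fPIC
# CUDA flag are generic CMake/nvcc options, not plugin-specific flags.
cmake -B build \
  -DGGML_CUDA=1 \
  -DCMAKE_POSITION_INDEPENDENT_CODE=ON \
  -DCMAKE_CUDA_FLAGS="-Xcompiler=-fPIC"
cmake --build build --config Release
```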
