
Successfully loaded libtensorflow in Node.js, but couldn't load GPU. Make sure CUDA Toolkit and cuDNN are installed and accessible, or turn off GPU mode. #1060

Open
remz1337 opened this issue Dec 22, 2023 · 11 comments
Labels
bug Something isn't working priority: normal

Comments

@remz1337

Which version of recognize are you using?

5.0.3

Enabled Modes

Face recognition

TensorFlow mode

GPU mode

Downstream App

Memories App

Which Nextcloud version do you have installed?

27.1.5

Which Operating system do you have installed?

Ubuntu 22.04

Which database are you running Nextcloud on?

Postgres 14.10

Which Docker container are you using to run Nextcloud? (if applicable)

N/A

How much RAM does your server have?

4 GB

What processor Architecture does your CPU have?

x86_64

Describe the Bug

This is minor, but the Recognize admin panel tells me no GPU was found, even though everything seems to be working fine (I can see the recognize/bin/node process running on my GPU in nvidia-smi). Not sure if this is normal, but although I see the process on my GPU, my CPU usage is also way up.

The exact warning appears in the NodeJS section of the admin panel:
Successfully loaded libtensorflow in Node.js, but couldn't load GPU. Make sure CUDA Toolkit and cuDNN are installed and accessible, or turn off GPU mode.

More info: Proxmox 7.2, Nextcloud in an LXC container with the GPU successfully passed through (this was already set up for ffmpeg processing in the Memories app). I installed the CUDA and cuDNN libraries per the recommended instructions (`pip install tensorflow[and-cuda]`), and Python finds my GPU.
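For anyone debugging the same warning: the Node.js libtensorflow binding dlopen()s the CUDA and cuDNN shared libraries at runtime, so the warning usually means one of them is not resolvable on the loader path, even if Python's TensorFlow (which bundles its own copies via `tensorflow[and-cuda]`) works. A minimal probe; the exact sonames below are an assumption based on the CUDA 11 / cuDNN 8 toolchain that recognize currently targets:

```python
import ctypes

# Shared libraries a CUDA 11 build of libtensorflow typically dlopen()s at
# runtime. The sonames are an assumption (CUDA 11 / cuDNN 8 toolchain); if any
# prints "missing", recognize falls back to CPU and shows the GPU warning.
CUDA_LIBS = ["libcudart.so.11.0", "libcublas.so.11", "libcudnn.so.8"]

def probe(soname):
    """Return 'ok' if the shared library can be dlopen()ed, else 'missing'."""
    try:
        ctypes.CDLL(soname)
        return "ok"
    except OSError:
        return "missing"

if __name__ == "__main__":
    for soname in CUDA_LIBS:
        print(f"{soname}: {probe(soname)}")
```

Note that `pip install tensorflow[and-cuda]` places its CUDA libraries inside the Python site-packages tree, where a separate Node.js process will not find them, which would explain Python seeing the GPU while recognize does not.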

Expected Behavior

If everything is working fine and using my GPU, there shouldn't be a warning about the GPU not being found.

To Reproduce

Not sure; it's probably something to do with my setup. If you can point me to where to look, I can provide more logs that may help.

Debug log

No response

remz1337 added the bug label Dec 22, 2023

Hello 👋

Thank you for taking the time to open this issue with recognize. I know it's frustrating when software
causes problems. You have made the right choice to come here and open an issue to make sure your problem gets looked at
and if possible solved.
I try to answer all issues and if possible fix all bugs here, but it sometimes takes a while until I get to it.
Until then, please be patient.
Note also that GitHub is a place where people meet to make software better together. Nobody here is under any obligation
to help you, solve your problems or deliver on any expectations or demands you may have, but if enough people come together we can
collaborate to make this software better. For everyone.
Thus, if you can, you could also look at other issues to see whether you can help other people with your knowledge
and experience. If you have coding experience it would also be awesome if you could step up to dive into the code and
try to fix the odd bug yourself. Everyone will be thankful for extra helping hands!
One last word: if you feel, at any point, like you need to vent, this is not the place for it; you can go to the forum, to Twitter, or somewhere else. This is a technical issue tracker, so please make sure to
focus on the tech and keep your opinions to yourself. (Also see our Code of Conduct. Really.)

I look forward to working with you on this issue
Cheers 💙

@NikitaKorneev

I have the same issue. I think I followed all the instructions regarding drivers and the CUDA/cuDNN requirements.

@marcelklehr
Member

Are you using CUDA 12 or CUDA 11? I believe we currently only support CUDA 11

@remz1337
Author

Indeed, CUDA 12. The app is still working, though; it's just that warning message that seems to be the issue.

@marcelklehr
Member

I think it falls back to CPU if GPU can't be loaded

@remz1337
Author

But I can see the recognize/bin/node process running on my GPU using nvidia-smi

@marcelklehr
Member

huh

@Mikec78660

Wondering if it is still the case that CUDA 12 is not supported? I have:
Driver Version: 560.28.03, CUDA Version: 12.6

I have the same warning message when enabling GPU mode. I get a process on the GPU of a few hundred megabytes when I start a scan, but no GPU utilization from that process.

@NikitaKorneev

> Wondering if it is still the case that CUDA 12 is not supported? I have: Driver Version: 560.28.03, CUDA Version: 12.6
>
> I have the same warning message when enabling GPU mode. I get a process on the GPU of a few hundred megabytes when I start a scan, but no GPU utilization from that process.

There is something really wrong with this integration, and I don't know whether the maintainers are on it...

@macka849

I had the same issue, and I have sorted it, but with some caveats. Firstly, I am on Ubuntu Server 22.04, as this was the latest server release when the program was written, and it does not appear to have been updated since then. More on that later.

My GPU has CUDA compute capability 5.2, which is not directly supported by the precompiled TensorFlow binaries, so I had to compile my own. Seven hours on a Xeon E3 v2, and that was the successful attempt.

I am on the latest NVIDIA GPU and CUDA drivers. After installing the CUDA driver from NVIDIA's .run file, I had to manually link some libraries, which is detailed by the installer at the end of the CUDA driver install. The nvidia-fs kernel module part always failed, but it doesn't seem necessary; my GPU may not be compatible with it.

Anyway, after all of that, and after confirming that TensorFlow was working with the GPU per the TensorFlow website, recognize still failed.

I found test_gputensorflow.js in /nextcloud/apps/recognize/src and ran it manually from that folder: `sudo node test_gputensorflow.js`

The output indicated it was looking for libcudnn.so.8. Ubuntu has moved on to libcudnn9 in the official repositories, but there is a way to manually install the older version, which I found here:

https://stackoverflow.com/questions/66977227/could-not-load-dynamic-library-libcudnn-so-8-when-running-tensorflow-on-ubun

The guide is for Ubuntu 20.04, but I did some digging around the NVIDIA archive and found a libcudnn8 package for Ubuntu 22.04. Sadly, they did not have libcudnn8 in the Ubuntu 24.04 folder, so it looks like I'm stuck on Ubuntu 22.04 until recognize is updated for libcudnn9.
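For reference, the Stack Overflow guide above essentially boils down to adding NVIDIA's cuda-keyring apt repository for your Ubuntu release and installing a pinned libcudnn8 package. Once installed, you can confirm the dynamic loader sees the soname libtensorflow asks for; a minimal check, assuming libcudnn.so.8 is the soname recognize's build expects:

```shell
# Check whether the dynamic loader can resolve cuDNN 8 (libcudnn.so.8 is the
# soname the error above refers to). If this prints "missing", install the
# libcudnn8 package from NVIDIA's repository and re-run `sudo ldconfig`.
if ldconfig -p | grep -q 'libcudnn\.so\.8'; then
  status="found"
else
  status="missing"
fi
echo "libcudnn.so.8: $status"
```

The check reads the ldconfig cache rather than the filesystem, so it also catches the case where the library file exists but its directory was never added to the loader path.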

In any case, I installed it, the message went away, and the recognize job I had running found tenth gear and took off like a Ferrari in a tank race.
nvidia-smi showed 100% utilization by the process.

I hope this helps others out there who are trying to get this working. I'm going to sleep now.

@bugsyb

bugsyb commented Dec 8, 2024

Question: would you mind checking your logs to see whether movinet recognition is working properly and classifying videos?

I'm running into:
#1122

It would be great if you could check whether, after the messages showing that ffmpeg has finished extracting frames:

Classifier process output: decoded 60/60 images

classification actually happens, or whether you get errors as in my case:

Classifier process output: 2024-12-08 10:15:46.966028: W tensorflow/core/framework/op_kernel.cc:1745] OP_REQUIRES failed at xla_ops.cc:296 : NOT_FOUND: could not find registered platform with id: 0x7f6d69c7fae4\

Thanks!

In case you need another approach, Recognize can be Dockerized too.
Given the hassle I went through to get mine working, I have shared a Dockerized version along with the approaches I used, available here:
https://github.com/bugsyb/recognize_docker
The latest commits, and what I use myself, are the nvidia-tensor variant built on the NVIDIA-released TensorFlow Docker container, with PHP, Nextcloud, and Recognize added on top, plus some other custom apps that are easily removable from the build.

In terms of support: some older GPUs might not be ported to or supported under CUDA 12; they are simply being dropped.

Currently, for most people, the main limitation here is the requirement set by Recognize's code, which demands CUDA 11.

The shared Dockerfiles also show other approaches to building the TensorFlow container.

Due to the CUDA 11 requirement, we're stuck on specific versions of the underlying OS-level libraries. However, since it's Docker, that doesn't matter: it can run on any system as long as the NVIDIA Container Toolkit is installed at the Docker host level.
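The Docker route additionally needs the GPU exposed to the container. A minimal docker-compose sketch, assuming the NVIDIA Container Toolkit is installed on the host; the image name is a placeholder, not a published image:

```yaml
# Sketch: expose the host GPU to a Nextcloud/Recognize container.
# Assumes the NVIDIA Container Toolkit is installed on the Docker host.
services:
  nextcloud-recognize:
    image: recognize-gpu:local   # placeholder for whatever the Dockerfiles build
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: 1
              capabilities: [gpu]
```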
