ROCm-3.7+ broken on gfx803 #1265
Comments
Hi @xuhuisheng, thank you for bringing this up. Let me look into it. Are you using Fiji?
@ashutoshamd Thank you for the reply. The official TensorFlow sample reproduces this issue about 90% of the time: https://www.tensorflow.org/tutorials/keras/text_classification
My environment: OS: Ubuntu-20.04
How does one downgrade?
Hi @AsimPoptani
I'm encountering the same issue here: benchmarking ROCm 3.9 and 3.10 on a system with 2x R9 Nano GPUs shows ROCm running more than 10x slower than Clover (if it runs at all):
Hi @jpsollie
AMD officially dropped gfx8 support as of ROCm 4.0, as per https://github.com/RadeonOpenCompute/ROCm#hardware-and-software-support . Some things might still work.
@ROCmSupport this seems a bit ridiculous, as NVIDIA still supports the 10 series graphics cards, which are just as old...
Also, @ROCmSupport, the link is broken.
It is https://github.com/RadeonOpenCompute/ROCm#hardware-and-software-support.
Here is my guide to downgrade to ROCm 3.5.1 + TensorFlow 2.2:
Has anyone on Arch uploaded a working ROCm build yet? I've been struggling with this now.
I resolved this problem using this: https://githubmemory.com/repo/xuhuisheng/rocm-gfx803 but I had to add this: tf.compat.v1.disable_eager_execution(). However, my routine runs 2 times faster without the graphics card (on an AMD® Ryzen 5 2600 six-core processor × 12) than with it :(. Is that normal? Is it because this is not the best graphics card for ML? P.S. I have not downgraded my ROCm; I don't know how, or how to check which version I have.
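For anyone unsure which ROCm version is installed, here is a minimal sketch. It assumes the installation lives under /opt/rocm and ships a plain-text version file at /opt/rocm/.info/version; that path is an assumption about a typical install, not something confirmed in this thread, and the helper name `read_rocm_version` is made up for illustration.

```python
from pathlib import Path
from typing import Optional


def read_rocm_version(version_file: str = "/opt/rocm/.info/version") -> Optional[str]:
    """Return the first line of a ROCm version file, or None if it is missing or empty.

    The default path is an assumption about a typical ROCm install;
    pass a different path if your installation lives elsewhere.
    """
    path = Path(version_file)
    if not path.is_file():
        return None
    lines = path.read_text().strip().splitlines()
    # The file is expected to hold a single line such as "3.5.1-34".
    return lines[0].strip() if lines else None
```

Calling `read_rocm_version()` with no arguments checks the assumed default location and returns None when nothing is found there, so a None result only means the file was not at that path.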
This issue tracks problems with ROCm-3.7+ on gfx803, e.g. rx470, rx570, rx580.
If you have a gfx803 card and installed ROCm-3.7, ROCm-3.8, ROCm-3.9, or ROCm-4.0, you may run into situations like:
Related issues:
My advice is:
Investigation info:
Building rocBLAS with BUILD_WITH_TENSILE_HOST=false solves this issue.
Starting with ROCm-3.7, rocBLAS changed BUILD_WITH_TENSILE_HOST from false to true, which is why ROCm-3.5.1 still works properly.
It seems the new Tensile client that rocBLAS uses does not support gfx803.
Created an issue for rocBLAS: ROCm/rocBLAS#1172
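The version boundary described above (3.5.1 works, 3.7 and later are reported broken on gfx803) can be sketched as a small check. This is only an illustration of the rule as stated in this issue; the function names `parse_version` and `likely_affected` are hypothetical, and the check says nothing about whether a custom rocBLAS build with BUILD_WITH_TENSILE_HOST=false is in use.

```python
from typing import Tuple


def parse_version(version: str) -> Tuple[int, ...]:
    """Turn a string such as '3.5.1' or '3.7.0-20' into a tuple of ints, e.g. (3, 5, 1)."""
    core = version.split("-", 1)[0]  # drop any build suffix after the first '-'
    return tuple(int(part) for part in core.split("."))


def likely_affected(rocm_version: str) -> bool:
    """Return True for ROCm 3.7 or newer, the range this issue reports as broken on gfx803."""
    return parse_version(rocm_version) >= (3, 7)
```

Comparing integer tuples rather than raw strings matters here because "3.10" sorts before "3.7" as text, while (3, 10) correctly compares as newer than (3, 7).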