[TESTS][Vega][WORKAROUND] test_conv_embed_db fails with ROCm 4.4 tuning #1161
Comments
We expect to have this issue fixed within the next two weeks.
Note that if the ConvHipImplicitGemmV4R4Fwd configs are really missing from the perf-db (i.e. this issue is not specific to the "embedded" configuration of the library), then we most likely have a performance drop. Why: the CI logs show that this Solver is the fastest one, and tuning can make it faster.
@JehandadKhan hopefully we can fix this issue with the upcoming 4.5 tuning?
@atamazov and @junliume I tuned the solver and added those entries to the db; however, I was unable to fix the issue when the test runs in CI. I need to investigate further and figure out whether I am doing the right thing. The tuning PR going through staging would tell us whether there is a perf regression elsewhere; if one is discovered, we can cover it there.
[Symptom]
test_conv_embed_db fails with ROCm 4.4 tuning on the following stage:
http://micimaster.amd.com/blue/organizations/jenkins/MLLibs%2FMIOpen/detail/tuning-rocm-4.4-fordev/12/pipeline
http://micimaster.amd.com/blue/organizations/jenkins/MLLibs%2FMIOpen/detail/tuning-rocm-4.4-fordev/13/pipeline
http://micimaster.amd.com/blue/organizations/jenkins/MLLibs%2FMIOpen/detail/tuning-rocm-4.4-fordev/14/pipeline
[Root Cause]
Not yet known; the failure is not reproducible in local tests.
[Plan]
Workaround to unblock performance tuning updates