-
Notifications
You must be signed in to change notification settings - Fork 242
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Tuning] rocm 4.4 update for develop #1133
Conversation
This comment has been minimized.
This comment has been minimized.
@cderb CI has some interesting issues:
The following tests FAILED:
|
I'll run performance tests as soon as CI passed. |
CI issues are reproducible so we may need to fix them first @cderb @JehandadKhan |
|
I will merge recent develop and restart CI. |
CI run CI run Restarted. |
It seems like the |
Issue #1161 logged and will keep track |
I am running performance tests on Vega20 right now. |
[Intermediate testing results] Unfortunately I see performance & correctness problems with FP32 on gfx906_60. Analysis is in progress. An example of correctness problem: duplicated algorithm in find-db (file: gfx906_60.HIP.fdb.txt)
Two records that refer to the same algorithm, |
Unfortunately, there are some incorrect FP16 records that lead to errors like this: MIOpen(HIP): Info [FindSolutionImpl] Perf Db: record not found for: ConvOclDirectFwdFused Some detailsExample of incorrect record:
How it should look like:
Notice This PR must be reverted. More details available upon request. |
Maybe it is enough to revert only find-db changes. I am investigating this possibility. |
No. There is some FP32 perf regression (~23%) due to missing perf-db records (many "Perf Db: record not found for..." messages).
FP16/BF16 not tested. Let's revert all. |
…or develop"" This reverts commit 86e7e0a.
No description provided.