-
Notifications
You must be signed in to change notification settings - Fork 441
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Apply ROUNDUP_LWORK function in lapack #904
Apply ROUNDUP_LWORK function in lapack #904
Conversation
Codecov ReportAttention:
Additional details and impacted files@@ Coverage Diff @@
## master #904 +/- ##
=======================================
Coverage 0.00% 0.00%
=======================================
Files 1918 1918
Lines 188614 188617 +3
=======================================
- Misses 188614 188617 +3
☔ View full report in Codecov by Sentry. |
d7accd3
to
cc850c0
Compare
Great that you took the time to work on that. Thanks! I believe we should not think to much and just apply the round ups for single and double precision. If it is easy to do, I would include it in this PR. It adds minimal overhead to the workspace calculation, and the benefits can be big. |
I'll ping @mgates3 here as well, which might have something to add to the discussion. |
Fine to add it to double-precision files as well. Keeps single & double consistent and avoids issues if single is generated from double somehow.
Another way to look at it is lwork ≥ 8 bytes * 2^53 / 1024^5 = 64 PiB of memory before there was an issue for In MAGMA, the equivalent to |
Let me suggest to apply only for s|c precisions. As it was mentioned by @mgates3 , 64 PiB of memory before there was an issue for double. |
Thank you @mgates3 for the comprehensive comment. I think that adding functionality for d|z precision unnecessary at the moment, since the probability of working with matrix of size 47mn is extremely small. But for single precision I've met the rounding issue with matrix size 9000. |
465dd08
to
c453054
Compare
Hi @kleineLi. Would you be able to merge to/rebase with the current master? The script for the Github Actions needed to be updated in a recent commit. Thanks |
d3d3d39
to
095aca0
Compare
Hi @weslleyspereira. I have rebased my branch on the current master and completed all changes in this PR |
095aca0
to
fa1336d
Compare
fa1336d
to
39f7097
Compare
@kleineLi Could you be so kind and apply the fix also to the alternative implementation of the QR decomposition ( |
39f7097
to
689795f
Compare
Done |
689795f
to
28bf3f1
Compare
28bf3f1
to
88810e4
Compare
We can probably merge this one. I don't think the failure in AppVeyor has anything to do with this PR. The failure has to do with AppVeyor not finding the flang compiler. |
Apply ROUNDUP_LWORK function in lapack
This work is continuation of PR #605. Applied only for s|c precisions. Do we need these changes to be applied for d|z precisions or it is redundant?
Please review!
/cc @weslleyspereira