-
Notifications
You must be signed in to change notification settings - Fork 106
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GPU friendly BLAS calls #251
Comments
see also #245 (I actually believe this is a duplicate of that) |
There are also several hard-coded |
Since I need this, I might be able to go over some of these. If anyone has a usecase/ something that doens't currently work, please post it so I can take it into account. |
We have some of our own code implemented: See also this comment for our needs: CliMA/ClimateMachine.jl#330 (comment) |
@ChrisRackauckas found out that using CuArrays fails at points like https://github.com/JuliaMath/IterativeSolvers.jl/blob/17ef2613dff1d80724886797160ee01bd1bd2c71/src/simple.jl#L38
where the call to
BLAS.axpy!
explicitly assumes that the inputs are OK with OpenBLAS. The right alternative would be to useLinearAlgebra.axpy!
(which as it turns out is a different function) and let it naturally dispatch to BLAS iff possible.The text was updated successfully, but these errors were encountered: