-
Notifications
You must be signed in to change notification settings - Fork 99
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
BLAS nrm1 (aka asum) returns a different value than TPLs for complex input #914
Comments
brian-kelley
added a commit
to brian-kelley/kokkos-kernels
that referenced
this issue
Mar 19, 2021
- Made nrm1 compute the sum of all absolute real and imaginary parts to match BLAS/MKL/CUBLAS behavior, rather than sum of magnitudes. - Improved unit test coverage - verify each output element, not just dotprod of output with itself - for complex, create randomized inputs with nonzero imaginary parts - enable conj-trans mode testing for gemv
brian-kelley
added a commit
to brian-kelley/kokkos-kernels
that referenced
this issue
Mar 19, 2021
- Made nrm1 compute the sum of all absolute real and imaginary parts to match BLAS/MKL/CUBLAS behavior, rather than sum of magnitudes. - Improved unit test coverage - verify each output element, not just dotprod of output with itself - for complex, create randomized inputs with nonzero imaginary parts - enable conj-trans mode testing for gemv
brian-kelley
added a commit
to brian-kelley/kokkos-kernels
that referenced
this issue
Mar 19, 2021
- Made nrm1 compute the sum of all absolute real and imaginary parts to match BLAS/MKL/CUBLAS behavior, rather than sum of magnitudes. - Improved unit test coverage - verify each output element, not just dotprod of output with itself - for complex, create randomized inputs with nonzero imaginary parts - enable conj-trans mode testing for gemv
brian-kelley
added a commit
to brian-kelley/kokkos-kernels
that referenced
this issue
Mar 19, 2021
- Made nrm1 compute the sum of all absolute real and imaginary parts to match BLAS/MKL/CUBLAS behavior, rather than sum of magnitudes. - Improved unit test coverage - verify each output element, not just dotprod of output with itself - for complex, create randomized inputs with nonzero imaginary parts - enable conj-trans mode testing for gemv
brian-kelley
added a commit
that referenced
this issue
Mar 22, 2021
Fixed nrm1 (#914), removed cublas nrminf, improved blas tests
Fixed with #915. |
lucbv
pushed a commit
to lucbv/kokkos-kernels
that referenced
this issue
May 10, 2021
- Made nrm1 compute the sum of all absolute real and imaginary parts to match BLAS/MKL/CUBLAS behavior, rather than sum of magnitudes. - Improved unit test coverage - verify each output element, not just dotprod of output with itself - for complex, create randomized inputs with nonzero imaginary parts - enable conj-trans mode testing for gemv
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Our KokkosBlas::nrm1 (and wrapper KokkosBlas::asum) return the sum of magnitudes of the elements:
but BLAS TPLs (MKL, CUBLAS, netlib) return this the sum of absolute real and imaginary parts:
There is no difference between the two for real inputs (because imag_i is 0 for all i). I will fix this in our implementation.
The text was updated successfully, but these errors were encountered: