You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
On kokkos-dev2, with Clang 8 and CUDA 10.0 (a spot check build), team_dot_complex_double fails because of cudaErrorMisalignedAddress. Since this passes in all other CUDA builds (even with 16 == alignof(Kokkos::complex<double>)), it seems likely this is just a compiler bug.
I can tell this error is caused by the KokkosBlas::Experimental::dot device function, since cudaGetLastError() reports the misaligned access right after the functor launches in the test Test_Blas1_team_dot.hpp, in impl_test_team_dot():
If it's not a compiler bug, I have no idea where the actual bug could be. This dot implementation is definitely calling TeamDot::team_dot in KokkosBlas1_team_dot_spec.hpp, and this takes actual views as input, not raw pointers. I checked that the X and Y passed to this functor are correctly aligned too.
Disabling KOKKOS_ENABLE_COMPLEX_ALIGN does make the test pass.
The text was updated successfully, but these errors were encountered:
On kokkos-dev2, with Clang 8 and CUDA 10.0 (a spot check build), team_dot_complex_double fails because of
cudaErrorMisalignedAddress
. Since this passes in all other CUDA builds (even with16 == alignof(Kokkos::complex<double>)
), it seems likely this is just a compiler bug.I can tell this error is caused by the
KokkosBlas::Experimental::dot
device function, since cudaGetLastError() reports the misaligned access right after the functor launches in the test Test_Blas1_team_dot.hpp, inimpl_test_team_dot()
:If it's not a compiler bug, I have no idea where the actual bug could be. This dot implementation is definitely calling
TeamDot::team_dot
in KokkosBlas1_team_dot_spec.hpp, and this takes actual views as input, not raw pointers. I checked that the X and Y passed to this functor are correctly aligned too.Disabling
KOKKOS_ENABLE_COMPLEX_ALIGN
does make the test pass.The text was updated successfully, but these errors were encountered: