It's worth pointing out that dot via Kokkos is faster than the library calls in most cases I've seen. I imagine the reason is that dot is effectively a bandwidth race, and the extra indirection of a library call introduces a tiny penalty. Perhaps it would be better to provide CMake options that allow users to override the default choice, and leave the default as calling Kokkos Kernels. At least that way it is explicitly stated when Kokkos is used vs. a TPL, and there is a mechanism to change it.
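A rough sketch of the kind of override I mean, in plain C++ so it stands alone. Everything here is an assumption for illustration: `EXAMPLE_DOT_USE_TPL`, `dot_native`, `dot_tpl`, and `dot_uses_tpl` are hypothetical names, not existing Kokkos Kernels options or functions, and `std::inner_product` merely stands in for a vendor BLAS call. The macro would be set from a CMake option in a real build; left unset, the native path is the default.

```cpp
#include <cassert>
#include <cstddef>
#include <numeric>
#include <vector>

// Hypothetical switch. In a real build this would come from a CMake option
// (e.g. passed as a compile definition). It is NOT an existing Kokkos Kernels flag.
#ifndef EXAMPLE_DOT_USE_TPL
#define EXAMPLE_DOT_USE_TPL 0  // default: use the native implementation
#endif

// Stand-in for the native (Kokkos-style) implementation: a plain reduction loop.
inline double dot_native(const std::vector<double>& x,
                         const std::vector<double>& y) {
  assert(x.size() == y.size());
  double sum = 0.0;
  for (std::size_t i = 0; i < x.size(); ++i) {
    sum += x[i] * y[i];
  }
  return sum;
}

// Stand-in for the TPL path; std::inner_product plays the role of a BLAS dot call.
inline double dot_tpl(const std::vector<double>& x,
                      const std::vector<double>& y) {
  assert(x.size() == y.size());
  return std::inner_product(x.begin(), x.end(), y.begin(), 0.0);
}

// Single entry point: the choice is made at compile time and is visible in one place.
inline double dot(const std::vector<double>& x,
                  const std::vector<double>& y) {
#if EXAMPLE_DOT_USE_TPL
  return dot_tpl(x, y);
#else
  return dot_native(x, y);
#endif
}

// Lets callers (or tests) report which path was compiled in.
constexpr bool dot_uses_tpl() { return EXAMPLE_DOT_USE_TPL != 0; }
```

Compiling with `-DEXAMPLE_DOT_USE_TPL=1` would flip `dot` to the stand-in library path; the point is only that the selection is explicit and user-controllable.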
kokkos-kernels/src/impl/tpls/KokkosBlas1_dot_tpl_spec_avail.hpp
Line 104 in 4ee5f3c
@kyungjoo-kim @jjellio