
Add CudaUVMSpace specializations for cuBLAS IAMAX and SCAL #758

Closed
vqd8a opened this issue Jul 2, 2020 · 9 comments
vqd8a (Contributor) commented Jul 2, 2020

Requested by GEMMA for integrating GEMMA and ADELUS.

srajama1 (Contributor) commented Jul 2, 2020

Oh man, why do they need UVM when I am trying to remove it in other places.

crtrott (Member) commented Jul 2, 2020

Is that for ETI? Because what else would the specialization be?

vqd8a (Contributor, Author) commented Jul 2, 2020

It is for the cuBLAS TPL. Kokkos Kernels' IAMAX and SCAL have no CudaUVMSpace specializations when calling the cuBLAS TPL; we only have CudaSpace specializations. For example:

KOKKOSBLAS1_ZIAMAX_TPL_SPEC_DECL_CUBLAS( unsigned long, Kokkos::LayoutLeft, Kokkos::CudaSpace, true)

When CudaUVMSpace is used, the fallback implementations are called instead, which can be slower than cuBLAS.
I did a simple fix in #759 by adding CudaUVMSpace specializations to the TPL files: KokkosBlas1_iamax_tpl_spec_decl.hpp, KokkosBlas1_iamax_tpl_spec_avail.hpp, KokkosBlas1_scal_tpl_spec_decl.hpp, and KokkosBlas1_scal_tpl_spec_avail.hpp.

@vqd8a vqd8a changed the title Add CudaUVMSpace specializations for IAMAX and SCAL Add CudaUVMSpace specializations for cuBLAS IAMAX and SCAL Jul 2, 2020
crtrott (Member) commented Jul 2, 2020

Why isn't this type-erased before it reaches this point (i.e. using Device<Cuda,AnonymousSpace>)?

vqd8a (Contributor, Author) commented Jul 2, 2020

Would that require rewriting all the TPL wrappers? I just followed what was done in #397 and #399 for GEMM as a quick fix.

vqd8a (Contributor, Author) commented Jul 2, 2020

As pointed out by @ndellingwood, there is an open issue (#144) about AnonymousSpace, but it was backlogged. I feel it is somewhat beyond the scope of my PR. But should using AnonymousSpace be reprioritized? @srajama1 @crtrott

crtrott (Member) commented Jul 2, 2020

Yeah, it's fine not to do it in your PR, but generally it might be worthwhile to look at that.

vqd8a (Contributor, Author) commented Jul 2, 2020

Thanks @crtrott

vqd8a (Contributor, Author) commented Jul 2, 2020

I will look at that.

4 participants