[add] support for half (GPU only) #6
base: espnet_v1.1
Conversation
[fix] pytorch binding case torch::ScalarType::Half
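As a rough illustration of what that commit title refers to, a dtype dispatch with a new `torch::ScalarType::Half` case might look like the sketch below; `compute_rnnt_loss`, `dispatch_rnnt_loss`, and their signatures are assumptions, not the repository's actual binding code:

```cpp
#include <torch/extension.h>
#include <cuda_fp16.h>
#include <stdexcept>

// Assumed templated entry point; the real binding's kernel launcher is
// elided in this sketch.
template <typename T>
void compute_rnnt_loss(torch::Tensor& acts) { /* kernel launch elided */ }

void dispatch_rnnt_loss(torch::Tensor& acts) {
    switch (acts.scalar_type()) {
        case torch::ScalarType::Float:
            compute_rnnt_loss<float>(acts);
            break;
        case torch::ScalarType::Half:  // the case this commit adds
            compute_rnnt_loss<__half>(acts);
            break;
        default:
            throw std::runtime_error("warp_rnnt: unsupported dtype");
    }
}
```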
Thanks for taking on that work!
As we talked off-list, the following changes are needed:
- torch bindings are WIP/borked
- remove `__host__` from the `HOSTDEVICE` macro and re-use it
- subsequently, `log_sum_exp` from `rnnt_helper` can be re-used from that (see the sketch after this comment)

Also, a more general note: this PR drops support for the Kepler and Maxwell archs. I'm not against it, but I have to confirm it won't affect too many people (unlikely though).
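For reference, a minimal sketch of the suggested direction, modeled on warp-transducer's `rnnt_helper.h`; the `__half` specialization and the exact intrinsics chosen here are assumptions, not the PR's actual code:

```cuda
#include <cuda_fp16.h>
#include <math.h>

// With __host__ removed, the macro is device-only, which lets the helpers
// below call GPU-only CUDA math library functions such as hexp()/hlog().
#define HOSTDEVICE __device__

template <typename T>
inline HOSTDEVICE T log_sum_exp(T a, T b) {
    if (a == -INFINITY) return b;
    if (b == -INFINITY) return a;
    if (a > b)
        return log1p(exp(b - a)) + a;
    else
        return log1p(exp(a - b)) + b;
}

// Hypothetical __half specialization re-using the same pattern through the
// CUDA math library's half intrinsics.
template <>
inline HOSTDEVICE __half log_sum_exp(__half a, __half b) {
    const __half neg_inf = __float2half(-INFINITY);
    if (__heq(a, neg_inf)) return b;
    if (__heq(b, neg_inf)) return a;
    const bool a_gt_b = __hgt(a, b);
    const __half hi = a_gt_b ? a : b;
    const __half lo = a_gt_b ? b : a;
    const __half one = __float2half(1.0f);
    return __hadd(hi, hlog(__hadd(one, hexp(__hsub(lo, hi)))));
}
```

The `__h*` intrinsics and `hexp()`/`hlog()` require compute capability 5.3 or newer, which lines up with the note above about dropping the Kepler and Maxwell archs.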
Thank you for your feedback.
Sorry for the delay! Compilation is OK, but import fails with the following error:
ImportError: /home/b-flo/warp-transducer/pytorch_binding/warprnnt_pytorch/warp_rnnt.cpython-310-x86_64-linux-gnu.so: undefined symbol: _ZNK2at10TensorBase8data_ptrI6__halfEEPT_vI
I'll let you figure it out, but feel free to ask off-list for any help/hints on resolving this error!
P.S.: Postponing the full review until it's in a stable state.
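For context, the undefined symbol demangles to `__half* at::TensorBase::data_ptr<__half>() const`: libtorch instantiates `data_ptr<T>()` only for its own scalar types such as `at::Half`, not for CUDA's `__half`. A minimal sketch of the usual workaround, not necessarily the fix this PR ended up with (the `half_data_ptr` helper name is hypothetical):

```cpp
#include <torch/extension.h>
#include <cuda_fp16.h>

// at::Half and __half are both 16-bit IEEE half types with identical layout,
// so fetching the pointer through the type libtorch actually instantiates
// and reinterpreting it avoids the missing symbol.
static __half* half_data_ptr(torch::Tensor& t) {
    return reinterpret_cast<__half*>(t.data_ptr<at::Half>());
}
```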
CMakeLists.txt (outdated diff)

```diff
+# Drop support for old GPU to use CUDA math library functions
+IF(NOT (CUDA_VERSION GREATER 10.2))
-    set(CUDA_NVCC_FLAGS "${CUDA_NVCC_FLAGS} -gencode arch=compute_30,code=sm_30 -O2")
-    set(CUDA_NVCC_FLAGS "${CUDA_NVCC_FLAGS} -gencode arch=compute_35,code=sm_35")
-    set(CUDA_NVCC_FLAGS "${CUDA_NVCC_FLAGS} -gencode arch=compute_50,code=sm_50")
+    # set(CUDA_NVCC_FLAGS "${CUDA_NVCC_FLAGS} -gencode arch=compute_30,code=sm_30 -O2")
+    # set(CUDA_NVCC_FLAGS "${CUDA_NVCC_FLAGS} -gencode arch=compute_35,code=sm_35")
+    # set(CUDA_NVCC_FLAGS "${CUDA_NVCC_FLAGS} -gencode arch=compute_50,code=sm_50")
+ENDIF()

-set(CUDA_NVCC_FLAGS "${CUDA_NVCC_FLAGS} -gencode arch=compute_52,code=sm_52")
+#set(CUDA_NVCC_FLAGS "${CUDA_NVCC_FLAGS} -gencode arch=compute_52,code=sm_52")
```
By the way, you can remove the condition and the commented-out lines (see the sketch below).
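Applying that suggestion would leave something like the following, assuming the remaining `-gencode` targets are half-capable archs; the specific sm_60/sm_70 lines are illustrative assumptions, not the file's actual contents:

```cmake
# Only archs with native half support (compute capability >= 5.3) remain;
# the CUDA_VERSION guard and the commented-out sm_30/35/50/52 lines are gone.
set(CUDA_NVCC_FLAGS "${CUDA_NVCC_FLAGS} -gencode arch=compute_60,code=sm_60")
set(CUDA_NVCC_FLAGS "${CUDA_NVCC_FLAGS} -gencode arch=compute_70,code=sm_70")
```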
Force-pushed from a1bb689 to 77ae1f6
Force-pushed from 77ae1f6 to 7f48e85
Support is added for GPU use only.
The half type used comes from the CUDA Math library, and some of its functions are usable only on GPU.
Most of the work consists of specializing templates to avoid ambiguous calls.
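As an illustration of that last point, a minimal sketch of such a specialization; the `exp_op` functor name and shape are assumptions, not the PR's actual code:

```cuda
#include <cuda_fp16.h>

// Generic functor: fine for float/double, where exp() has device overloads.
template <typename T>
struct exp_op {
    __device__ T operator()(T x) const { return exp(x); }
};

// For __half, exp(x) would be ambiguous or force an implicit conversion, so
// the specialization calls the CUDA math library's hexp() directly; hexp()
// is device-only, which is why the PR supports GPU use only.
template <>
struct exp_op<__half> {
    __device__ __half operator()(__half x) const { return hexp(x); }
};
```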