[CodeGen][CUDA] Fix issues in cuda codegen #4876

wpan11nv · 2020-02-13T18:32:16Z

Do not emit shared etc. as part of type for casting
Fix fp16 reduction kernels with compiler errors:

"no operator "+" matches these operands, volatile half + volatile half

This patch inserts casts to remove volatile type qualifier following
volatile loads (fp16 only). CUDA fp16 library headers should add
volatile member functions.

Signed-off-by: Wei Pan weip@nvidia.com

Thanks for contributing to TVM! Please refer to guideline https://docs.tvm.ai/contribute/ for useful information and tips. After the pull request is submitted, please request code reviews from Reviewers by @ them in the pull request thread.

wpan11nv · 2020-02-13T18:48:53Z

This patch should fix errors observed below (I did not verify as I found no complete reproducers there). My own test works fine with CUDA 10.2.

https://discuss.tvm.ai/t/error-fp16-cuda-compilation-error/4586

This issue has been also reported to NVIDIA CUDA team.

tqchen · 2020-02-13T19:12:30Z

cc @vinx13 @ZihengJiang please help to take a look

vinx13 · 2020-02-15T02:06:34Z

We can remove these lines
https://github.com/apache/incubator-tvm/blob/master/src/target/source/codegen_cuda.cc#L60-L73

vinx13 · 2020-02-15T02:07:16Z

also cc @yzhliu @zxy844288792 @Hzfengsy

- Do not emit __shared__ etc. as part of type for casting - Fix fp16 reduction kernels with compiler errors: "no operator "+" matches these operands, volatile half + volatile half This patch inserts casts to remove volatile type qualifier following volatile loads (fp16 only). CUDA fp16 library headers should add volatile member functions. - Update have_fp16 to include compute 6.1 GPUs, which do support fp16, although their fp16 throughput is low. Updated tests. Signed-off-by: Wei Pan <weip@nvidia.com>

wpan11nv · 2020-02-15T03:56:37Z

Updated the patch as suggested. Thanks!

vinx13 · 2020-02-16T03:47:46Z

Thanks @wpan11nv this is merged

- Do not emit __shared__ etc. as part of type for casting - Fix fp16 reduction kernels with compiler errors: "no operator "+" matches these operands, volatile half + volatile half This patch inserts casts to remove volatile type qualifier following volatile loads (fp16 only). CUDA fp16 library headers should add volatile member functions. - Update have_fp16 to include compute 6.1 GPUs, which do support fp16, although their fp16 throughput is low. Updated tests. Signed-off-by: Wei Pan <weip@nvidia.com>

wpan11nv requested a review from vinx13 February 13, 2020 18:50

tqchen assigned vinx13 Feb 13, 2020

tqchen added the status: need review label Feb 14, 2020

wpan11nv force-pushed the fp16_reduction_fixes branch 2 times, most recently from c31bccd to d3d8b0c Compare February 14, 2020 22:37

wpan11nv force-pushed the fp16_reduction_fixes branch from d3d8b0c to 44d463a Compare February 15, 2020 03:53

vinx13 approved these changes Feb 16, 2020

View reviewed changes

vinx13 merged commit d50ba72 into apache:master Feb 16, 2020

vinx13 added status: accepted and removed status: need review labels Feb 16, 2020

wpan11nv deleted the fp16_reduction_fixes branch February 17, 2020 05:38

ChaiByte mentioned this pull request Sep 2, 2020

【Help-wanted】在ubuntu18.04编译MegEngine时报错 MegEngine/MegEngine#97

Closed

ZihengJiang mentioned this pull request Sep 17, 2020

TVM v0.7 Release Note Candidate #6486

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CodeGen][CUDA] Fix issues in cuda codegen #4876

[CodeGen][CUDA] Fix issues in cuda codegen #4876

wpan11nv commented Feb 13, 2020

wpan11nv commented Feb 13, 2020 •

edited

Loading

tqchen commented Feb 13, 2020

vinx13 commented Feb 15, 2020

vinx13 commented Feb 15, 2020 •

edited

Loading

wpan11nv commented Feb 15, 2020

vinx13 commented Feb 16, 2020

[CodeGen][CUDA] Fix issues in cuda codegen #4876

[CodeGen][CUDA] Fix issues in cuda codegen #4876

Conversation

wpan11nv commented Feb 13, 2020

wpan11nv commented Feb 13, 2020 • edited Loading

tqchen commented Feb 13, 2020

vinx13 commented Feb 15, 2020

vinx13 commented Feb 15, 2020 • edited Loading

wpan11nv commented Feb 15, 2020

vinx13 commented Feb 16, 2020

wpan11nv commented Feb 13, 2020 •

edited

Loading

vinx13 commented Feb 15, 2020 •

edited

Loading