Fix gather_op to avoid cudaErrorLaunchFailure for solov2, test=develop #34200

haohongxiang · 2021-07-15T15:24:57Z

Bug fixes

OPs

Fix gather_op to avoid cudaErrorLaunchFailure for solov2
icafe卡片：https://console.cloud.baidu-int.com/devops/icafe/issue/DLTP-32060/show
由于框架dev的改动，导致solov2模型评估/预测报错

经排查：在gather_op的cuda_kernel_loop对index做越界分析时，将index与上界input_size(the size of input)作比较时，报错信息如下：

目前未找到合适的解决方法，先取消上界越界分析，后续找到解决方法后再修复。

PR链接：#34096

再更：(解决方法是将input_size在GPUGather内从cpu拷贝到gpu后再传入cuda_kernel中进行比较，但会影响此基础Op的性能)

paddle-bot-old · 2021-07-15T15:25:01Z

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

ForFishes

LGTM

Fix gather_op to avoid cudaErrorLaunchFailure for solov2, test=develop

fb43245

ForFishes approved these changes Jul 16, 2021

View reviewed changes

ForFishes merged commit 380bc4e into PaddlePaddle:develop Jul 16, 2021

haohongxiang deleted the dev branch July 16, 2021 07:05

Provide feedback