[Bug] no suitable user-defined conversion from "int2" to "longlong2" exists #13379
Labels
needs-triage
type: bug
PR #13317 introduces a bug when generating CUDA kernels for ops that take an index input (e.g., gather).
After some investigation, the failure turns out to depend on the input shape. In the unit tests shown below, the error occurs only when the last dimension is an even number (e.g., 2).
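For readers unfamiliar with the error in the title: CUDA's built-in vector types such as `int2` and `longlong2` are plain structs, and no conversion is defined between them, so a cast from one to the other does not compile. The sketch below is not the kernel TVM generated. `widen_indices` and its parameters are hypothetical names used only to show the failing pattern (commented out) next to an element-wise alternative built with `make_longlong2`. That an even last dimension leads to a two-lane index cast of this kind is an assumption based on the error message, not something confirmed in this report.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel for illustration only (not TVM's generated code):
// widens pairs of 32-bit indices to pairs of 64-bit indices.
__global__ void widen_indices(const int2* in, longlong2* out, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        int2 v = in[i];
        // The next line is the kind of cast that nvcc rejects, because
        // int2 and longlong2 are unrelated struct types:
        //   longlong2 w = (longlong2)v;
        // Converting lane by lane compiles and keeps the values intact.
        out[i] = make_longlong2(static_cast<long long>(v.x),
                                static_cast<long long>(v.y));
    }
}

int main() {
    const int n = 4;
    int2 h_in[n] = {{0, 1}, {2, 3}, {4, 5}, {6, 7}};
    longlong2 h_out[n];

    int2* d_in = nullptr;
    longlong2* d_out = nullptr;
    cudaMalloc(&d_in, sizeof(h_in));
    cudaMalloc(&d_out, sizeof(h_out));
    cudaMemcpy(d_in, h_in, sizeof(h_in), cudaMemcpyHostToDevice);

    widen_indices<<<1, n>>>(d_in, d_out, n);
    cudaMemcpy(h_out, d_out, sizeof(h_out), cudaMemcpyDeviceToHost);

    // Print each widened pair on its own line.
    for (int i = 0; i < n; ++i) {
        printf("%lld %lld\n", h_out[i].x, h_out[i].y);
    }

    cudaFree(d_in);
    cudaFree(d_out);
    return 0;
}
```

Uncommenting the cast line should be enough to see the same class of compile error with `nvcc`, independent of TVM.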
cc @vinx13 @junrushao
Expected behavior
No error.
Actual behavior
Environment
CUDA 11.3 on NVIDIA T4.
Steps to reproduce