[Bug][Metaschedule] Tuning trial hanging after one task #12330
Labels
needs-triage
PRs or issues that need to be investigated by maintainers to find the right assignees to address it
type: bug
I encountered this when trying to run this script over RPC on machines with v100's. Though it was done using Relax, @zxybazh says he thinks this can probably be triggered on mainline as well.
I ran ResNet-50 on V100 with an input shape of (1, 3, 224, 224), using 5 tuning trials. The tuning task began started hanging on the first tuning task,
fused_conv2d_add_relu
. It appeared that there were failures encountered during the task.Output from the host:
The tail of the long of task 1 (excerpted, as it goes on for a long time):
cc @Hzfengsy @junrushao @junrushao1994
The text was updated successfully, but these errors were encountered: