[CUDA] [Codegen] Ensuring atleast one thread block to handle empty tensor #7273

anijain2305 · 2021-01-13T20:14:32Z

Topk was failing on CUDA when k is a variable and its value is 0 at runtime. On closer inspection, I found that there were 0 thread blocks at runtime. This PR ensures that there is at least 1 thread block.
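To illustrate the idea (this is a plain-Python sketch, not the PR's actual diff): a grid dimension of zero is not a valid CUDA launch configuration, so the block count is clamped to at least 1, and a per-thread bounds check keeps the extra block from touching an empty tensor. The names `ceil_div`, `launch_dims`, and `active_threads`, and the default of 256 threads per block, are assumptions made for this example.

```python
def ceil_div(a, b):
    """Integer ceiling division for non-negative a and positive b."""
    return (a + b - 1) // b


def launch_dims(num_elements, threads_per_block=256):
    """Return (blocks, threads_per_block), clamping blocks to at least 1.

    Hypothetical helper: without the clamp, num_elements == 0 would
    yield 0 blocks, which is not a valid launch configuration.
    """
    blocks = ceil_div(num_elements, threads_per_block)
    return max(1, blocks), threads_per_block


def active_threads(num_elements, threads_per_block=256):
    """Count the simulated threads that pass the `idx < num_elements` guard."""
    blocks, threads = launch_dims(num_elements, threads_per_block)
    return sum(
        1
        for block in range(blocks)
        for thread in range(threads)
        if block * threads + thread < num_elements
    )
```

With an empty tensor, `launch_dims(0)` still yields one block, but `active_threads(0)` shows that no thread gets past the guard, so the launch is valid and does no work.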

anijain2305 · 2021-01-13T20:35:39Z

@kevinthesun @masahi @mbrookhart @zhiics @trevor-m Please review.

masahi · 2021-01-13T21:56:40Z

hmm, I think I've already added a fix for such cases, here:

tvm/python/tvm/tir/ir_builder.py, lines 205 to 206 in 82942fb:

    if attr_key == "thread_extent":
        value = op.max(1, value)

Do you know why it is not working? cc @mbrookhart

anijain2305 · 2021-01-13T22:01:14Z

> hmm, I think I've already added a fix for such cases, here:
>
> tvm/python/tvm/tir/ir_builder.py, lines 205 to 206 in 82942fb:
>
>     if attr_key == "thread_extent":
>         value = op.max(1, value)
>
> Do you know why it is not working? cc @mbrookhart

Is this because the lines you pointed to are specific to the IR builder, while the failure I see is in an injective schedule? My failure was coming from an injective schedule.

mbrookhart · 2021-01-13T22:06:01Z

Yeah, I think this change catches it at a lower level. We might not need the ir_builder change after this.
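A rough sketch of why a lower-level clamp can cover both paths, in plain Python rather than TVM code. Every name here (`launch`, `from_ir_builder`, `from_injective_schedule`) and the block extent of 64 are hypothetical and only stand in for the two code paths discussed above; the point is that a clamp at the shared final step applies no matter which front end produced the extent.

```python
def launch(grid_extent, block_extent):
    """Hypothetical shared launch step: clamp the grid extent once, here."""
    return max(1, grid_extent), block_extent


def from_ir_builder(extent):
    """Hypothetical stand-in for a kernel written with the IR builder."""
    return launch(extent, 64)


def from_injective_schedule(extent):
    """Hypothetical stand-in for a kernel produced by an injective schedule."""
    return launch(extent, 64)
```

Both entry points return a grid extent of 1 for an extent of 0, without either of them needing its own `max(1, ...)`.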

kevinthesun

LGTM

mbrookhart

LGTM

masahi

I see, thanks.

[CUDA] [Codegen] Ensuring atleast one thread block for dynamism

8c20809

anijain2305 force-pushed the cuda_illegal branch from 779e437 to 8c20809 Compare January 13, 2021 20:34

anijain2305 changed the title ~~[CUDA] [Codegen] Ensuring atleast one thread block for dynamism~~ [CUDA] [Codegen] Ensuring atleast one thread block to handle empty tensor Jan 13, 2021

kevinthesun approved these changes Jan 13, 2021

mbrookhart approved these changes Jan 13, 2021

masahi approved these changes Jan 13, 2021

zhiics approved these changes Jan 13, 2021

masahi merged commit 8d3c0e7 into apache:main Jan 14, 2021

masahi pushed a commit to masahi/tvm that referenced this pull request Jan 18, 2021

[CUDA] [Codegen] Ensuring atleast one thread block for dynamism (apac…

87cbc2d

…he#7273)

TusharKanekiDey pushed a commit to TusharKanekiDey/tvm that referenced this pull request Jan 20, 2021

[CUDA] [Codegen] Ensuring atleast one thread block for dynamism (apac…

87c8083

…he#7273)

trevor-m pushed a commit to neo-ai/tvm that referenced this pull request Jan 21, 2021

[CUDA] [Codegen] Ensuring atleast one thread block for dynamism (apac…

36e95ee

…he#7273)

electriclilies pushed a commit to electriclilies/tvm that referenced this pull request Feb 18, 2021

[CUDA] [Codegen] Ensuring atleast one thread block for dynamism (apac…

c350ef7

…he#7273)

junrushao mentioned this pull request Nov 1, 2021

Apache TVM v0.8 Release Note Candidate #9416

Closed
