[TIR][Schedule] Derive Nonnegative Bounds from Shape Var #15210

junrushao · 2023-07-03T03:51:22Z

This PR enhance the arithmetic analysis used in compute-at to further help symbolic bound simplification.

Previously, when a variable n appears in the shape of an input buffer T.Buffer((n * 32), "float32"), we could safely assume that n is nonnegative as it is part of the shape. This could help us simplify some bounds during scheduling as well as lowering.

For example, for integers n and bx where bx has a symbolic bound [0, 32 * n), if n is nonnegative, we could simplify the following expressions to True:

0 <= floordiv(bx, n) < 32
0 <= floormod(bx, n) < n

This PR depends on #15193 to provide an interface that hints analyzer.

tvm-bot · 2023-07-03T03:51:25Z

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

cc @Hzfengsy, @quic-sanirudh, @shingjan _{See #10317 for details}

_{Generated by tvm-bot}

This PR enhance the arithmetic analysis used in compute-at to further help symbolic bound simplification. Previously, when a variable `n` appears in the shape of an input buffer `T.Buffer((n * 32), "float32")`, we could safely assume that `n` is nonnegative as it is part of the shape. This could help us simplify some bounds during scheduling as well as lowering. For example, for integers `n` and `bx` where `bx` has a symbolic bound `[0, 32 * n)`, if `n` is nonnegative, we could simplify the following expressions to True: ``` 0 <= floordiv(bx, n) < 32 0 <= floormod(bx, n) < n ``` This PR depends on apache#15193 to provide an interface that hints analyzer.

This PR enhances Decode-GEMV rule with the following changes: - Normalize the GEMV iter domain to S-R-C via transform-block-layout. This would help with further analysis and scheduling, in cases for example, when there was no spatial loop in the original reduction block. - Get rid of the ad hoc iter type analysis, including the logic calling into a TVM packed func `tir.schedule.GetLoopIterType` using `tvm._ffi.get_global_func`. - Split out the logic for two separate cases of scheduling, where the innermost dimension is spatial or reduction. - Introduces `suggest_threads_per_block` to guess the threads to be allocated each threadblock. This helps avoid the previous case where dlight allocates 256 threads for a workload whose degree of parallelism is only 128. - Misc improvements. This rest of the changes are split out to separate PRs that are already merged to main. - [x] Pass the hints to arithmetic analyzer that shape variables should be positive ones (apache#15210) - [x] Eliminate unnecessary block predicate generation - should be provable via affine analysis (apache#15193) - [x] Shrink local memory allocation if only one element `X[threadIdx.x]` is used (apache#15207)

junrushao marked this pull request as ready for review July 3, 2023 03:51

junrushao mentioned this pull request Jul 3, 2023

[Dlight] Enhance Decode-GEMV Schedule #15195

Merged

3 tasks

junrushao force-pushed the feature/2023-07-02/compute-at-symbolic-bound branch from 1052628 to 17faf25 Compare July 3, 2023 05:14

yzh119 approved these changes Jul 3, 2023

View reviewed changes

tqchen approved these changes Jul 3, 2023

View reviewed changes

tqchen merged commit 03ef29e into apache:main Jul 3, 2023

ysh329 mentioned this pull request Oct 18, 2023

[Release] v0.14.0 Release Candidate Notes #15948

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TIR][Schedule] Derive Nonnegative Bounds from Shape Var #15210

[TIR][Schedule] Derive Nonnegative Bounds from Shape Var #15210

junrushao commented Jul 3, 2023

tvm-bot commented Jul 3, 2023 •

edited

Loading

[TIR][Schedule] Derive Nonnegative Bounds from Shape Var #15210

[TIR][Schedule] Derive Nonnegative Bounds from Shape Var #15210

Conversation

junrushao commented Jul 3, 2023

tvm-bot commented Jul 3, 2023 • edited Loading

tvm-bot commented Jul 3, 2023 •

edited

Loading