Add support for bucketize #18040

kavin-sai-krishna · 2025-06-05T07:46:34Z

This PR adds support for the bucketize op, which is used in many vision models such as Phi4 and SmolVLM.

tqchen · 2025-06-05T11:40:05Z

For operators like this one, we also need a legalization rule so we know how to lower them. We don't want to end up in a situation where we have the ops but cannot lower/compile them. cc @tlopex

kavin-sai-krishna · 2025-06-05T11:45:00Z

> for operators like this one, we also need legalization rule to know how to lower them

@tqchen I came across the PyTorch implementation of this operation and noticed that they used searchsorted. Following that approach, I’ve used topi.searchsorted to lower the operation. I also tested the implementation numerically with boundaries = [0, 2, 4, 6, 8, 10] and input = [-1, 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11], and the results appear to be correct.
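The numerical check described above can be reproduced without TVM or PyTorch. The following is a minimal pure-Python sketch, assuming the usual bucketize convention in which the default (`right=False`) behaves like a left-sided search over sorted boundaries. The helper name `bucketize_reference` is hypothetical and is not part of this PR; it only serves as a reference to compare compiled output against.

```python
from bisect import bisect_left, bisect_right


def bucketize_reference(values, boundaries, right=False):
    """Hypothetical reference helper: bucket index for each value.

    Assumes `boundaries` is sorted in ascending order.
    right=False: returns i such that boundaries[i-1] <  v <= boundaries[i]
    right=True:  returns i such that boundaries[i-1] <= v <  boundaries[i]
    """
    search = bisect_right if right else bisect_left
    return [search(boundaries, v) for v in values]


# Same data as in the comment above.
boundaries = [0, 2, 4, 6, 8, 10]
values = [-1, 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11]

left_result = bucketize_reference(values, boundaries)
right_result = bucketize_reference(values, boundaries, right=True)

print(left_result)   # [0, 0, 1, 1, 2, 2, 3, 3, 4, 4, 5, 5, 6]
print(right_result)  # [0, 1, 1, 2, 2, 3, 3, 4, 4, 5, 5, 6, 6]
```

Values that land exactly on a boundary (0, 2, 4, ...) are the cases where the two modes differ, so they are worth including in any test input, as the data above already does.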

tqchen · 2025-06-05T11:48:53Z

Thanks, it would be good to go through the checklist below for adding a new op.

C0: Can the operator be decomposed into smaller ops? If so, it may not be necessary to introduce the high-level op. It is helpful to reuse existing ops when possible so we don't have to go through C1/C2; otherwise we need C1/C2
C1: Introduce legalization rules so the op at least compiles on CPU
C2: Make sure the op compiles on CUDA
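To illustrate what C0 could mean for bucketize specifically, here is a hedged sketch of one conceivable decomposition: compare every input with every boundary and count the boundaries that are strictly smaller. This is an assumption made for illustration only and is not what this PR does (the PR lowers through topi.searchsorted). The function name `bucketize_by_counting` is hypothetical. In tensor terms the same idea would be a broadcast comparison followed by a sum reduction.

```python
from bisect import bisect_left


def bucketize_by_counting(values, boundaries):
    """Hypothetical decomposition: bucket index = number of boundaries < value.

    For sorted boundaries this matches a left-sided search, but it costs
    O(len(values) * len(boundaries)) instead of a binary search per value.
    """
    return [sum(1 for b in boundaries if b < v) for v in values]


boundaries = [0, 2, 4, 6, 8, 10]
values = list(range(-1, 12))

counted = bucketize_by_counting(values, boundaries)
searched = [bisect_left(boundaries, v) for v in values]

# Both formulations agree on this input.
print(counted == searched)  # True
```

The trade-off is the usual one: the counting form reuses elementwise and reduction ops that already lower everywhere, while a dedicated search does less work when the boundary list is long.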

tqchen · 2025-06-05T11:50:59Z

I think the main question is how to make sure it runs on CUDA.

kavin-sai-krishna · 2025-06-05T12:48:28Z

@tqchen Thank you. I understood the high-level idea you suggested, but I have a few specific questions regarding the design choices:

Q1: What’s the difference between decomposing an op using Relax ops vs. TOPI ops vs. TIR?
How does the abstraction level impact performance or correctness?
Q2: If the op compiles on CUDA, is numerical verification still required?
Q3: Are there nightly tests that check numerical correctness?
I ask because I found a case (fmod) where the op ran but didn't match PyTorch output.

tqchen · 2025-06-05T13:51:37Z

> Q1: What’s the difference between decomposing an op using Relax ops vs. TOPI ops vs. TIR?

Given this is the Relax importer, we can choose either option as long as the results are correct. When possible, decomposing via Relax ops and then legalizing gives the most flexibility in choosing a lowering path. We should aim to reduce the total number of core Relax ops.

Q2/Q3

Yes, ideally we should have a nightly test validating correctness.

We can add such tests to

https://github.com/apache/tvm/tree/main/tests/python/nightly

nightly/relax/test_relax_op_numeric.py

kavin-sai-krishna · 2025-06-06T04:21:21Z

@tqchen Thanks for your response. I'll make sure the checklist items are satisfied, but I'm not sure what I should do if C2 is not met.

kavin-sai-krishna · 2025-06-16T12:32:13Z

@tqchen I've updated the op to compile and run on CUDA as you requested. Can you please review it?

tlopex · 2025-06-28T11:00:54Z

Please resolve the conflicts so that we can merge it :)
@kavin-sai-krishna

kavin-sai-krishna · 2025-06-30T09:53:40Z

@tlopex I've resolved the conflicts. Can you please take a look?

tlopex

LGTM! Thanks!

* add support for bucketize
* fix lint issue
* Fix lint issue
* Add GPU code for bucketize
* Resolve merge conflict
* Fix lint issue

tqchen assigned tlopex Jun 5, 2025

kavin-sai-krishna added 4 commits June 16, 2025 11:24

add support for bucketize

2b5b10a

fix lint issue

46d2bb4

Fix lint issue

f77e164

Add GPU code for bucketize

1545a3e

kavin-sai-krishna force-pushed the bucketize branch from 6bd491f to 1545a3e Compare June 16, 2025 05:55

kavin-sai-krishna added 3 commits June 30, 2025 12:04

Resolve merge conflict

8165c1d

Merge branch 'main' into bucketize

00f0b16

Fix lint issue

55e8540

tlopex approved these changes Jun 30, 2025

tlopex merged commit 9eb8b30 into apache:main Jun 30, 2025
10 checks passed

ysh329 mentioned this pull request Jul 16, 2025

[Release] v0.21.0 Release Candidate Notes #18150

Closed

ShiboXing pushed a commit to ShiboXing/tvm that referenced this pull request Aug 10, 2025

Add support for bucketize (apache#18040)

7e77e05

* add support for bucketize
* fix lint issue
* Fix lint issue
* Add GPU code for bucketize
* Resolve merge conflict
* Fix lint issue
