[IR] Support integer subbyte #403
Conversation
xiaocenxiaocen
commented
Jan 2, 2024
- Support sub-byte integers in Hidet
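For context on what sub-byte integer support involves at the storage level, here is a minimal, illustrative pure-Python sketch of packing signed 4-bit integers two-per-byte and unpacking them again. The helper names are hypothetical and this is not Hidet's actual implementation, just the general technique:

```python
def pack_int4(values):
    """Pack a list of signed 4-bit ints (-8..7) into bytes, two per byte."""
    out = bytearray()
    for i in range(0, len(values), 2):
        lo = values[i] & 0xF                                  # low nibble
        hi = (values[i + 1] & 0xF) if i + 1 < len(values) else 0  # high nibble
        out.append(lo | (hi << 4))
    return bytes(out)


def unpack_int4(data, count):
    """Unpack `count` signed 4-bit ints from packed byte storage."""
    vals = []
    for b in data:
        for nibble in (b & 0xF, b >> 4):
            # sign-extend: nibbles 8..15 represent -8..-1
            vals.append(nibble - 16 if nibble >= 8 else nibble)
    return vals[:count]
```

A compiler-level implementation additionally has to handle vectorized loads/stores and alignment of the packed storage, but the nibble arithmetic is the same.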
Hi @xiaocenxiaocen, let me know when the PR is ready to be reviewed, thanks!
Sure. I will work on this over this week and the next.
a2b7795
to
803b1c2
Compare
Hi, @yaoyaoding. This PR is ready for review. Please take a look at it. Thanks.
hidet-ci launch
Thanks @xiaocenxiaocen for the support of the sub-byte integer type!
It looks good to me overall, and I left some minor suggestions to make some parts more consistent with the existing implementation (like the data type).
Feel free to merge this PR by yourself after you resolve those comments.
$hidet-ci launch
$hidet-ci launch
25e9a56
to
87cc2b7
Compare
$hidet-ci launch
87cc2b7
to
7121c88
Compare
$hidet-ci launch
This PR:
1. Added `torch.Tensor.as_strided` and `torch.flip`
2. Added support for `rounding_mode == 'trunc'` in `torch.divide`
3. Registered `torch.new_ones`

Longformer model compilation fails with:
```
RuntimeError: cudaDeviceSynchronize failed with error: cudaErrorMisalignedAddress
```
after running the `fused_matmul_f16_pk_cute_rearrange_add` kernel. Nvidia Nsight Compute also shows that the matmul kernel fails to launch. This PR contains all the changes needed to reproduce the issue. To reproduce:
1. Check out the `zhumakhan/longformer` branch
2. Run `python3 tests/benchmarks/bench_transformer.py longformer`

---------
Co-authored-by: Zhumakhan <nazirzhumakhan@gmail.com>
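For reference, `rounding_mode='trunc'` in `torch.divide` rounds the quotient toward zero, unlike Python's `//`, which rounds toward negative infinity. A minimal pure-Python sketch of the distinction (illustrative only; `divide_trunc` is a hypothetical helper, not the registered Hidet operator):

```python
def divide_trunc(a, b):
    """Integer division rounding toward zero, matching C semantics and
    torch.divide(..., rounding_mode='trunc')."""
    q = abs(a) // abs(b)
    # negate the magnitude when the operands have opposite signs
    return q if (a >= 0) == (b >= 0) else -q


# Floor division and truncated division differ only on negative quotients:
# -7 // 2 == -4 (floor), while trunc(-7 / 2) == -3.
```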