[Library] Add cublas library #404

yaoyaoding · 2024-01-03T02:56:28Z

This PR add cublas to hidet.

Check the tests/cuda/test_cublas.py for the usage of cublas in hidet.

The issue is caused by a wrong layout for the bias tensor. For example, we consider a bias tensor of shape (64, ) and its layout can be written as `(64, ): (1, )` However, we can expand the layout by adding axes with 1-shape. For example, `(64, 1):(1, 1)` Since the shape is equal to 1, the stride can be any number. The stride corresponding to the 1-shape actually doesn't affect the computation of the address. But two strides that are equal to one will influence the instruction selection, and the invalid memory instruction leads to the misaligned access. To fix this issue, we force the stride paired with 1-shape to be 0. The layout is equivalent when computing the memory address, and this will help the compiler make the right decision in the instruction selection pass. closes #404 Co-authored-by: xiaocenxiaocen <xiao.zhang@centml.ai>

yaoyaoding added 4 commits January 2, 2024 21:53

lint & format

ce042ac

typos

c61516b

format

4500755

fix

af0fb02

yaoyaoding merged commit f70e5e6 into hidet-org:main Jan 3, 2024
2 checks passed

yaoyaoding deleted the cublass branch January 3, 2024 17:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Library] Add cublas library #404

[Library] Add cublas library #404

yaoyaoding commented Jan 3, 2024

[Library] Add cublas library #404

[Library] Add cublas library #404

Conversation

yaoyaoding commented Jan 3, 2024