Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Operators] Conv2d gemm fp16 performance enhancement #285

Closed
wants to merge 37 commits into from
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
37 commits
Select commit Hold shift + click to select a range
803d302
lint again for some reason
Aalanli Jun 9, 2023
a59ba09
lint again for some reason
Aalanli Jun 9, 2023
0be7d38
Merge branch 'main' of https://github.com/Aalanli/hidet into main
Aalanli Jun 9, 2023
06a9cd0
nevermind
Aalanli Jun 9, 2023
f550d2f
Merge branch 'main' of https://github.com/Aalanli/hidet into main
Aalanli Jun 12, 2023
8ae48da
minor fixes
Jun 14, 2023
e0433b8
post-conv-algorithm
Aalanli Jun 14, 2023
455f519
algorithm benchmark changes
Jun 14, 2023
629a54a
docs
Jun 14, 2023
36f7288
todo convgemmfusedfp16
Aalanli Jun 14, 2023
4cfa8e7
prototype kernel
Jun 15, 2023
952d316
further bug fix
Jun 15, 2023
77a2fa3
fused fp16 conv gemm kernel
Aalanli Jun 16, 2023
a572d72
finished kernel for group=1
Aalanli Jun 17, 2023
ac49774
minor bug fix
Aalanli Jun 17, 2023
fd19d60
Merge branch 'hidet-org:main' into main
Aalanli Jun 17, 2023
28f4d88
Merge branch 'main' into conv2d-fp16
Aalanli Jun 17, 2023
3ace805
finished conv
Aalanli Jun 18, 2023
530148f
remove dead code
Aalanli Jun 18, 2023
30886cf
add tests
Aalanli Jun 18, 2023
a565775
parallel_k test
Aalanli Jun 18, 2023
eb92746
pk part heuristic
Aalanli Jun 18, 2023
8cfe845
update heuristic
Jun 18, 2023
29340df
lint
Jun 18, 2023
5ecbfaa
finished conv2d gemm
Aalanli Jun 19, 2023
3935aa5
performance alteration
Jun 19, 2023
3ba62b5
format
Jun 19, 2023
35b081d
disable cpu tests due to numerical instability
Jun 19, 2023
cfd4211
performance enhancement
Jun 20, 2023
3caf05e
format
Jun 20, 2023
e5acafa
Merge branch 'main' into conv2d-fp16
Jun 20, 2023
b435f45
temporary commit
Jun 21, 2023
b56b2c2
make matmul_f16 work with dim a multiple of 4
Aalanli Jun 21, 2023
a5485b8
remove
Aalanli Jun 21, 2023
d99eac7
new transform op
Aalanli Jun 22, 2023
cfd72bd
tests
Jun 23, 2023
8963719
pre_transform
Jun 23, 2023
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Loading