[BYOC] Enable bfloat16 in DNNL BYOC #11111

yangulei · 2022-04-25T08:00:14Z

Enable bfloat16 in DNNL BYOC following the path:

[float32 graph] --> <AMP> --> [bfloat16 graph] --> <BYOC> --> [TVM + oneDNN module]

Main work include:

Enable more data types in DNNL json runtime (only bfloat16 has been tested so far).
Consider dtype while querying optimal DNNL layout.
Add tests for bf16 DNNL BYOC.

With those improvements, a float32 graph could be converted to bfloat16 through AMP, and then be lowered by native codegen or consumed by oneDNN and finally inference in bfloat16 mode now.

yangulei · 2022-05-26T03:33:52Z

Relative modifications since the original PR:

All the CI checks have passed now, what a long journey.
@masahi Could you please help to review this PR, thanks.

masahi · 2022-05-26T07:31:05Z

tests/python/contrib/test_dnnl.py

@@ -37,6 +37,8 @@
    ids=["compile", "run"],
 )

+bf16_supported = "avx512" in open("/proc/cpuinfo", "r").read()


Probably need more precise detection, but ok.

masahi · 2022-05-26T07:36:20Z

cc @AndrewZhaoLuo this is cool (the first e2e run of AMP + bf16!!)

yangulei · 2022-05-26T07:43:01Z

Thanks a lot.
We are working on the e2e performance optimization in DNNL-BYOC 😄

for simplicity in DNNL run-time; we need to remove TR, and maybe move to apache#11111

yangulei force-pushed the upstream_byoc_bf16 branch from be5f228 to e2fcfd1 Compare April 26, 2022 02:53

This was referenced Apr 27, 2022

[CI] update oneDNN to v2.6 #11140

Merged

Fix mixed precision output type to original type #11142

Merged

yangulei force-pushed the upstream_byoc_bf16 branch 4 times, most recently from 41e9720 to 4caede4 Compare May 26, 2022 00:38

yangulei added 18 commits May 26, 2022 08:39

refine the code style (apache#10112)

686a9ff

support more data types in oneDNN BYOC

3b2b8e2

consider dtype when query layout

1fe4c4d

support more translation of blocked layout

bfcfe62

refine log for invalid layout transform

bc3d8ca

reset N and C for the weights

82c49c3

support multi-blocking in TransDims2Plain()

b5be62e

add tests for bf16 oneDNN BYOC

6ff6613

unregister 'round' OP in oneDNN BYOC

f7832c9

restore the criteria for fp32 tests

52fac7e

disable test_prune_dnnl_subgraph for bf16

6e194dc

fix typo in dnnl.py

43f6e78

delete tag::format_tag_last

388341d

delete 'is_weight' in layout2tag()

e91c23e

reuse dtype_dl2dnnl()

a161d2e

fix lint errors

a9d0ad8

change to WARNING for invalid laytout transform

a42764a

skip bf16 tests if AVX512 is unavailable

4caede4

masahi reviewed May 26, 2022

View reviewed changes

masahi approved these changes May 26, 2022

View reviewed changes

masahi merged commit 8135860 into apache:main May 26, 2022

masahi mentioned this pull request May 26, 2022

[DNNL] Add TensorRequisite concept. Multi instance support #11345

Merged

driazati mentioned this pull request Jul 14, 2022

TVM v0.9.0.rc0 Release Candidate Notes #12102

Closed

liaopeiyuan added a commit to zk-ml/tachikoma that referenced this pull request Sep 14, 2022

roll back to 8a0249c

39d543a

for simplicity in DNNL run-time; we need to remove TR, and maybe move to apache#11111

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BYOC] Enable bfloat16 in DNNL BYOC #11111

[BYOC] Enable bfloat16 in DNNL BYOC #11111

yangulei commented Apr 25, 2022

yangulei commented May 26, 2022 •

edited

Loading

masahi May 26, 2022

masahi commented May 26, 2022

yangulei commented May 26, 2022

[BYOC] Enable bfloat16 in DNNL BYOC #11111

[BYOC] Enable bfloat16 in DNNL BYOC #11111

Conversation

yangulei commented Apr 25, 2022

yangulei commented May 26, 2022 • edited Loading

masahi May 26, 2022

Choose a reason for hiding this comment

masahi commented May 26, 2022

yangulei commented May 26, 2022

yangulei commented May 26, 2022 •

edited

Loading