[v1.x][Feature] Add flag for disabling oneDNN BRGEMM implementation of FC #20450

bgawrych · 2021-07-15T08:10:59Z

Description

In new oneDNN version there are BRGEMM kernels for FullyConnected - it require special memory format of weights.
This PR let user to decide if BRGEMM implementation should be used by flag - it can significantly speedup FC execution for large tensors (got 42% speedup on BERT with 64 batch size ) - feature disabled by default as it's not so efficient on small tensors.

Checklist

Essentials

PR's title starts with a category (e.g. [BUGFIX], [MODEL], [TUTORIAL], [FEATURE], [DOC], etc)
Changes are complete (i.e. I finished coding on this PR)

mxnet-bot · 2021-07-15T08:11:03Z

Hey @bgawrych , Thanks for submitting the PR
All tests are already queued to run once. If tests fail, you can trigger one or more tests again with the following commands:

To trigger all jobs: @mxnet-bot run ci [all]
To trigger specific jobs: @mxnet-bot run ci [job1, job2]

CI supported jobs: [edge, sanity, centos-cpu, centos-gpu, windows-gpu, clang, miscellaneous, unix-gpu, unix-cpu, website, windows-cpu]

Note:
Only following 3 categories can trigger CI :PR Author, MXNet Committer, Jenkins Admin.
All CI tests must pass before the PR can be merged.

bgawrych · 2021-07-15T11:03:02Z

@mxnet-bot run ci [unix-cpu]

mxnet-bot · 2021-07-15T11:03:08Z

Jenkins CI successfully triggered : [unix-cpu]

anko-intel · 2021-07-15T11:29:29Z

src/operator/nn/mkldnn/mkldnn_base-inl.h

@@ -312,7 +312,8 @@ inline static mkldnn::memory::desc GetFCWeightDesc(const NDArray &arr, int dtype
  for (size_t i = 0; i < dims.size(); i++) dims[i] = arr.shape()[i];
  auto format = mkldnn::memory::format_tag::any;
  // for batch 256 alexnet benchmark test
-  if (dims.size() == 2) {
+  const bool brgemm_disabled = dmlc::GetEnv("MXNET_DISABLE_ONEDNN_BRGEMM_FC", true);


It will be good to add description to docs/static_site/src/pages/api/faq/env_var.md
Also I am not sure if for 1.x branch the name have to include ONEDNN ?
so maybe MXNET_MKLDNN_DISABLE_BRGEMM_FC

bgawrych · 2021-07-16T05:45:51Z

@szha Can you help with merge?

sfraczek

LGTM

szha

Looks good to me. Ideally we don't want to introduce more environment variables, but instead make the decision automatically on which implementation to use based on input size.

anko-intel · 2021-07-16T17:09:40Z

@mxnet-bot run ci [unix-gpu]

mxnet-bot · 2021-07-16T17:09:44Z

Unauthorized access detected.
Only following 3 categories can trigger CI :
PR Author, MXNet Committer, Jenkins Admin.

bgawrych · 2021-07-16T17:41:41Z

@mxnet-bot run ci [unix-gpu]

mxnet-bot · 2021-07-16T17:41:46Z

Jenkins CI successfully triggered : [unix-gpu]

TaoLv · 2021-08-10T08:22:23Z

src/operator/nn/mkldnn/mkldnn_base-inl.h

@@ -312,7 +312,8 @@ inline static mkldnn::memory::desc GetFCWeightDesc(const NDArray &arr, int dtype
  for (size_t i = 0; i < dims.size(); i++) dims[i] = arr.shape()[i];
  auto format = mkldnn::memory::format_tag::any;
  // for batch 256 alexnet benchmark test
-  if (dims.size() == 2) {
+  const bool brgemm_disabled = dmlc::GetEnv("MXNET_MKLDNN_DISABLE_BRGEMM_FC", true);
+  if (dims.size() == 2 && brgemm_disabled) {


Could you please provide more benchmarking data with different m/n/k and formats?

BTW, actually brgemm_disabled looks misleading to me. According to the code change, i would rather call the flag force_plain_format or force_ab_format.

…f FC (apache#20450) * Add flag for disabling oneDNN BRGEMM implementation of FC * Review fixes * Update env_var.md

* [v1.x][Feature] Add flag for disabling oneDNN BRGEMM implementation of FC (#20450) * Add flag for disabling oneDNN BRGEMM implementation of FC * Review fixes * Update env_var.md * [v1.x] Enabling BRGEMM FullyConnected based on shapes (#20533) * Enable brgemm based on input info * fix sanity * Review fixes * Change function name * Fix typo * Align variable assignments * Fix review * use const reference * Update flag name

…f FC (apache#20450) * Add flag for disabling oneDNN BRGEMM implementation of FC * Review fixes * Update env_var.md

Add flag for disabling oneDNN BRGEMM implementation of FC

702d620

mseth10 added pr-awaiting-testing PR is reviewed and waiting CI build and test pr-work-in-progress PR is still work in progress and removed pr-awaiting-testing PR is reviewed and waiting CI build and test labels Jul 15, 2021

mseth10 added pr-awaiting-testing PR is reviewed and waiting CI build and test and removed pr-work-in-progress PR is still work in progress labels Jul 15, 2021

anko-intel approved these changes Jul 15, 2021

View reviewed changes

Review fixes

1f26964

bgawrych requested review from aaronmarkham and szha as code owners July 15, 2021 12:05

anko-intel approved these changes Jul 15, 2021

View reviewed changes

mseth10 added pr-awaiting-review PR is waiting for code review and removed pr-awaiting-testing PR is reviewed and waiting CI build and test labels Jul 15, 2021

$sfraczek$

sfraczek approved these changes Jul 16, 2021

View reviewed changes

Update env_var.md

f24049a

mseth10 added pr-awaiting-testing PR is reviewed and waiting CI build and test and removed pr-awaiting-review PR is waiting for code review labels Jul 16, 2021

szha approved these changes Jul 16, 2021

View reviewed changes

mseth10 added pr-work-in-progress PR is still work in progress and removed pr-awaiting-testing PR is reviewed and waiting CI build and test labels Jul 16, 2021

mseth10 added pr-awaiting-testing PR is reviewed and waiting CI build and test and removed pr-work-in-progress PR is still work in progress labels Jul 16, 2021

mseth10 added pr-awaiting-merge Review and CI is complete. Ready to Merge and removed pr-awaiting-testing PR is reviewed and waiting CI build and test labels Jul 16, 2021

szha merged commit a4c4fa0 into apache:v1.x Jul 16, 2021

TaoLv reviewed Aug 10, 2021

View reviewed changes

bgawrych mentioned this pull request Aug 17, 2021

[v1.x] Enabling BRGEMM FullyConnected based on shapes #20533

Merged

bgawrych mentioned this pull request Sep 3, 2021

[Backport] Enabling BRGEMM FullyConnected based on shapes #20568

Merged

bgawrych mentioned this pull request Sep 17, 2021

[v1.9.x] Enable oneDNN BRGEMM kernel for FullyConnected #20591

Closed

bgawrych mentioned this pull request Feb 22, 2022

[v1.9.x] Port oneDNN BRGEMM kernels #20910

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[v1.x][Feature] Add flag for disabling oneDNN BRGEMM implementation of FC #20450

[v1.x][Feature] Add flag for disabling oneDNN BRGEMM implementation of FC #20450

bgawrych commented Jul 15, 2021

mxnet-bot commented Jul 15, 2021

bgawrych commented Jul 15, 2021

mxnet-bot commented Jul 15, 2021

anko-intel Jul 15, 2021

bgawrych commented Jul 16, 2021

$@sfraczek$ sfraczek left a comment

szha left a comment

anko-intel commented Jul 16, 2021

mxnet-bot commented Jul 16, 2021

bgawrych commented Jul 16, 2021

mxnet-bot commented Jul 16, 2021

TaoLv Aug 10, 2021

[v1.x][Feature] Add flag for disabling oneDNN BRGEMM implementation of FC #20450

[v1.x][Feature] Add flag for disabling oneDNN BRGEMM implementation of FC #20450

Conversation

bgawrych commented Jul 15, 2021

Description

Checklist

Essentials

mxnet-bot commented Jul 15, 2021

bgawrych commented Jul 15, 2021

mxnet-bot commented Jul 15, 2021

anko-intel Jul 15, 2021

Choose a reason for hiding this comment

bgawrych commented Jul 16, 2021

sfraczek left a comment

Choose a reason for hiding this comment

szha left a comment

Choose a reason for hiding this comment

anko-intel commented Jul 16, 2021

mxnet-bot commented Jul 16, 2021

bgawrych commented Jul 16, 2021

mxnet-bot commented Jul 16, 2021

TaoLv Aug 10, 2021

Choose a reason for hiding this comment

$@sfraczek$ sfraczek left a comment