[quant] Enable fusion for conv modules with bias #36173
Conversation
Summary: Previously we were ignoring the conv bias during training if it existed. This PR adds the bias from the conv op during the conv+bn fusion process. Test Plan: python test/quantization/test_quantization.py
Diff hunk under discussion (training branch). Previous code:
    conv = conv * rescale_factor.reshape([1, -1, 1, 1])
    conv = conv + (self.beta - self.gamma * batch_mean * batch_rstd).reshape([1, -1, 1, 1])
Updated code (first line of the new expression):
    conv = (self.gamma * batch_rstd).reshape([1, -1, 1, 1]) * conv_orig + \
The correct equation is
conv = (self.gamma * batch_rstd).reshape([1, -1, 1, 1]) * (conv_orig - batch_mean).reshape([1, -1, 1, 1]) + self.beta.reshape([1, -1, 1,1])
This would give incorrect results as batch_mean and batch_var need to take the bias into account
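A toy check of this point (hypothetical numbers and names, not the module code): if the batch statistics are computed from the bias-free conv output while the activations being normalized include the bias, the result is shifted by roughly gamma * bias / std.

```python
import torch

torch.manual_seed(0)
gamma, beta, bias, eps = 1.0, 0.0, 3.0, 1e-5

x = torch.randn(10000)   # stand-in for the conv output before the bias add
y = x + bias             # bias-included output that batch norm actually sees

# Stats computed on the bias-included output (consistent with the activations)
ok = gamma * (y - y.mean()) / torch.sqrt(y.var(unbiased=False) + eps) + beta

# Stats computed while ignoring the bias (inconsistent with the activations)
bad = gamma * (y - x.mean()) / torch.sqrt(x.var(unbiased=False) + eps) + beta

print((bad - ok).mean())  # ~ gamma * bias / std, i.e. ~3 here, rather than ~0
```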
Diff hunk under discussion (training branch). Previous code:
    conv = conv + (self.beta - self.gamma * batch_mean * batch_rstd).reshape([1, -1, 1, 1])
Updated code:
    conv = (self.gamma * batch_rstd).reshape([1, -1, 1, 1]) * conv_orig + \
        (self.beta - self.gamma * batch_rstd * batch_mean).reshape([1, -1, 1, 1])
We also need to change the equation in the else branch (when training is false) to:
conv = conv + (self.gamma * (self.bias - self.running_mean) / running_std).reshape([1, -1, 1, 1]) + self.beta.reshape([1, -1, 1, 1])
I don't think this needs to change since the bias is baked into self.running_mean and running_std after training.
That's not true. Think of it this way: if I add a bias to my output during training, the running mean will be the bias (assume the conv output prior to the bias add is zero mean).
So the output will be gamma/sigma * (y + bias - mean) + beta.
If you don't have the bias, the output will be gamma/sigma * (y - mean) + beta, which will be incorrect.
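Roughly in numbers (a toy illustration of the argument above, not the module code; gamma = 1, beta = 0, bias = 2, and a zero-mean pre-bias conv output are assumed):

```python
import torch

torch.manual_seed(0)
gamma, beta, bias = 1.0, 0.0, 2.0

y = torch.randn(10000)            # conv output prior to the bias add (~zero mean)
running_mean = (y + bias).mean()  # tracked on the biased output during training -> ~bias
running_std = (y + bias).std()    # ~1

# Eval output with the bias applied (what the float model computes)
with_bias = gamma / running_std * (y + bias - running_mean) + beta

# Eval output if the fused module drops the bias
without_bias = gamma / running_std * (y - running_mean) + beta

print((with_bias - without_bias).mean())  # ~ gamma * bias / sigma, i.e. ~2, so the outputs diverge
```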
Thanks for the explanation! Updated it.
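For reference, a minimal sketch of the training-branch computation the final equation implements, written as a standalone function with assumed parameter names (x, weight, bias, gamma, beta, eps); this is not the module source:

```python
import torch
import torch.nn.functional as F

def fused_convbn_train(x, weight, bias, gamma, beta, eps=1e-5):
    """Training-time conv+bn fusion with the conv bias folded in (sketch only).

    Batch statistics are computed on the bias-included conv output, and the
    batch norm affine transform is applied as a single per-channel scale and
    shift, matching the fused form in the diff above.
    """
    conv = F.conv2d(x, weight)  # conv without bias
    conv_orig = conv if bias is None else conv + bias.reshape([1, -1, 1, 1])

    batch_mean = conv_orig.mean(dim=[0, 2, 3])
    batch_var = conv_orig.var(dim=[0, 2, 3], unbiased=False)
    batch_rstd = torch.rsqrt(batch_var + eps)

    # y = gamma * (conv_orig - batch_mean) * batch_rstd + beta, in fused form
    return (gamma * batch_rstd).reshape([1, -1, 1, 1]) * conv_orig + \
           (beta - gamma * batch_rstd * batch_mean).reshape([1, -1, 1, 1])
```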
Please see comments, thanks!
This pull request has been merged in 6972c27.
Stack from ghstack:

Summary:
Previously we were ignoring the conv bias during training if it existed. This PR adds the bias from the conv op during the conv+bn fusion process.

Test Plan:
python test/quantization/test_quantization.py

Differential Revision: D20921613
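In the spirit of the test plan, a quick numerical sanity check one could run (a sketch with made-up shapes and parameter values, not the actual test in test_quantization.py): the fused training-time equation with the bias should match an unfused conv (with bias) followed by batch normalization over the same batch.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
x = torch.randn(4, 3, 8, 8)
weight = torch.randn(5, 3, 3, 3)
bias = torch.randn(5)
gamma, beta, eps = torch.rand(5) + 0.5, torch.randn(5), 1e-5

# Reference: conv with bias, then plain batch norm over this batch
ref_conv = F.conv2d(x, weight, bias)
mean = ref_conv.mean(dim=[0, 2, 3])
var = ref_conv.var(dim=[0, 2, 3], unbiased=False)
ref = gamma.reshape(1, -1, 1, 1) * (ref_conv - mean.reshape(1, -1, 1, 1)) \
      / torch.sqrt(var + eps).reshape(1, -1, 1, 1) + beta.reshape(1, -1, 1, 1)

# Fused form from this PR's training-branch equation
rstd = torch.rsqrt(var + eps)
conv_orig = F.conv2d(x, weight) + bias.reshape(1, -1, 1, 1)
fused = (gamma * rstd).reshape(1, -1, 1, 1) * conv_orig + \
        (beta - gamma * rstd * mean).reshape(1, -1, 1, 1)

print(torch.allclose(ref, fused, atol=1e-5))  # expected: True
```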