support dp multi-gpu training for QAT quantizer #4127
Conversation
Have rebased.

force-pushed from 50e858b to 102a0a4
module = layer.module
module.register_buffer("zero_point", torch.tensor([0.0]))
module.register_buffer("scale", torch.tensor([1.0]))
module.register_buffer('ema_decay', torch.tensor([0.99]))
if "weight" in config.get("quant_types", []):
    layer.module.register_buffer('weight_bits', torch.zeros(1))
Maybe we can also del `layer` here.
done
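For context, a minimal standalone sketch (a plain `nn.Conv2d`, not the NNI wrapper) of why the quantization state is kept in registered buffers rather than plain Python attributes: buffers are part of the module, so DataParallel replicates them and `state_dict()` saves them.

```python
import torch
import torch.nn as nn

# Hypothetical standalone module; the real code registers these buffers on
# the wrapped layer's module inside the quantizer.
conv = nn.Conv2d(3, 8, 3)
conv.register_buffer("zero_point", torch.tensor([0.0]))
conv.register_buffer("scale", torch.tensor([1.0]))

print("scale" in dict(conv.named_buffers()))  # True -- replicated by DataParallel
print("scale" in conv.state_dict())           # True -- saved with the model
```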
@@ -479,43 +480,55 @@ def quantize_weight(self, wrapper, **kwargs):
quant_start_step = config.get('quant_start_step', 0)
assert weight_bits >= 1, "quant bits length should be at least 1"

# we dont update weight in evaluation stage
if quant_start_step > self.bound_model.steps:
if quant_start_step > int(self.bound_model.steps):
Why do we add this specific datatype conversion here?
It is not a good idea to directly compare a Python int with a torch.Tensor, which may lead to unpredictable errors.
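To make the concern concrete, here is a small standalone sketch (not NNI code) of how the comparison behaves with and without the int() conversion:

```python
import torch

steps = torch.tensor([5])        # e.g. a step counter kept as a one-element buffer
quant_start_step = 10

# Comparing a Python int with a Tensor yields a Tensor, not a bool.
print(quant_start_step > steps)          # tensor([True])
# Using it in `if` only works because the tensor has exactly one element;
# a multi-element tensor would raise "Boolean value of Tensor ... is ambiguous".

# Converting first makes the comparison an ordinary Python bool.
print(quant_start_step > int(steps))     # True
```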
module.scale, module.zero_point = update_quantization_param(weight_bits, rmin, rmax)
scale, zero_point = update_quantization_param(weight_bits, rmin, rmax)
module.scale.copy_(scale)
module.zero_point.copy_(zero_point)
What is the difference between `copy_` and assigning directly?
As described in the PyTorch DataParallel documentation:
In each forward, module is replicated on each device, so any updates to the running module in forward will be lost. For example, if module has a counter attribute that is incremented in each forward, it will always stay at the initial value because the update is done on the replicas which are destroyed after forward. However, DataParallel guarantees that the replica on device[0] will have its parameters and buffers sharing storage with the base parallelized module. So in-place updates to the parameters or buffers on device[0] will be recorded.
If we assign directly, the updates to scale, zero_point, and the tracked statistics will be lost.
thx! got it
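For illustration, a minimal sketch of the behaviour quoted above (hypothetical Counter module, assumes at least one CUDA device; not the NNI wrapper):

```python
import torch
import torch.nn as nn

class Counter(nn.Module):
    def __init__(self):
        super().__init__()
        self.register_buffer("steps", torch.zeros(1))

    def forward(self, x):
        # In-place update: the replica on device[0] shares storage with the
        # base module's buffer, so this change survives the forward pass.
        self.steps.copy_(self.steps + 1)
        # A direct assignment would only rebind the attribute on the replica,
        # which is thrown away after forward:
        # self.steps = self.steps + 1
        return x

model = nn.DataParallel(Counter().cuda())
model(torch.randn(4, 3).cuda())
print(model.module.steps)  # tensor([1.], device='cuda:0') with copy_;
                           # it would stay at 0. with the assignment variant
```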
tracked_min_output = update_ema(module.tracked_min_output, current_min,
    module.ema_decay)
tracked_max_output = update_ema(module.tracked_max_output, current_max,
    module.ema_decay)
Align the continuation lines.
done