[bug fix] Torch Classifier agent should call self.backward(loss) #4270

dexterju27 · 2021-12-21T18:11:28Z

Patch description
The current torch classifier agent uses loss.backward instead of self.backward(loss). A lot of optimizer behaviors depend on the backward function. While the backward function was ignored in the previous version.
For example:

In SafeFP16Optimizer, the fp32 gradient will be zeros as no synchronizing flag is set. The model doesn't train.

ParlAI/parlai/utils/fp16.py

Line 200 in 66c71e0

if self._needs_sync:

.
In MemoryEfficientFP16Optimizer the loss loss_scale multiplier is omitted. We should improve performance with this fix.

ParlAI/parlai/utils/fp16.py

Line 521 in 66c71e0

loss = loss * self.scaler.loss_scale

Testing steps
Train any classifier agent. Verified with a classifier agent training with SafeFP16Optimizer.
In our case, we retrained a safety classifier and compared its performance with the model in the zoo.
FP32 matched the performance, and FP16 performance worse. The performance shifted a bit between the tasks, we are comparing the average class_notok__ f1, it is the validation metric used for all the trainings.

classifier agent should call self.backward(loss)

e1ee3ce

dexterju27 requested review from stephenroller, EricMichaelSmith, jaseweston, emilydinan and jxmsML December 21, 2021 18:11

facebook-github-bot added the CLA Signed label Dec 21, 2021

jaseweston approved these changes Dec 31, 2021

View reviewed changes

jaseweston merged commit 27f22d5 into main Dec 31, 2021

jaseweston deleted the classifier-agent-use-backward branch December 31, 2021 01:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[bug fix] Torch Classifier agent should call self.backward(loss) #4270

[bug fix] Torch Classifier agent should call self.backward(loss) #4270

dexterju27 commented Dec 21, 2021 •

edited

Loading

[bug fix] Torch Classifier agent should call self.backward(loss) #4270

[bug fix] Torch Classifier agent should call self.backward(loss) #4270

Conversation

dexterju27 commented Dec 21, 2021 • edited Loading

dexterju27 commented Dec 21, 2021 •

edited

Loading