
Differentiate cost layer with other layers #380

Closed
emailweixu opened this issue Nov 7, 2016 · 2 comments

@emailweixu (Collaborator)

Currently, there is no way to tell whether a layer is a cost (loss) layer or not. However, there is a crucial difference between cost layers and non-cost layers: during backpropagation, a cost layer does not need a gradient for its output; the gradient of its output is implicitly assumed to be all ones. There are several benefits to adding a mechanism that differentiates cost layers from other layers (see the sketch after this list):

  • Prevent the incorrect use of the output of a cost layer as the input of another layer.
  • When a model has multiple output layers, including both cost and non-cost layers, the trainer should sum only over the cost layers when calculating the cost (using Argument::sumCost), excluding the non-cost layers, so that it reports the correct cost during training.
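
A minimal Python sketch of the kind of mechanism being proposed, assuming a hypothetical `is_cost` flag on layer configs. The names `LayerConfig`, `connect`, and `sum_training_cost` are illustrative only and are not part of Paddle's actual API (the real cost summation lives in the C++ `Argument::sumCost` mentioned above):

```python
# Illustrative sketch only; not the actual Paddle API.
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class LayerConfig:
    name: str
    is_cost: bool = False          # hypothetical flag marking a cost (loss) layer
    inputs: List["LayerConfig"] = field(default_factory=list)


def connect(dst: LayerConfig, src: LayerConfig) -> None:
    """Wire src's output into dst, rejecting cost-layer outputs (first benefit)."""
    if src.is_cost:
        raise ValueError(
            f"'{src.name}' is a cost layer; its output cannot feed '{dst.name}'."
        )
    dst.inputs.append(src)


def sum_training_cost(outputs: List[LayerConfig], values: Dict[str, float]) -> float:
    """Sum only the cost-layer outputs when reporting training cost (second benefit)."""
    return sum(values[layer.name] for layer in outputs if layer.is_cost)


# Example: a model with one non-cost output and one cost output.
fc = LayerConfig("fc_out")
loss = LayerConfig("cross_entropy", is_cost=True)
print(sum_training_cost([fc, loss], {"fc_out": 3.7, "cross_entropy": 0.42}))  # 0.42
```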
@typhoonzero (Contributor)

Can we close this issue now that we are refactoring with the new "op" design?

@jacquesqiao (Member)

The future work will be done in Fluid, so closing this.

zhhsplendid pushed a commit to zhhsplendid/Paddle that referenced this issue Sep 25, 2019
thisjiang pushed a commit to thisjiang/Paddle that referenced this issue Oct 28, 2021
wangxicoding pushed a commit to wangxicoding/Paddle that referenced this issue Dec 9, 2021
* "add mlm params to dygraph ernie1.0"

* finish p-tuning v1.0

* mend

* delete unused coment

* add label_normalized

* P-tuning: support Chid task of FewCLUE

* 1. decouple evaluate and train

* 1.add FewCLUE datasets(9/9)
2.implement p-tuning strategy by transform_function
3.unify train_script beteween `chid` task and other 8 tasks of FewCLUE

* add README.md
gglin001 added a commit to graphcore/Paddle-fork that referenced this issue Mar 17, 2022