- D由双向LSTM组成(a bidirectional layout)
- G is unidirectional
- freezing: stopping the updates of D whenever its training loss is less than 70% of the training loss of G [1]
以免D训得过好
- feature matching: objective of G is to produce an internal representation at some level in the discriminator that matches that of real data. $$ \hat L_G=\frac{1}{m}\sum_{i=1}^m(R(x^{(i)})-R(G(x^{(i)}))^2 $$
$R$ 是D分类层前的最后一层,有点像pix2pix的D
[1]. Tim Salimans, Ian Goodfellow, Wojciech Zaremba, Vicki Cheung, Alec Radford, and Xi Chen. Improved techniques for training gans. In Advances in Neural Information Processing Systems, pages 2226–2234, 2016.
- G单向LSTM保证online,而D可以采用bi-LSTM
- 两个训练trick很重要