Training Log 2022-11-22 #246
zh-zheng
announced in
Training Logs 训练日志
Replies: 1 comment
-
可以详细介绍一下如何应用scaling weights吗?具体怎么做的?为什么要用这种方法解决NaN问题呢?谢谢,提前。 |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
CPM-Live Training Log (November, 22)
Time: November, 22 2022 19:00
Recorder: @zh-zheng
Loss
Completed Data
Average Grad Norm
Progress
Comment
Today, the training loss became NaN at around 12:00. We solved this problem by scaling weights. We'll keep an eye on the model in the next few days.
Beta Was this translation helpful? Give feedback.
All reactions