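During continued (incremental) pre-training, the loss often blows up when I train with fp16. After switching to bf16 the loss no longer blows up, but it drops much more slowly than with fp16: with fp16 the loss went from 6.x down to 2.1 within 30k steps, while with bf16 it is still around 3.6 after 55k steps.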
[Image: fp16 loss curve, TensorBoard]
[Image: bf16 loss curve, TensorBoard]
Is this normal?
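For context, here is a minimal sketch of how the two formats differ numerically. It only illustrates one possible factor and is not a diagnosis of the runs above. The helpers `to_fp16` and `to_bf16` are hypothetical names written for this example (pure Python, no training framework), and the values 1.001 and 70000.0 are arbitrary illustrations, not numbers from my training. fp16 has 10 mantissa bits but a small range; bf16 has the float32 range but only 7 mantissa bits, so a small update to a weight near 1.0 can be rounded away.

```python
import math
import struct

def to_fp16(x: float) -> float:
    """Round x to IEEE half precision (5 exponent bits, 10 mantissa bits)."""
    try:
        return struct.unpack("<e", struct.pack("<e", x))[0]
    except OverflowError:
        # struct refuses values beyond the fp16 range; treat that as overflow to inf.
        return math.copysign(math.inf, x)

def to_bf16(x: float) -> float:
    """Round x to bfloat16 (8 exponent bits, 7 mantissa bits).

    Assumption: x is finite and fits in float32; NaN handling is omitted.
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # Round to nearest, ties to even, then drop the low 16 bits of the float32.
    bits += 0x7FFF + ((bits >> 16) & 1)
    bits &= 0xFFFF0000
    return struct.unpack("<f", struct.pack("<I", bits))[0]

# Precision: a small update applied to a weight of 1.0.
weight, update = 1.0, 1e-3
print("fp16:", to_fp16(weight + update))   # update survives (rounded)
print("bf16:", to_bf16(weight + update))   # update is rounded away

# Range: a large intermediate value.
big = 70000.0
print("fp16:", to_fp16(big))               # exceeds the fp16 maximum of 65504
print("bf16:", to_bf16(big))               # representable, but coarsely
```

Whether this rounding actually explains the slower bf16 curve depends on details such as whether master weights and optimizer states are kept in fp32, which I have not verified here.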