
Loss value does not decrease, stays between 3 and 4 #118

Open
MichealZhangxa opened this issue Sep 24, 2024 · 4 comments

Comments

@MichealZhangxa

Hi, I am training on top of your code, and in the early stage the loss stays between 3 and 4. Is that normal? I am also training on the LLaVA pretraining dataset.

@shiym2000
Collaborator

Hello, could you share your experiment script/configuration so we can take a look?

@shiym2000
Collaborator

Please provide your training script and any modified code (if applicable) so that we can better identify the issues in the code.

@shiym2000
Collaborator

You might try training the whole model with each of the two encoders separately, to confirm that each individual encoder on its own can train the model properly.
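
As a rough illustration of that ablation, the sketch below just builds one launch command per encoder and prints it. The `train.py` entry point, the flag names, and the encoder ids are placeholders I am assuming for illustration, not this repo's actual interface; substitute your real training script and arguments.

```python
# Hypothetical sketch: prepare one single-encoder training run per vision encoder,
# so the loss curve of each encoder can be checked in isolation.
import shlex

encoders = [
    "openai/clip-vit-large-patch14-336",   # example encoder id
    "google/siglip-so400m-patch14-384",    # example encoder id
]

for enc in encoders:
    cmd = [
        "python", "train.py",                  # placeholder entry point
        "--vision_tower", enc,                 # single encoder instead of both
        "--data_path", "llava_pretrain.json",  # same LLaVA pretraining data
        "--output_dir", f"checkpoints/pretrain-{enc.split('/')[-1]}",
    ]
    print(shlex.join(cmd))                     # inspect, then launch each run separately
```

Comparing the two loss curves should show whether the plateau comes from one specific encoder or from the combined setup.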

@shiym2000
Collaborator

You can also adjust the training length by modifying the num_train_epochs parameter.
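
For reference, num_train_epochs is the standard Hugging Face TrainingArguments field; assuming the training script builds its arguments through TrainingArguments (an assumption about this repo), a longer run could be configured like this, with illustrative values rather than the repo's defaults:

```python
# Sketch: lengthen pretraining by raising num_train_epochs.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="checkpoints/pretrain",
    num_train_epochs=2,                 # e.g. 2 epochs instead of the usual 1
    per_device_train_batch_size=16,     # illustrative value
    learning_rate=1e-3,                 # illustrative value
    save_steps=500,
)
```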
