Validation loss on pretraining? [Feature request] #20

Open
james20141606 opened this issue Jun 21, 2024 · 6 comments
@james20141606

Hi, I am trying to reproduce the pretraining step as described in the README. The training loss converges pretty fast. Looking at the wandb logs, they only contain the training loss. Could you add other metrics, such as validation loss and perplexity?
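
For concreteness, here is roughly what I have in mind: averaging the loss over a held-out set and exponentiating it for perplexity. This is only a sketch, not code from this repo; the model and dataloader here are placeholders, and it assumes an HF-style model that returns `.loss` when `labels` are in the batch:

```python
import math
import torch

@torch.no_grad()
def eval_loss_and_ppl(model, eval_dataloader, device="cuda"):
    """Mean next-token loss on a held-out set, plus perplexity = exp(loss)."""
    model.eval()
    total_loss, num_batches = 0.0, 0
    for batch in eval_dataloader:
        batch = {k: v.to(device) for k, v in batch.items()}
        out = model(**batch)          # `.loss` is returned when `labels` is in the batch
        total_loss += out.loss.item()
        num_batches += 1
    mean_loss = total_loss / max(num_batches, 1)
    return mean_loss, math.exp(mean_loss)
```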

Thanks a lot!

@mu-cai
Collaborator

mu-cai commented Jun 24, 2024

Thanks for the question. However, I did not incorporate a validation dataset during training. Feel free to try it on your own!

@james20141606
Author

Thanks for your reply! By the way, do you have validation data in the fine-tuning stage?

@james20141606
Author

I also have two additional questions:

  • I tried pretraining ViP-LLaVA using both your provided data and my own custom data, and both converge very fast: the loss plateaus within 5 hours on a single A100. Did that happen in your experiments as well?
  • To build a ViP-LLaVA model for a specific domain, for example satellite imagery: do you think we should pretrain ViP-LLaVA on satellite data and then fine-tune with instructions, or is it enough to load your pretrained checkpoint and fine-tune on custom data? Do you have any intuition about this?

I would appreciate it a lot if you could answer my questions. Thanks!

@mu-cai
Collaborator

mu-cai commented Jun 27, 2024

  1. Yes, the LLM loss decreases very fast.
  2. I think either works; it all depends on the quality and quantity of your data!

@james20141606
Author

Thanks for your reply! I'd like to confirm: in the pretraining stage, do you freeze the LLM weights?

@mu-cai
Collaborator

mu-cai commented Jul 17, 2024

For pretraining, I never freeze the LLM weights.
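
If you ever want to change that, it comes down to toggling `requires_grad` on the language-model parameters. A rough sketch, not the exact code in this repo (the `vision_tower` / `mm_projector` name filters are just my guess at the usual LLaVA-style module names; check the actual ones in the codebase):

```python
def set_llm_trainable(model, trainable: bool):
    """Toggle requires_grad on the LLM backbone only, leaving the
    vision tower and multimodal projector flags untouched."""
    for name, param in model.named_parameters():
        if "vision_tower" not in name and "mm_projector" not in name:
            param.requires_grad = trainable

# Pretraining as described above keeps the LLM trainable:
# set_llm_trainable(model, True)
```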
