-
Notifications
You must be signed in to change notification settings - Fork 30
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Training loss is NaN now. #17
Comments
Thanks for your interest. Is the training loss NaN at the beginning, what dataset did you use? The latest version of the code may not be released so soon. We will prioritize the release of data processing code and the integration of ConsisID into diffusers. |
I downloaded your dataset and processed it appropriately, using CogvideoX-5B-I2V to initialize the weights,which bs is 5 and lr is 3e-7. It has loss in the initial training, but NaN appears after about 500 iterations. Is it because I processed the data wrong? And “fail to detect face using insightface, extract embedding on align face“ occurs during training。 |
Oh, I see. This may be a problem with |
|
or you can try to train only LoRA instead of all parameters.
|
we have release the data processing code, please refer to here for more details. |
Thank you!I'll give it a try. |
Hi! Have you solved this problem? I meet the same issue and wonder which way to avoid loss NaN. |
You may need to construct a higher dataset to continue finetuning ConsisID, or have a larger batch size. Since ConsisID is trained on a higher quality internal dataset, if it continues to be trained on the ConsisID-Preview-Data, it is likely to get worse. |
Or you can load the ckpt of CogVideoX-5B-I2V for training IPT2V from scratch. (Instead of load ConsisID-Preview for continue finetuning.) |
Some solutions can refer to #31. |
I try a larger batch size and solve this problem. |
When will the latest version of the code and data processing code be released?
The text was updated successfully, but these errors were encountered: