-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
training #5
Comments
Hi, I encountered similar situations where model outputs all zeros in the density maps. Some tricks and experience might be helpful: 1) try some other random seeds (weird but sometimes useful); 2) try to initialize model parameters with our pretrained model on FSC-147; 3) change batch size or learning rate. Training density map-based models can sometimes be trapped in local optima. According to my experience, the training usually get back to track after some epochs in such occasion. You can also wait for maybe 10 epochs to see the results. |
Thank you @flyinglynx for your suggestion. |
You can also check the resolutions of images. Too high or too low may both be harmful. You can follow the data pro-processing of FSC-147 dataset to constrain the resolutions. I only provide the model parameters in the checkpoint file for convenience. However, if you run our code, the checkpoint file will store the model parameters, optimizer parameters, etc, which is more larger. |
Yeah @flyinglynx , I just checked the code, during training it saved some other information beside the parameters. |
Hi @flyinglynx it's me again ^^. |
Hi, we choose the exemplar size following UP-DETR. We use 128 * 128 and the results turned out to be satisfying. So we did not further finetune this parameter. I tested a few smaller sizes these days (64 * 64 and 96 * 96), and the performance drops slightly. But if using a simple multi-scale strategy by resizing exemplar into different sizes (64, 96, 128), and matching the exemplars of different sizes with the query image, the performance will be slightly better. |
Dear @flyinglynx
When training on my custom dataset, after 2 epochs, I tested the trained model (my dataset is quite big with 50,000 images), the results are pretty good but after 4 epochs, the model outputs nothing.
Do you have any explanation about this situation ?
The text was updated successfully, but these errors were encountered: