Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

request for initial model #5

Open
chrisliu54 opened this issue Jul 4, 2018 · 5 comments
Open

request for initial model #5

chrisliu54 opened this issue Jul 4, 2018 · 5 comments

Comments

@chrisliu54
Copy link

Nice work!
But some initial model required in your code is not provided, such as ssd512_vgg16.pth in this line:

parser.add_argument('--model', default='./examples/ssd/model/ssd512_vgg16.pth', type=str, help='initialized model path')

Hope all required initial models of this repo be exposed soon.

@Ellenisawake
Copy link

Similarly please share ImageNet model used as initialization!

No such file or directory: '/scratch2/model_weights/ssd300_12_imagenet.pth'

@MInner
Copy link
Collaborator

MInner commented Jul 5, 2018

My bad. I believe it must be this one, and you convert it to proper format using this script. And I believe that the best file to start with would be this one

@chrisliu54
Copy link
Author

chrisliu54 commented Jul 6, 2018

Same problem after following your instruction.
image

I suppose it could be caused by pytorch version mismatch. I run this script under pytorch=0.4 and I noticed your pytorch version is older(you are using Variable() API).
I suggest you provide us a requirements.txt or environment.yaml(for conda) to make thins clear.

@MInner
Copy link
Collaborator

MInner commented Jul 6, 2018

That probably means that, for some reason, one of tensors turn contained nan or value outsize of propeor range, I guess? If you could share somehow what are vales of these tensors, which of them are suspicious and suggest how did they manage to get these values, I might try to guess why it might have happened. This code worked fine for me, I usually do not push broken code intentionally :)

@chrisliu54
Copy link
Author

When I turned into torchcv, most of the problems disappeared.
But the so called device-side assert triggered is caused by misconfiguration(the number of class of a model should be #foreground_classes + 1, which included the background as one class).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants