Train model when I have image only without any bbox info #39

Decalogue · 2021-01-13T04:20:31Z

Hi, thanks for your great work!
I want to know whether the model can be trained without regions. In other words, I only have images and captions, with no bbox info. How can I make the model work?
Thank you so much!

LuoweiZhou · 2021-03-16T22:53:37Z

@Decalogue You will need to adjust the image input (e.g., feature vectors and positional encodings) accordingly. If you intend to take CNN activations as the input, you might want to refer to our recent work ClipBERT: https://github.com/jayleicn/ClipBERT
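To illustrate what "adjusting the image input" could look like, here is a minimal sketch that treats each cell of a CNN feature map as a pseudo-region: the cell's activation becomes the feature vector, and the cell's location becomes the box used for positional encoding. This is not the pipeline of this repo or of ClipBERT. The function name `grid_pseudo_regions`, the `(C, H, W)` feature-map layout, and the 5-d position vector (normalized box plus area fraction) are all assumptions made for illustration, so check them against the input format the model actually expects.

```python
import numpy as np


def grid_pseudo_regions(feature_map, image_w, image_h):
    """Hypothetical helper: turn a CNN feature map of shape (C, H, W)
    into per-cell feature vectors and box-like position vectors."""
    c, h, w = feature_map.shape

    # One feature vector per grid cell, in row-major order: (H*W, C).
    feats = feature_map.reshape(c, h * w).T

    # Pixel coordinates of the cell edges along each axis.
    xs = np.linspace(0.0, image_w, w + 1)
    ys = np.linspace(0.0, image_h, h + 1)

    # Top-left and bottom-right corners of every cell, each of shape (H, W).
    x1, y1 = np.meshgrid(xs[:-1], ys[:-1])
    x2, y2 = np.meshgrid(xs[1:], ys[1:])
    boxes = np.stack([x1, y1, x2, y2], axis=-1).reshape(h * w, 4)

    # Normalize boxes to [0, 1] and append the area fraction
    # (assumed 5-d position format: x1, y1, x2, y2, area).
    scale = np.array([image_w, image_h, image_w, image_h], dtype=np.float64)
    norm = boxes / scale
    area = ((norm[:, 2] - norm[:, 0]) * (norm[:, 3] - norm[:, 1]))[:, None]
    pos = np.concatenate([norm, area], axis=1)
    return feats, pos


# Stand-in for real CNN activations: 2 channels on a 2x3 grid.
feature_map = np.arange(12, dtype=np.float64).reshape(2, 2, 3)
feats, pos = grid_pseudo_regions(feature_map, image_w=300, image_h=200)
```

With real activations, `feats` and `pos` would take the place of the detector's region features and box coordinates. The number of pseudo-regions is then fixed by the grid size rather than by a detector, so any per-image region count the model assumes would need to match.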
