You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In caption model, we need region features from bottom-up attention model. But if we got different shape of features, for example, extract 10 regions from image A and 20 regions from image B, how can I train them in ONE batch?
Thanks in advance!
The text was updated successfully, but these errors were encountered:
In caption model, we need region features from bottom-up attention model. But if we got different shape of features, for example, extract 10 regions from image A and 20 regions from image B, how can I train them in ONE batch?
Thanks in advance!
The text was updated successfully, but these errors were encountered: