-
Notifications
You must be signed in to change notification settings - Fork 57
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Integration with LXMERT #6
Comments
Yea; It would work well (at least in my test). But the NMS approach would be the best to use this one: py-bottom-up-attention/demo/detectron2_mscoco_proposal_maxnms.py Lines 54 to 65 in 834fa8b
|
Thank you, I will try the Non Maximal Suppression. But, just curious, does this mean that other SOTA recurrent vision models could be used too in the future? rCNN is now several years old, I was wondering if you experimented with more modern vision models, and perhaps can get better performance |
Hmmm... This code does not provide a training, just the weight converted. from the original CAFFE weight. You could try this and switch the backbone: |
Hi @johntiger1, before I finish coding my project: How long does it take to extract NLPR2's 107,292 images when LXMERT Would you mind taking a time estimate? Thanks. |
Hi @johntiger1 , I get my solution for this question of time estimate and summarize it here. Thanks anyway. |
If I want to use this repo to extract RCNN image features to train LXMERT, how can I do that? Do I just dump the features from
(from https://github.com/airsplay/py-bottom-up-attention/blob/master/demo/demo_feature_extraction_attr.ipynb)
into a
.tsv
file?Btw, what is the difference between with and without attributes? Thanks!
The text was updated successfully, but these errors were encountered: