
Integration with LXMERT #6

Open
johntiger1 opened this issue Jun 4, 2020 · 5 comments

@johntiger1

If I want to use this repo to extract RCNN image features to train LXMERT, how can I do that? Do I just dump the features from

# Show the boxes, labels, and features
pred = instances.to('cpu')
v = Visualizer(im[:, :, :], MetadataCatalog.get("vg"), scale=1.2)
v = v.draw_instance_predictions(pred)
showarray(v.get_image()[:, :, ::-1])
print('instances:\n', instances)
print()
print('boxes:\n', instances.pred_boxes)
print()
print('Shape of features:\n', features.shape)

(from https://github.com/airsplay/py-bottom-up-attention/blob/master/demo/demo_feature_extraction_attr.ipynb)

into a .tsv file?

Btw, what is the difference between the versions with and without attributes? Thanks!
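(For reference, a minimal sketch of what such a .tsv dump could look like, assuming the LXMERT-style field list and base64-over-numpy encoding; write_row is a hypothetical helper here, and the exact fields and dtypes should be checked against LXMERT's data loaders:)

# Sketch: dump detections to a LXMERT-style .tsv, one row per image.
# Assumptions to verify: the field list and base64 encoding below follow
# the convention LXMERT's loaders expect; `instances` and `features` are
# the CPU outputs from the demo notebook above.
import base64
import csv
import numpy as np

FIELDNAMES = ["img_id", "img_h", "img_w", "objects_id", "objects_conf",
              "attrs_id", "attrs_conf", "num_boxes", "boxes", "features"]

def encode(arr):
    # base64-encode a numpy array so it fits in a single tsv cell
    return base64.b64encode(arr.tobytes()).decode("utf-8")

def write_row(writer, img_id, img_h, img_w, instances, features):
    n = len(instances)
    writer.writerow({
        "img_id": img_id,
        "img_h": img_h,
        "img_w": img_w,
        "objects_id": encode(instances.pred_classes.numpy().astype(np.int64)),
        "objects_conf": encode(instances.scores.numpy().astype(np.float32)),
        # attribute ids/confidences come from the attr demo; zeros otherwise
        "attrs_id": encode(np.zeros(n, dtype=np.int64)),
        "attrs_conf": encode(np.zeros(n, dtype=np.float32)),
        "num_boxes": n,
        "boxes": encode(instances.pred_boxes.tensor.numpy().astype(np.float32)),
        "features": encode(features.numpy().astype(np.float32)),  # n x 2048
    })

with open("features.tsv", "w") as f:
    writer = csv.DictWriter(f, fieldnames=FIELDNAMES, delimiter="\t")
    # call write_row(writer, ...) once per image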

@airsplay
Owner

airsplay commented Jun 4, 2020

Yes; it would work well (at least in my tests).

But for the NMS step, it would be best to use this approach:

# Select max scores
max_scores, max_classes = scores.max(1)  # R x C --> R
num_objs = boxes.size(0)
boxes = boxes.view(-1, 4)
idxs = torch.arange(num_objs).cuda() * num_bbox_reg_classes + max_classes
max_boxes = boxes[idxs]  # Select max boxes according to the max scores.

# Apply NMS
keep = nms(max_boxes, max_scores, nms_thresh)
if topk_per_image >= 0:
    keep = keep[:topk_per_image]
boxes, scores = max_boxes[keep], max_scores[keep]
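(For readers who want to run this step in isolation, here is a self-contained sketch using torchvision.ops.nms; the shapes and thresholds are made-up assumptions, not values from this repo. The design point: the detector predicts one box per class per proposal, so you first select each proposal's best-scoring class-specific box, then run a single class-agnostic NMS rather than detectron2's default per-class NMS:)

import torch
from torchvision.ops import nms

# Illustrative sizes and thresholds (assumptions, not values from this repo)
num_objs, num_bbox_reg_classes = 100, 1600
nms_thresh, topk_per_image = 0.7, 36

scores = torch.rand(num_objs, num_bbox_reg_classes)          # R x C class scores
xy = torch.rand(num_objs, num_bbox_reg_classes, 2) * 50      # top-left corners
wh = torch.rand(num_objs, num_bbox_reg_classes, 2) * 50 + 1  # widths and heights
boxes = torch.cat([xy, xy + wh], dim=-1)                     # valid R x C x 4 boxes

# For each proposal, keep the box of its highest-scoring class
max_scores, max_classes = scores.max(dim=1)
idxs = torch.arange(num_objs) * num_bbox_reg_classes + max_classes
max_boxes = boxes.view(-1, 4)[idxs]

# One class-agnostic NMS over the selected boxes, truncated to top-k
keep = nms(max_boxes, max_scores, nms_thresh)[:topk_per_image]
final_boxes, final_scores = max_boxes[keep], max_scores[keep]
print(final_boxes.shape, final_scores.shape)  # at most topk_per_image boxes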

@johntiger1
Author

Thank you, I will try the non-maximum suppression approach. But, just curious, does this mean that other SOTA vision models could be used too in the future? R-CNN is now several years old, and I was wondering if you have experimented with more modern vision models that might give better performance.

@airsplay
Owner

airsplay commented Jun 4, 2020

Hmmm... This code does not provide training, just the weights converted from the original Caffe weights.

You could try this and switch the backbone:
https://github.com/MILVLG/bottom-up-attention.pytorch

@yezhengli-Mr9


Hi @johntiger1, before I finish coding my project:

How long does it take to extract features for NLVR2's 107,292 images, given that LXMERT takes around 5 to 6 hours for the training split and 1 to 2 hours for the valid and test splits?

Would you mind sharing a time estimate? Thanks.
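(One way to get such an estimate yourself: time the extraction on a small sample and extrapolate; extract_features below is a hypothetical stand-in for the per-image extraction call:)

import time

# Sketch: estimate total extraction time by extrapolating from a sample.
# extract_features is a hypothetical stand-in for your per-image call.
def estimate_total_hours(image_paths, extract_features,
                         sample_size=50, total_images=107_292):
    sample = image_paths[:sample_size]
    start = time.time()
    for path in sample:
        extract_features(path)
    per_image = (time.time() - start) / len(sample)
    return per_image * total_images / 3600.0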

@yezhengli-Mr9


Hi @johntiger1, I found the answer to my time-estimate question and have summarized it here. Thanks anyway.
