how to aggregate nx2048 features into one 2048 feature ? #2

dixonhsiao · 2019-09-10T07:08:41Z

It seems that in your training/eval data there is only one 2048 2d feature and one 2048 3d feature for a sentence. But using the feature extractor in https://github.com/antoine77340/video_feature_extractor , it seems that there will be nx2048 features for a sentence (if the sentence is n seconds in duration for 2d, and approximately n/1.5 seconds for 3d). How do I aggregate nx2048 features into one 2048 feature as stated in your paper by using temporal max-pooling ? Just select the max value for each dimension ?

bjuncek · 2019-12-30T13:53:14Z

Yes you can either max pool along the dimensions. For example, you could add
nn.AdaptiveMaxPool2d((1, 2048))
after feature loading.

dixonhsiao changed the title ~~how to aggregate n*2048 features into one 2048 feature ?~~ how to aggregate nx2048 features into one 2048 feature ? Sep 10, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

how to aggregate nx2048 features into one 2048 feature ? #2

how to aggregate nx2048 features into one 2048 feature ? #2

dixonhsiao commented Sep 10, 2019 •

edited

Loading

bjuncek commented Dec 30, 2019

how to aggregate nx2048 features into one 2048 feature ? #2

how to aggregate nx2048 features into one 2048 feature ? #2

Comments

dixonhsiao commented Sep 10, 2019 • edited Loading

bjuncek commented Dec 30, 2019

dixonhsiao commented Sep 10, 2019 •

edited

Loading