Evaluation protocol #7

Alen-T · 2020-09-24T04:50:49Z

Why “Learning a Text-Video Embedding from Incomplete and Heterogeneous Data” and “HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips” evaluation protocol different?

Is there a test set of 1k-A and 1k-B each representing 1000 randomly sampled text-video pairs?

I am very confused

Alen-T closed this as completed Jan 11, 2021

Alen-T reopened this Jan 11, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Evaluation protocol #7

Evaluation protocol #7

Alen-T commented Sep 24, 2020

Evaluation protocol #7

Evaluation protocol #7

Comments

Alen-T commented Sep 24, 2020