[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)
nlp video vision srl captioning captioning-videos vision-and-language grounding video-language event-relations semantic-roles
-
Updated
Aug 17, 2021 - Python