🎯
Focusing
Graduate student.
Research area: Action Recognition, Multimodal Large Language Models.
-
Nankai University
- Tianjin, China
- https://rikeilong.github.io/
-
-
rikeilong.github.io Public
Forked from rshaojimmy/rshaojimmy.github.ioGithub Pages of Qilang Ye
HTML UpdatedDec 16, 2024 -
Bay-CAT Public
[ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios
-
CueNet Public
Official Implementation for "Cue-N: Cue-Aware Network for Audio-Visual Question Answering"
2 UpdatedJul 2, 2024 -
Ppromo-IAR Public
[IEEE SPL] Official Implementation for Pose-promote: Progressive Visual Perception for Indoor Action Recognition
-
MCD-forAVQA Public
Official Implementation for Answering Diverse Questions via Text Attached with Key Audio-Visual Clues
6 UpdatedOct 31, 2023