PhD Student at MIT CSAIL
-
Massachusetts Institute of Technology
- Cambridge, Massachusetts
- http://people.csail.mit.edu/roudi/
- @arouditchenko
Highlights
- Pro
Pinned Loading
-
whisper-flamingo
whisper-flamingo Public[Interspeech 2024] Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation
-
-
Sound-of-Pixels
Sound-of-Pixels PublicForked from hangzhaomit/Sound-of-Pixels
Codebase for ECCV18 "The Sound of Pixels"
Python 1
-
everything_at_once
everything_at_once PublicForked from ninatu/everything_at_once
Implementation of "Everything at Once - Multi-modal Fusion Transformer for Video Retrieval" (CVPR 2022)
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.