Skip to content
Change the repository type filter

All

    Repositories list

    • A curated list of balanced multimodal learning methods.
      23210Updated Dec 10, 2024Dec 10, 2024
    • Ref-AVS

      Public
      The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024
      Python
      MIT License
      12910Updated Dec 4, 2024Dec 4, 2024
    • A curated list of audio-visual learning methods and datasets.
      1723700Updated Dec 3, 2024Dec 3, 2024
    • The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)
      Python
      MIT License
      19246340Updated Nov 28, 2024Nov 28, 2024
    • The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024
      Python
      14130Updated Nov 5, 2024Nov 5, 2024
    • HTML
      1000Updated Oct 31, 2024Oct 31, 2024
    • MS-Bot

      Public
      The offical repo for "Play to the Score: Stage-Guided Dynamic Multi-Sensory Fusion for Robotic Manipulation", CoRL 2024 (ORAL)
      Python
      1600Updated Oct 28, 2024Oct 28, 2024
    • TSPM

      Public
      Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.
      Python
      11420Updated Oct 25, 2024Oct 25, 2024
    • LFAV

      Public
      Towards Long Form Audio-visual Video Understanding
      Python
      MIT License
      0910Updated Oct 23, 2024Oct 23, 2024
    • The repo for "KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance", CoRL 2024
      Python
      1200Updated Oct 17, 2024Oct 17, 2024
    • The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024
      Python
      11210Updated Oct 11, 2024Oct 11, 2024
    • Official repository for "Unveiling and Mitigating Bias in Audio Visual Segmentation" in ACM MM 2024
      Python
      0500Updated Oct 10, 2024Oct 10, 2024
    • The repo for "On-the-fly Modulation for Balanced Multimodal Learning", T-PAMI 2024
      Python
      0310Updated Sep 29, 2024Sep 29, 2024
    • Python
      0900Updated Aug 21, 2024Aug 21, 2024
    • A python implement for Geometric-Inspired Graph-based Incomplete Multi-view Clustering
      Python
      1500Updated Aug 16, 2024Aug 16, 2024
    • A python implement for Certifiable Robust Multi-modal Training
      Python
      01510Updated Aug 2, 2024Aug 2, 2024
    • The repo for "Diagnosing and Re-learning for Balanced Multi-modal Learning", ECCV 2024
      Python
      02230Updated Jul 30, 2024Jul 30, 2024
    • .github

      Public
      0000Updated Jul 19, 2024Jul 19, 2024
    • The official repo for "Can Textual Semantics Mitigate Sounding Object Segmentation Preference?", ECCV 2024
      Python
      0400Updated Jul 18, 2024Jul 18, 2024
    • JavaScript
      0000Updated Jul 18, 2024Jul 18, 2024
    • The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024
      Python
      33630Updated Jun 28, 2024Jun 28, 2024
    • Official repository of "Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer", AAAI 2024
      Python
      MIT License
      31500Updated Mar 26, 2024Mar 26, 2024
    • JavaScript
      0000Updated Feb 18, 2024Feb 18, 2024
    • Python
      23260Updated Feb 18, 2024Feb 18, 2024
    • PSTP-Net

      Public
      Python
      11620Updated Aug 11, 2023Aug 11, 2023
    • MWAFM

      Public
      Multi-Scale Attention for Audio Question Answering
      Python
      22810Updated Jul 19, 2023Jul 19, 2023
    • JavaScript
      0000Updated Jun 4, 2023Jun 4, 2023
    • The code repo for ICASSP 2023 Paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"
      Python
      11840Updated May 18, 2023May 18, 2023
    • Python
      0700Updated Apr 27, 2023Apr 27, 2023
    • MMCosine

      Public
      Project page for ICASSP 2023 Paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"
      JavaScript
      0000Updated Mar 10, 2023Mar 10, 2023