Skip to content
Change the repository type filter

All

    Repositories list

    • FTVSR

      Public
      [ECCV'22] FTVSR: Learning Spatiotemporal Frequency-Transformer for Compressed Video Super-Resolution
      Python
      MIT License
      13159160Updated Oct 22, 2024Oct 22, 2024
    • VQD-SR

      Public
      [ICCV'23] VQD-SR: Learning Data-Driven Vector-Quantized Degradation Model for Animation Video Super-Resolution
      Python
      33740Updated Jun 19, 2024Jun 19, 2024
    • [CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
      Python
      MIT License
      22394150Updated Jun 5, 2024Jun 5, 2024
    • [TVCG'2023] AOT-GAN for High-Resolution Image Inpainting (codebase for image inpainting)
      Python
      Apache License 2.0
      71437141Updated May 8, 2024May 8, 2024
    • Stark

      Public
      [ICCV'21] Learning Spatio-Temporal Transformer for Visual Tracking
      Python
      MIT License
      143649710Updated Apr 13, 2024Apr 13, 2024
    • TracKit

      Public
      [ECCV'20] Ocean: Object-aware Anchor-Free Tracking
      Python
      MIT License
      97613281Updated Aug 7, 2023Aug 7, 2023
    • JavaScript
      01200Updated Jun 17, 2023Jun 17, 2023
    • [TMM 2023] Language-Guided Face Animation by Recurrent StyleGAN-based Generator
      Python
      MIT License
      01110Updated Apr 23, 2023Apr 23, 2023
    • [MM'22 Oral] AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation
      Python
      MIT License
      21100Updated Apr 3, 2023Apr 3, 2023
    • STTR

      Public
      [ACCV'22] Fine-Grained Image Style Transfer with Visual Transformers
      Python
      MIT License
      61400Updated Dec 6, 2022Dec 6, 2022
    • soho

      Public
      [CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning
      Python
      1920690Updated Sep 30, 2022Sep 30, 2022
    • TTSR

      Public
      [CVPR'20] TTSR: Learning Texture Transformer Network for Image Super-Resolution
      Python
      MIT License
      11576530Updated Jul 24, 2022Jul 24, 2022
    • TTVSR

      Public
      [CVPR'22 Oral] TTVSR: Learning Trajectory-Aware Transformer for Video Super-Resolution
      Python
      MIT License
      1320190Updated Jul 24, 2022Jul 24, 2022
    • CKDN

      Public
      [ICCV'21] CKDN: Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment
      Python
      MIT License
      55660Updated Apr 9, 2022Apr 9, 2022
    • CyDAS

      Public
      Cyclic Differentiable Architecture Search
      Python
      MIT License
      63510Updated Feb 14, 2022Feb 14, 2022
    • [CVPR21] LightTrack: Finding Lightweight Neural Network for Object Tracking via One-Shot Architecture Search
      Python
      MIT License
      59397230Updated Dec 29, 2021Dec 29, 2021
    • tasn

      Public
      Trilinear Attention Sampling Network for Fine-grained Image Recognition
      Python
      40218166Updated Dec 14, 2021Dec 14, 2021
    • [CVPR'2019] PEN-Net: Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting
      Python
      MIT License
      77361231Updated Nov 29, 2021Nov 29, 2021
    • A collection of models for image<->text generation in ACM MM 2021.
      Python
      MIT License
      86620Updated Oct 31, 2021Oct 31, 2021
    • [MM'20] Aesthetic-Aware Image Style Transfer
      Python
      31420Updated Sep 16, 2021Sep 16, 2021
    • img2poem

      Public
      [MM'18] Beyond Narrative Description: Generating Poetry from Images by Multi-Adversarial Training
      Python
      6028030Updated Aug 23, 2021Aug 23, 2021
    • STTN

      Public
      [ECCV'2020] STTN: Learning Joint Spatial-Temporal Transformations for Video Inpainting
      Jupyter Notebook
      72474101Updated Jul 26, 2021Jul 26, 2021
    • AutoML

      Public
      AutoFormer, Cream
      Python
      MIT License
      227100Updated Jul 4, 2021Jul 4, 2021
    • SiamDW

      Public
      [CVPR'19 Oral] Deeper and Wider Siamese Networks for Real-Time Visual Tracking
      Python
      MIT License
      180750201Updated May 18, 2021May 18, 2021
    • SariGAN

      Public
      [NeurIPS'20] Learning Semantic-aware Normalization for Generative Adversarial Networks
      Python
      35310Updated May 14, 2021May 14, 2021
    • NEAS

      Public
      Python
      51910Updated May 11, 2021May 11, 2021
    • WSOD2

      Public
      [ICCV'19] WSOD^2: Learning Bottom-up and Top-down Objectness Distillation for Weakly-supervised Object Detection
      Python
      MIT License
      34840Updated Jan 26, 2021Jan 26, 2021
    • [AAAI‘20] - Learning 2D Temporal Localization Networks for Moment Localization with Natural Language
      Python
      Other
      161001Updated Feb 16, 2020Feb 16, 2020
    • DBTNet

      Public
      Code for our NeurIPS'19 paper "Learning Deep Bilinear Transformation for Fine-grained Image Representation"
      Python
      1810560Updated Jan 20, 2020Jan 20, 2020
    • 2D-TAN

      Public
      AAAI2020 - Learning 2D Temporal Localization Networks for Moment Localization with Natural Language
      Python
      31710Updated Dec 10, 2019Dec 10, 2019