Multimedia Research

31 repositories

FTVSR
Public
[ECCV'22] FTVSR: Learning Spatiotemporal Frequency-Transformer for Compressed Video Super-Resolution
video-super-resolution video-restoration
Python
•
MIT License
•13•159•16•0•Updated Oct 22, 2024
VQD-SR
Public
[ICCV'23] VQD-SR: Learning Data-Driven Vector-Quantized Degradation Model for Animation Video Super-Resolution
animation super-resolution
Python
•3•37•4•0•Updated Jun 19, 2024
MM-Diffusion
Public
[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
video-generation multi-modality diffusion-models content-creation audio-generation
Python
•
MIT License
•22•394•15•0•Updated Jun 5, 2024
AOT-GAN-for-Inpainting
Public
[TVCG'2023] AOT-GAN for High-Resolution Image Inpainting (codebase for image inpainting)
codebase high-resolution image-inpainting multi-scale
Python
•
Apache License 2.0
•71•437•14•1•Updated May 8, 2024
Stark
Public
[ICCV'21] Learning Spatio-Temporal Transformer for Visual Tracking
transformer
Python
•
MIT License
•143•649•71•0•Updated Apr 13, 2024
TracKit
Public
[ECCV'20] Ocean: Object-aware Anchor-Free Tracking
tracking anchor-free eccv2020 segmentation
Python
•
MIT License
•97•613•28•1•Updated Aug 7, 2023
davinci-videofactory
Public
JavaScript
•0•12•0•0•Updated Jun 17, 2023
language-guided-animation
Public
[TMM 2023] Language-Guided Face Animation by Recurrent StyleGAN-based Generator
Python
•
MIT License
•0•11•1•0•Updated Apr 23, 2023
AI_Illustrator
Public
[MM'22 Oral] AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation
Python
•
MIT License
•2•11•0•0•Updated Apr 3, 2023
STTR
Public
[ACCV'22] Fine-Grained Image Style Transfer with Visual Transformers
Python
•
MIT License
•6•14•0•0•Updated Dec 6, 2022
soho
Public
[CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning
Python
•19•206•9•0•Updated Sep 30, 2022
TTSR
Public
[CVPR'20] TTSR: Learning Texture Transformer Network for Image Super-Resolution
image-restoration image-super-resolution
Python
•
MIT License
•115•765•3•0•Updated Jul 24, 2022
TTVSR
Public
[CVPR'22 Oral] TTVSR: Learning Trajectory-Aware Transformer for Video Super-Resolution
video-super-resolution video-restoration
Python
•
MIT License
•13•201•9•0•Updated Jul 24, 2022
CKDN
Public
[ICCV'21] CKDN: Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment
Python
•
MIT License
•5•56•6•0•Updated Apr 9, 2022
CyDAS
Public
Cyclic Differentiable Architecture Search
Python
•
MIT License
•6•35•1•0•Updated Feb 14, 2022
LightTrack
Public
[CVPR'21] LightTrack: Finding Lightweight Neural Network for Object Tracking via One-Shot Architecture Search
efficient nas cvpr2021
Python
•
MIT License
•59•397•23•0•Updated Dec 29, 2021
tasn
Public
Trilinear Attention Sampling Network for Fine-grained Image Recognition
Python
•40•218•16•6•Updated Dec 14, 2021
PEN-Net-for-Inpainting
Public
[CVPR'2019] PEN-Net: Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting
attention attention-transfer-network image-inpainting pen-net
Python
•
MIT License
•77•361•23•1•Updated Nov 29, 2021
generate-it
Public
A collection of models for image<->text generation in ACM MM 2021.
Python
•
MIT License
•8•66•2•0•Updated Oct 31, 2021
AAST-pytorch
Public
[MM'20] Aesthetic-Aware Image Style Transfer
Python
•3•14•2•0•Updated Sep 16, 2021
img2poem
Public
[MM'18] Beyond Narrative Description: Generating Poetry from Images by Multi-Adversarial Training
code dataset poem-generator
Python
•60•280•3•0•Updated Aug 23, 2021
STTN
Public
[ECCV'2020] STTN: Learning Joint Spatial-Temporal Transformations for Video Inpainting
transformer spatial-temporal video-inpainting completing-videos
Jupyter Notebook
•72•474•10•1•Updated Jul 26, 2021
AutoML
Public
AutoFormer, Cream
Python
•
MIT License
•227•1•0•0•Updated Jul 4, 2021
SiamDW
Public
[CVPR'19 Oral] Deeper and Wider Siamese Networks for Real-Time Visual Tracking
tracking
Python
•
MIT License
•180•750•20•1•Updated May 18, 2021
SariGAN
Public
[NeurIPS'20] Learning Semantic-aware Normalization for Generative Adversarial Networks
Python
•3•53•1•0•Updated May 14, 2021
NEAS
Public
nas
Python
•5•19•1•0•Updated May 11, 2021
WSOD2
Public
[ICCV'19] WSOD^2: Learning Bottom-up and Top-down Objectness Distillation for Weakly-supervised Object Detection
Python
•
MIT License
•3•48•4•0•Updated Jan 26, 2021
2D-TAN-Microsoft
Public
[AAAI'20] Learning 2D Temporal Localization Networks for Moment Localization with Natural Language
Python
•
Other
•161•0•0•1•Updated Feb 16, 2020
DBTNet
Public
Code for our NeurIPS'19 paper "Learning Deep Bilinear Transformation for Fine-grained Image Representation"
Python
•18•105•6•0•Updated Jan 20, 2020
2D-TAN
Public
AAAI2020 - Learning 2D Temporal Localization Networks for Moment Localization with Natural Language
Python
•3•17•1•0•Updated Dec 10, 2019