Highlights
- Pro
Stars
Official repo for the paper "Mojito: Motion Trajectory and Intensity Control for Video Generation""
verl: Volcano Engine Reinforcement Learning for LLMs
A benchmark for evaluating video generative models in generating short stories
HunyuanVideo: A Systematic Framework For Large Video Generation Model
[ECCV 2024] DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting
[CVPR 2024 Highlight] Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields
A suite of image and video neural tokenizers
Janus-Series: Unified Multimodal Understanding and Generation Models
The official repository for paper "Tora: Trajectory-oriented Diffusion Transformer for Video Generation"
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
SEED-Voken: A Series of Powerful Visual Tokenizers
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
[ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Official inference repo for FLUX.1 models
SEED-Story: Multimodal Long Story Generation with Large Language Model
A research project for natural language generation, containing the official implementations by MSRA NLC team.
Official repo of the ICLR 2025 paper "MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos"
Code repository for T2V-Turbo and T2V-Turbo-v2
Ongoing research training gaussian splatting at scale by distributed system