-
Tsinghua University
- Beijing
Highlights
- Pro
Pinned Loading
-
raoyongming/DynamicViT
raoyongming/DynamicViT Public[NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
-
yuxumin/PoinTr
yuxumin/PoinTr Public[ICCV 2021 Oral] PoinTr: Diverse Point Cloud Completion with Geometry-Aware Transformers
-
wl-zhao/VPD
wl-zhao/VPD Public[ICCV 2023] VPD is a framework that leverages the high-level and low-level knowledge of a pre-trained text-to-image diffusion model to downstream visual perception tasks.
-
Oryx-mllm/Oryx
Oryx-mllm/Oryx PublicMLLM for On-Demand Spatial-Temporal Understanding at Arbitrary Resolution
-
ElasticCache
ElasticCache Public[ECCV 2024] Efficient Inference of Vision Instruction-Following Models with Elastic Cache
-
dongyh20/Chain-of-Spot
dongyh20/Chain-of-Spot PublicChain-of-Spot: Interactive Reasoning Improves Large Vision-language Models
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.