Change the repository type filter
All
Repositories list
56 repositories
llm-awq
Public- SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
- Efficient vision foundation models for high-resolution generation and perception.
- [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning; [NeurIPS 2022] MCUNetV3: On-Device Training Under 256KB Memory
- A PyTorch-based framework for Quantum Classical Simulation, Quantum Machine Learning, Quantum Neural Networks, Parameterized Quantum Circuits with support for easy deployments on real quantum computers.
tinychat-tutorial
Publicduo-attention
Publicvila-u
Publichart
PublicBlock-Sparse-Attention
Publicdata-efficient-gans
Public[NeurIPS 2020] Differentiable Augmentation for Data-Efficient GAN Trainingproxylessnas
Public[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardwarespatten
Public[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruningfastcomposer
Publicbevfusion
Public archive[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation- [ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
smoothquant
Publicspvnas
Public archive[ECCV 2020] Searching Efficient 3D Architectures with Sparse Point-Voxel Convolutionlite-transformer
Public archivetemporal-shift-module
Public[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understandingstreaming-llm
PublicTinyChatEngine
PublicTinyChatEngine: On-Device LLM Inference Librarylitepose
Public[CVPR'22] Lite Pose: Efficient Architecture Design for 2D Human Pose Estimationgan-compression
Public[CVPR 2020] GAN Compression: Efficient Architectures for Interactive Conditional GANspatch_conv
Public