Paper collection, summaries, and code for deep neural network compression, including:
- Quantization
- Pruning (unstructured and structured)
- Distillation
and so on.
By Topic:
Related Topic:
- My Implementation: My re-implementation of state-of-the-art compression methods.
My summary slides on network compression, covering a selection of papers:
- Quantization Summary
- Pruning Summary
- Theory: From basic convex optimization to quantization