Skip to content

ananddb90/CaffeModelCompression

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

Caffe Model Compression

This is a python tool used to compress the trained caffe weights. For Alexnet, we got 17x compression rate (~233M bytes to 14M bytes). The idea comes from Deep Compression . This work does not implement purning and Huffman coding, but implement the Kmeans -based quantization to compress the weights of convolution and full-connected layer. One contribution of this work is using OpenMP to accelerate the Kmeans processing.


####Dependency

  • Python/Numpy
  • Caffe

####Authors

####How to Build:

cd quantz_kit 
 ./build.sh

####How to use it:

caffe_model_compress: #function to compress model 
caffe_model_decompress: #function to decompress model 

About

Tool to compress trained caffe weights

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C 86.3%
  • Python 6.8%
  • C++ 6.5%
  • Shell 0.4%