TF-Implementation of "Data-Driven Sparse Structure Selection for Deep Neural Networks"

mengrang/Sparse-Structure-Selection-tensorflow

Sparse-Structure-Selection-tensorflow (Update In Progress)

A TensorFlow implementation of the paper Data-Driven Sparse Structure Selection for Deep Neural Networks by Zehao Huang and Naiyan Wang (TuSimple), applied to fine-grained recognition.

Abstract

In our framework, a new type of parameter -- scaling factor is first introduced to scale the outputs of specific structures, such as neurons, groups or residual blocks. Then we add sparsity regularizations on these factors, and solve this optimization problem by a modified stochastic Accelerated Proximal Gradient (APG) method. By forcing some of the factors to zero, we can safely remove the corresponding structures, thus prune the unimportant parts of a CNN. Comparing with other structure selection methods that may need thousands of trials or iterative fine-tuning, our method is trained fully end-to-end in one training pass without bells and whistles. We evaluate our method, Sparse Structure Selection with several state-of-the-art CNNs, and demonstrate very promising results with adaptive depth and width selection.
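As a rough illustration of the idea in the abstract, the sketch below applies one proximal gradient step with L1 soft-thresholding to a handful of scaling factors, then lists the structures whose factor is still non-zero. It is a simplification: the paper uses a modified stochastic Accelerated Proximal Gradient method with momentum, which is omitted here, and this is plain Python rather than code from this repository. The function names (`soft_threshold`, `proximal_step`, `kept_structures`) and all numeric values are hypothetical and chosen only for the example.

```python
def soft_threshold(x, threshold):
    """Proximal operator of threshold * |x|: shrink x toward zero."""
    if x > threshold:
        return x - threshold
    if x < -threshold:
        return x + threshold
    return 0.0


def proximal_step(scales, grads, lr, gamma):
    """One plain proximal gradient step on the scaling factors.

    scales: current scaling factors, one per prunable structure
    grads:  gradient of the data loss w.r.t. each scaling factor
    lr:     learning rate
    gamma:  strength of the L1 sparsity regularization
    """
    # Gradient step on the data loss, then soft-threshold for the L1 term.
    return [soft_threshold(s - lr * g, lr * gamma) for s, g in zip(scales, grads)]


def kept_structures(scales):
    """Indices of structures whose scaling factor is non-zero."""
    return [i for i, s in enumerate(scales) if s != 0.0]


# Made-up values: three structures (e.g. residual blocks) and their gradients.
scales = [1.0, 0.05, -0.5]
grads = [0.5, 0.1, -0.2]

updated = proximal_step(scales, grads, lr=0.1, gamma=1.0)
print(updated)                   # the second factor is driven exactly to zero
print(kept_structures(updated))  # structures that would survive pruning
```

Because soft-thresholding produces exact zeros rather than merely small values, a structure whose factor reaches zero contributes nothing to the output and can be removed from the network without changing its predictions.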

Requirements

Implementation details

Reference

Huang, Z., & Wang, N. (2017). Data-driven sparse structure selection for deep neural networks.

TODO

  • [ ]
