Combined Group and Exclusive Sparsity for Deep Networks(CGES)

Jaehong Yoon(UNIST), and Sung Ju Hwang(UNIST)

This project hosts the code for our ICML 2017 paper.

We propose a sparsity regularization method that exploits both positive and negative correlations among the features to enforce the network to be sparse, and at the same time remove any redundancies among the features to fully utilize the capacity of the network. Specifically, we propose to use an exclusive sparsity regularization based on (1; 2)-norm, which promotes competition for features between different weights, thus enforcing them to fit to disjoint sets of features. We further combine the exclusive sparsity with the group sparsity based on (2; 1)-norm, to promote both sharing and competition for features in training of a deep neural network. We validate our method on multiple public datasets, and the results show that our method can obtain more compact and efficient networks while also improving the performance over the base networks with full weights, as opposed to existing sparsity regularizations that often obtain efficiency at the expense of prediction accuracy.

Reference

If you use this code as part of any published research, please refer the following paper.

@inproceedings{yoon2017cges,
    author    = {Jaehong Yoon and Sung Ju Hwang},
    title     = {Combined Group and Exclusive Sparsity for Deep Networks},
    booktitle = {ICML},
    year      = {2017}
}

Running Code

We implemented a combined regularizer as described in the paper based on Tensorflow library, Tensorflow.

Get our code

git clone --recursive https://github.com/jaehong-yoon93/CGES.git CGES

Run examples

In this code, you can run our model on MNIST dataset. Then, you don't need to download dataset on your own, just you get the dataset when you run our code. If you want to apply the model to your own data, you need to edit code a little bit.

For convinence, we added the logs that are printed out sparsity of each layer, training & test accuracy, and several parameter information. If you execute run.sh script, you can reproduce our model. And when you want to compare with L2 baseline, turn off the cges option to False.

./run.sh

Acknowledgement

This work was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT & Future Planning (NRF-2016M3C4A7952634).

Authors

Jaehong Yoon¹, and Sung Ju Hwang¹²

¹MLVR Lab @ School of Electrical and Computer Engineering, UNIST, Ulsan, South Korea

²AItrics @ Seoul, South Korea

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
README.md		README.md
main.py		main.py
mnist_model.py		mnist_model.py
run.sh		run.sh
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Combined Group and Exclusive Sparsity for Deep Networks(CGES)

Reference

Running Code

Get our code

Run examples

Acknowledgement

Authors

About

Releases

Packages

Languages

jaehong31/CGES

Folders and files

Latest commit

History

Repository files navigation

Combined Group and Exclusive Sparsity for Deep Networks(CGES)

Reference

Running Code

Get our code

Run examples

Acknowledgement

Authors

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages