This repository contains the code for the CNN experiments presented in the paper, along with additional functionality. The codebase builds on the STR codebase, modified to support STR-BN.
- Clone this repository.
- Using Python 3.6, create a virtual environment with `python -m venv myenv` and activate it with `source myenv/bin/activate`. You can also use `conda` to create a virtual environment.
- Install requirements with `pip install -r requirements.txt` for `venv`, or with the appropriate `conda` commands for a `conda` environment.
- Create a data directory `<data-dir>`. To run the ImageNet experiments, there must be a folder `<data-dir>/imagenet` containing the ImageNet `train` and `val` splits.
Users can take `STR-BN` and use it in most PyTorch-based models, as it inherits from `nn.BatchNorm2d` (also referred to here as `LearnedBatchNorm`). The hyperparameters of `STR-BN`, which include the `sparseFunction`, have not been explored thoroughly enough to provide users with default settings. This is experimental code and contributions are welcome.
This codebase contains model architectures for ResNet18, ResNet50, and MobileNetV1, with support for training them on ImageNet-1K. We have provided config files for training ResNet50 and MobileNetV1, which can be modified for other architectures and datasets. To support more datasets, please add new dataloaders to the `data` folder.
Training across multiple GPUs is supported; however, the user should check the minimum number of GPUs required to scale to ImageNet-1K.

Train dense models on ImageNet-1K:

ResNet50: `python main.py --config configs/largescale/resnet50-dense.yaml --multigpu 0,1,2,3`

MobileNetV1: `python main.py --config configs/largescale/mobilenetv1-dense.yaml --multigpu 0,1,2,3`
Train models with STR-BN on ImageNet-1K:

ResNet50: `python main.py --config configs/largescale/resnet50-str-bn.yaml --multigpu 0,1,2,3`

MobileNetV1: `python main.py --config configs/largescale/mobilenetv1-str-bn.yaml --multigpu 0,1,2,3`
The user can explore and search for the right hyperparameters of `STR-BN` through the config files in `configs`.
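For a sense of what such a sweep might touch, the fragment below sketches STR-BN-related settings in a config. The key names here are hypothetical stand-ins; match them against the fields actually used in the provided `configs/largescale` YAML files before editing:

```yaml
# Hypothetical STR-BN config fragment; real key names may differ.
sInit_value: -200        # initial value of the learned threshold parameter s
sparse_function: sigmoid # g(s) used inside the soft thresholding
lr: 0.1
weight_decay: 0.0001
```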
The `budgets` folder contains CSV files with all the non-uniform sparsity budgets STR learnt for ResNet50 on ImageNet-1K across all sparsity regimes, along with baseline budgets for 90% sparse ResNet50 on ImageNet-1K. If you are not able to use the pretrained models to extract sparsity budgets, you can import the same budgets directly from these files. Structured sparsity methods that take a layer-wise sparsity budget could potentially utilize these budgets, learnt through STR for unstructured sparsity.
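As a minimal sketch of importing such a budget, assuming each CSV row maps a layer name to its learned sparsity (the actual column layout of the files in `budgets` may differ, so check them first):

```python
import csv
import io

def load_budget(csv_text):
    """Parse a layer-wise sparsity budget from CSV text into a dict.
    Assumed (hypothetical) format: one 'layer_name,sparsity' row per layer."""
    budget = {}
    for layer, sparsity in csv.reader(io.StringIO(csv_text)):
        budget[layer] = float(sparsity)
    return budget

# Hypothetical two-layer budget in the assumed format.
sample = "layer1.conv1,0.85\nlayer1.conv2,0.92"
print(load_budget(sample))  # {'layer1.conv1': 0.85, 'layer1.conv2': 0.92}
```

The resulting dict can then be handed to whatever pruning routine consumes per-layer sparsity targets.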
If you find this project useful in your research, please consider citing:

```
@article{Kusupati20a,
  author  = {Kusupati, Aditya},
  title   = {Adapting Unstructured Sparsity Techniques for Structured Sparsity},
  journal = {Technical Report},
  year    = {2020},
}
```