Weight Fixing Networks

This repo contains the Pytorch + Lightning code to apply the method proposed in 'Weight Fixing Networks' an accepted paper in ECCV 2022.

Quantized Model Saves

Below we link to the quantised model saves quoted in the results section of the paper.

Model	δ	Unique Param Count	Entropy	Acc	Link
ResNet-18	0.0075	193	4.15	70.3	link
ResNet-18	0.01	164	3.01	69.7	link
ResNet-18	0.015	90	2.72	67.3	link
ResNet-34	0.0075	233	3.87	73.0	link
ResNet-34	0.01	164	3.48	72.6	link
ResNet-34	0.015	117	2.83	72.2	link
ResNet-50	0.0075	261	4.11	76.0	link
ResNet-50	0.01	199	4.00	75.4	link
ResNet-50	0.015	125	3.55	75.1	link

To Run

First make sure you have installed the requirements found in requirements.txt
If you want to run the ImageNet experiments, you'll need to update the data_dir (see - Setting ImageNet file locations)
Now just run python pretrained_model_experiments.py with any options arguments you wish to change from:

Optional arguments:

--distance_allowed : δ in the paper

--percentages : the percentage of weights clustered in each iteration

--optimizer : training optimizer

--experiment_name: used to save tb logs and model saves

--scheduler : the learning rate scheduler

-- lr : the learning rate

--first_epoch : the number of training iterations before any clustering (set to zero for pre-trained models)

--fixing_epochs : the number of training epochs within a single clustering iteration (3 was used in the paper)

--model : the name of the model to train see get_model() for a list of out-the-box supported dataset-model combinations

--dataset : the dataset to train on, currently we support CIFAR-10 and Imagenet

--zero_distance : $\gamma_0$ in the paper, any abs weight less than this will be set to zero and prunned

--regularisation_ratio : the weighting of the $\mathcal{L}_{reg}$ term

--bn_inc : whether to quantize batch-norm layers

--resume : if continuing training, set this to the iteration you wish to continue from

--calculation_type: the distance type you want to use, relative and euclidean supported

Applying WFN to Different Types of Models

To add new models for WFN quantisation, go to the get_model function within pre_trained_model_experiments.py and follow the format. Once added here, the new model will automatically be converted in a weight_fix_base which contains all the functionality needed to apply the clustering.

def get_model(model_name, data):
    """ Here is where the models are defined, if you would like to use a new model, you can insert it into here """

    if model_name == 'conv4':
        model = All_Conv_4()
        model = model.load_from_checkpoint(checkpoint_path="Pretrained_Models/PyTorch_CIFAR10/cifar10_models/state_dicts/all_conv4")

    if model_name == 'resnet18' and data == 'cifar10':
        model = resnet18(pretrained=True)

    if model_name == 'resnet34' and data == 'imnet':
        model = models.resnet34(pretrained=True)

Setting ImageNet File Locations

Go to the ImageNet module within Datasets/imagenet.py and edit the line self.data_dir to point to your imagenet data directory.

class ImageNet_Module(pl.LightningDataModule):
        def __init__(self, data_dir = 'Datasets/', shuffle_pixels=False, shuffle_labels=False, random_pixels=False):
                super().__init__()
                self.data_dir = 'Your data directory here'
                self.mean = [0.485, 0.456, 0.406]
                self.std = [0.229, 0.224, 0.225]
                self.normalise = transforms.Normalize(mean=self.mean, std=self.std)
                self.transform = self.transform_select(shuffle_pixels, random_pixels)
                self.test_trans = self.test_transform()
                self.target_transform = self.target_transform_select(shuffle_labels)
                self.targets = 1000
                self.dims = (3,224,224)
                self.bs = 64

Name		Name	Last commit message	Last commit date
Latest commit History 130 Commits
Comparison_Models		Comparison_Models
Datasets		Datasets
External_Models		External_Models
Models		Models
PyTorch_CIFAR10/cifar10_models		PyTorch_CIFAR10/cifar10_models
Utility		Utility
.all_conv_experiments.py.swo		.all_conv_experiments.py.swo
.all_conv_experiments.py.swp		.all_conv_experiments.py.swp
.gitignore		.gitignore
.test.py.swp		.test.py.swp
README.md		README.md
Test.ipynb		Test.ipynb
check_all_results.ipynb		check_all_results.ipynb
experiment_bash_run.py		experiment_bash_run.py
pre_trained_model_experiments.py		pre_trained_model_experiments.py
requirements.txt		requirements.txt
script.sh		script.sh
test_a_model.py		test_a_model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Weight Fixing Networks

Quantized Model Saves

To Run

Applying WFN to Different Types of Models

Setting ImageNet File Locations

About

Releases

Packages

Languages

wanderingweights/Weight_Fix_Networks

Folders and files

Latest commit

History

Repository files navigation

Weight Fixing Networks

Quantized Model Saves

To Run

Applying WFN to Different Types of Models

Setting ImageNet File Locations

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages