
Achieving Sparse Activation in Small Language Models

Introduction

This is the official code repository for the paper "Achieving Sparse Activation in Small Language Models". We aim to achieve sparse activation in small language models (SLMs). We show that existing magnitude-based sparse activation cannot be directly applied to SLMs, and that gradient-based attribution scores are a better choice. By applying a corrective term to the existing GxO attribution metric, our approach achieves an 80% sparsification ratio on SLMs with less than 5% accuracy loss.
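For intuition, the GxO score attributes each unit's importance as the product of its output and the gradient of the loss with respect to that output. Below is a minimal, illustrative PyTorch sketch of plain GxO scoring; the function name is ours, and the corrective term that turns this into the Corrected GxO metric is defined in the paper and the scripts, not reproduced here.

import torch

def gxo_scores(layer_output, loss):
    # layer_output: activations of one MLP layer (or per-head attention outputs),
    # captured during the forward pass and still attached to the autograd graph.
    grad = torch.autograd.grad(loss, layer_output, retain_graph=True)[0]
    # GxO: elementwise gradient * output, taken in absolute value per unit.
    return (grad * layer_output).abs()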

Requirements

Install all the required packages:

pip install -r requirements.txt

General Usage

We use Phi-2 and the TruthfulQA dataset to demonstrate an example of sparse activation. The following steps can be used to generate accuracy-sparsity trade-off curves based on various metrics, including the proposed Corrected GxO metric.

Folder Creation

Create the folders for the generated results with the following command:

python3 folder_creation.py

Label Generation

Generate the labels for sparse activation

python3 label_generation.py
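A rough sketch of what this step could look like, assuming the labels are the dense model's reference outputs on TruthfulQA; the model name, dataset config, and output path below are illustrative and not necessarily what label_generation.py actually does.

import json
import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("microsoft/phi-2")
model = AutoModelForCausalLM.from_pretrained("microsoft/phi-2", torch_dtype=torch.float16, device_map="auto")
data = load_dataset("truthful_qa", "generation")["validation"]

labels = []
for sample in data:
    inputs = tokenizer(sample["question"], return_tensors="pt").to(model.device)
    with torch.no_grad():
        output = model.generate(**inputs, max_new_tokens=64)
    labels.append(tokenizer.decode(output[0], skip_special_tokens=True))

with open("result/truthfulqa/labels.json", "w") as f:  # illustrative output path
    json.dump(labels, f)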

Magnitude Generation

We can use the following commands to generate the output magnitude of each attention head and MLP neuron:

python3 Mag_attention.py
python3 Mag_mlp.py
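Conceptually, these magnitudes can be collected with forward hooks on the relevant modules. The sketch below records the per-neuron mean absolute outputs of the MLP up-projections; the mlp.fc1 module name is an assumption about Phi-2's architecture, and the actual scripts may aggregate magnitudes differently.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("microsoft/phi-2")
tokenizer = AutoTokenizer.from_pretrained("microsoft/phi-2")
magnitudes = {}

def make_hook(name):
    def hook(module, inputs, output):
        # Mean absolute activation of every neuron in this layer.
        magnitudes[name] = output.detach().abs().mean(dim=(0, 1))
    return hook

for name, module in model.named_modules():
    if name.endswith("mlp.fc1"):  # assumed Phi-2 MLP naming; attention heads are handled analogously
        module.register_forward_hook(make_hook(name))

inputs = tokenizer("What happens if you eat watermelon seeds?", return_tensors="pt")
with torch.no_grad():
    model(**inputs)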

Attribution Score Generation

We can use the following commands to generate the various attribution-based scores of each attention head and MLP neuron.

Gradient

python3 attribution_attention.py --metric gradient

Gradient*Output (GxO)

python3 attribution_attention.py --metric gxo

Integrated gradients (IG) with 20 interpolations (the number of interpolation steps can be any positive integer)

python3 attribution_attention.py --metric ig --n_steps 20
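For reference, IG averages gradients taken at interpolation points between a zero baseline and the actual activation, then scales by the activation. Below is a simplified, generic sketch; forward_from_activation is a hypothetical helper that recomputes the loss with the layer output replaced by the given tensor, and the actual script may structure this differently.

import torch

def integrated_gradients(forward_from_activation, activation, n_steps=20):
    baseline = torch.zeros_like(activation)
    total_grad = torch.zeros_like(activation)
    for step in range(1, n_steps + 1):
        alpha = step / n_steps
        # Interpolate between the baseline and the actual activation.
        point = (baseline + alpha * (activation - baseline)).detach().requires_grad_(True)
        loss = forward_from_activation(point)
        total_grad += torch.autograd.grad(loss, point)[0]
    # Average path gradient times (activation - baseline).
    return (activation - baseline) * total_grad / n_steps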

Apply sparse activation based on the output magnitude or the different attribution scores, and plot the accuracy-sparsity trade-off curves:

Magnitude

python3 main.py --metric_name magnitude

Gradient

python3 main.py --metric_name gradient

GxO

python3 main.py --metric_name gxo

SNIP

python3 main.py --metric_name snip

Corrected GxO

python3 main.py --metric_name cor_gxo

We can then check the accuracy-sparsity trade-off curves under result/truthfulqa/res/both.
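The sparsification step itself amounts to keeping the top-scoring heads/neurons at a target sparsity ratio and zeroing out the rest. A minimal illustrative sketch of that masking is shown below; the names are ours, and the masking in main.py may differ in detail.

import torch

def sparse_activation_mask(scores, sparsity):
    # Keep the (1 - sparsity) fraction of units with the highest scores, zero the rest.
    n_keep = max(1, int(round(scores.numel() * (1 - sparsity))))
    threshold = torch.topk(scores.flatten(), n_keep).values.min()
    return (scores >= threshold).float()

scores = torch.rand(32, 80)                   # illustrative per-unit attribution scores
mask = sparse_activation_mask(scores, 0.8)    # deactivate 80% of units
masked_output = mask * torch.randn(32, 80)    # apply the mask to the units' outputs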

Citation

@article{song2024achieving,
  title={Achieving Sparse Activation in Small Language Models},
  author={Song, Jifeng and Huang, Kai and Yin, Xiangyu and Yang, Boyuan and Gao, Wei},
  journal={arXiv e-prints},
  pages={arXiv--2406},
  year={2024}
}
