This project compares the effectiveness of Parameter-Efficient Fine-Tuning (PEFT) techniques across various NLP tasks. PEFT methods can democratize access to high-performance LLMs by making fine-tuning more resource-efficient and accessible for diverse applications. The project implements three PEFT methods: Prefix-Tuning, Parallel Adapters, and Low-Rank Adaptation (LoRA). By running experiments on tasks including emotion classification, topic classification, multiple-choice question answering, and distractor generation, the project aims to identify the most effective PEFT technique for a given number of operations.
Tested Python version: 3.9
Tested CUDA version: 12.3
It's recommended to first create a virtual environment for the project and activate it:
python -m venv peft-env
source peft-env/bin/activate
Install the dependencies:
pip install -r requirements.txt --find-links https://download.pytorch.org/whl/torch_stable.html
Create a .env file and set your Weights & Biases key. A fictional example is provided here:
WANDB_API_KEY=5B5FCFA35882C50972C4CF6AA89DBFCA19608A28
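The project presumably loads this file at startup (for example via python-dotenv). For illustration only, here is a minimal stdlib-only sketch of KEY=VALUE parsing; the function name `parse_env` is hypothetical and not part of the project:

```python
def parse_env(text):
    """Parse .env-style KEY=VALUE lines into a dict (hypothetical helper).

    Blank lines and '#' comments are skipped; values keep everything
    after the first '='.
    """
    env = {}
    for line in text.splitlines():
        line = line.strip()
        if line and not line.startswith("#") and "=" in line:
            key, _, value = line.partition("=")
            env[key.strip()] = value.strip()
    return env

env = parse_env("WANDB_API_KEY=abc123\n# a comment\n")
```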
The main.py file is the entry point of the project; obtain help with:
python main.py -h
Some usage examples are the following:
python main.py -d=commonsense_qa -e=4 -ee=100 FT
It executes a normal fine-tuning experiment on the CommonsenseQA dataset for 4 epochs with evaluation every 100 steps.
python main.py -d=race -e=1 -ee=100 LoRA -r=8 -a=8
It executes a LoRA experiment on the RACE dataset with a rank of 8 and alpha of 8. It runs for 1 epoch with evaluation every 100 steps.
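For intuition on what the -r (rank) and -a (alpha) flags control: LoRA freezes the pre-trained weight matrix W and learns a low-rank update B·A scaled by alpha/r, which can be merged back into W. A plain-Python sketch with a hypothetical helper name, not the project's actual implementation:

```python
def lora_update(W, A, B, alpha, r):
    """Return W + (alpha / r) * B @ A, using nested lists as matrices.

    W: d_out x d_in frozen weight; B: d_out x r; A: r x d_in.
    With alpha == r (as in the -r=8 -a=8 example) the scale is 1.0.
    """
    scale = alpha / r
    d_out, d_in = len(W), len(W[0])
    return [[W[i][j] + scale * sum(B[i][k] * A[k][j] for k in range(r))
             for j in range(d_in)] for i in range(d_out)]

# Tiny example: d_out = d_in = 2, rank r = 1.
W = [[1.0, 0.0], [0.0, 1.0]]
B = [[1.0], [2.0]]        # d_out x r
A = [[0.5, 0.5]]          # r x d_in
merged = lora_update(W, A, B, alpha=1, r=1)
```

Only A and B are trained, so the number of tuned parameters grows with r rather than with the full d_out x d_in weight.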
python main.py -d=ag_news -e=1 -ee=100 prefix -nt=100
It executes a Prefix-Tuning experiment on the AG News dataset with a prefix length of 100. It runs for 1 epoch with evaluation every 100 steps.
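For intuition on the -nt flag (number of prefix tokens): Prefix-Tuning keeps the model frozen and prepends nt trainable vectors to each attention layer's key/value sequence, so attention spans nt + seq_len positions. A minimal sketch with hypothetical names, not the project's actual implementation:

```python
def with_prefix(prefix, keys):
    """Prepend nt trainable prefix vectors to the frozen model's keys.

    prefix: nt x d list of trainable vectors (the only tuned parameters);
    keys:   seq_len x d list produced by the frozen model.
    """
    return prefix + keys

prefix = [[0.1, 0.2] for _ in range(100)]   # nt = 100, as in -nt=100
keys = [[1.0, 0.0], [0.0, 1.0]]             # seq_len = 2
extended = with_prefix(prefix, keys)
# attention now computes scores over 100 + 2 positions
```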
python main.py -m=large -d=commonsense_qa --debug=DEBUG -e=4 -ee=100 adapters -rf=8
It executes a Parallel Adapters experiment on the CommonsenseQA dataset with a reduction factor of 8. It runs for 4 epochs with evaluation every 100 steps. It executes the large version of the T5 model and sets the logging level to DEBUG.
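For intuition on the -rf flag: a parallel adapter is a small bottleneck network (hidden size divided by the reduction factor) whose output is added in parallel to the frozen sublayer's output. A pure-Python sketch under those assumptions, with hypothetical names:

```python
def matvec(M, x):
    """Multiply matrix M (list of rows) by vector x."""
    return [sum(row[j] * x[j] for j in range(len(x))) for row in M]

def parallel_adapter(x, sublayer_out, W_down, W_up):
    """Add a bottleneck adapter branch in parallel to a frozen sublayer.

    With hidden size d and reduction factor rf, W_down is (d/rf) x d and
    W_up is d x (d/rf); only these two matrices are trained.
    """
    h = [max(0.0, v) for v in matvec(W_down, x)]   # ReLU bottleneck
    up = matvec(W_up, h)
    return [s + u for s, u in zip(sublayer_out, up)]

# d = 4, rf = 2 -> bottleneck size 2
x = [1.0, -1.0, 2.0, 0.0]
sub = [0.5, 0.5, 0.5, 0.5]
W_down = [[1.0, 0.0, 0.0, 0.0],
          [0.0, 0.0, 1.0, 0.0]]
W_up = [[1.0, 0.0],
        [0.0, 1.0],
        [0.0, 0.0],
        [0.0, 0.0]]
out = parallel_adapter(x, sub, W_down, W_up)
```

A smaller reduction factor means a wider bottleneck and more trainable parameters.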
You can lint the code with:
pylint ./
You may disable Weights & Biases reporting (the run will be treated as a dummy run) by creating a .env file and setting:
WANDB_MODE=disabled
The project is structured as follows:
assets: Contains images and other assets used for explaining the project.
downloads: Created when the program is executed; contains the downloaded models and datasets.
plotting: Contains scripts to plot the results of the experiments.
results: The results of the experiments are stored here. Our results are kept in the repository for reference.
src: Contains the source code of the project.
The source code is structured as follows:
dataset: Contains the dataset base class and the implementations of the datasets used in the experiments.
models: Contains the code for downloading the model and applying the PEFT techniques.
utils: Contains the training functions and other auxiliary functions.
A diagram of the program flow is shown below: