Reproduction of Nystromformer (Nyströmformer: A Nyström-Based Algorithm for Approximating Self-Attention, AAAI 2021) based on PaddlePaddle.
Nystromformer official repo: https://github.com/mlpen/Nystromformer
Nystromformer paper link: https://arxiv.org/pdf/2102.03902v3.pdf
- paper_summary: A brief summary of the ideas in the Nystromformer paper.
- nystromformer_paddle: Code of the model reproduced with PaddlePaddle.
- pretrained_files: The pretrained tokenizer files and model parameters. Note that we converted the pretrained weights provided by Hugging Face (which were in turn converted from the weights in the original paper repo) into PaddlePaddle format; a conversion sketch follows this list.
- data: Data files. Currently contains only the tokenized IMDB dataset.
- compare_code: Code that compares our reproduced model with the Hugging Face PyTorch version, showing that the model structure and the training procedure are exactly the same. PyTorch is required to run the code in this directory.
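For reference, a minimal sketch of such a PyTorch-to-PaddlePaddle checkpoint conversion, assuming a Hugging Face style state dict; the file names and the transpose rule here are illustrative assumptions, not our exact conversion script:

```python
import paddle
import torch

# Sketch: convert a PyTorch checkpoint into PaddlePaddle format.
# "pytorch_model.bin" / "model_state.pdparams" are placeholder names.
torch_state = torch.load("pytorch_model.bin", map_location="cpu")

paddle_state = {}
for name, tensor in torch_state.items():
    array = tensor.detach().cpu().numpy()
    # nn.Linear stores weights as (out_features, in_features) in PyTorch
    # but (in_features, out_features) in Paddle, so 2-D linear weights
    # are transposed; embedding and LayerNorm weights are kept as-is.
    if name.endswith(".weight") and array.ndim == 2 and "embeddings" not in name:
        array = array.T
    paddle_state[name] = array

paddle.save(paddle_state, "model_state.pdparams")
```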
- paddlepaddle >= 2.2.0
- datasets >= 1.18.1
- reprod_log (for installation, see https://github.com/WenmuZhou/reprod_log); a usage sketch appears below
Running the code in compare_code additionally requires transformers >= 4.16.0.dev0.
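For illustration, a typical reprod_log alignment check looks roughly like the following; the key and file names are placeholders, and the same logging is done on the PyTorch side to produce forward_torch.npy:

```python
import numpy as np
from reprod_log import ReprodLogger, ReprodDiffHelper

# Log an output tensor on the Paddle side (replace the random array
# with the real model output converted to numpy).
logger = ReprodLogger()
logger.add("logits", np.random.rand(2, 2).astype("float32"))
logger.save("forward_paddle.npy")

# Compare the Paddle log against the PyTorch log and write a report.
helper = ReprodDiffHelper()
paddle_info = helper.load_info("forward_paddle.npy")
torch_info = helper.load_info("forward_torch.npy")
helper.compare_info(paddle_info, torch_info)
helper.report(diff_method="mean", diff_threshold=1e-6, path="forward_diff.log")
```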
Clone our repo and, under the Nystromformer-Paddle directory, simply run:

```bash
python run.py
```
This script fine-tunes the pretrained model on the IMDB dataset and reproduces the expected result on the validation set.
To change the training configuration, simply modify lines 16-22 of run.py; a hypothetical example of such a block is sketched below.
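A minimal sketch of what such a configuration block could look like; the variable names and values here are assumptions for illustration, not the actual contents of run.py:

```python
# Hypothetical training configuration (the real names in run.py may differ).
learning_rate = 5e-5   # assumed fine-tuning learning rate
batch_size = 8         # assumed per-device batch size
num_epochs = 3         # assumed number of fine-tuning epochs
max_seq_length = 512   # nystromformer-512 handles sequences up to 512 tokens
seed = 42              # assumed random seed
```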
We fine-tuned the model on the IMDB dataset.
Using the pretrained weights of nystromformer-512 (https://huggingface.co/uw-madison/nystromformer-512), we obtain an F1 score of 93.24 on the validation set, higher than the 93.0 reported in the original paper. The training info is saved in fine_tune_log.npy.
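To inspect the log programmatically, a .npy file holding a dict can be read as follows (assuming the log was written with np.save on a dict, which stores it as a pickled 0-d array):

```python
import numpy as np

# Load the training log; allow_pickle is needed for dict-valued .npy files.
log = np.load("fine_tune_log.npy", allow_pickle=True).item()
print(sorted(log.keys()))
```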