A repository that implements three adversarial training methods, FGSM, "Free" and PGD, published in the paper "Fast is better than free: Revisiting adversarial training" by Eric Wong, Leslie Rice, and Zico Kolter. These methods are used to train textCNN models, whose performance is then compared with an original textCNN model trained in the ordinary way.
By introducing perturbations into the embeddings of inputs, adversarial training regularizes the model's parameters to improve its robustness and generalization. It is assumed that small perturbations of the inputs should not change the distribution of the outputs.
The paper presents three different adversaries, as follows:
FGSM, short for Fast Gradient Sign Method, is summarized below:
The pseudo code above shows how FGSM performs an adversarial attack for each input x:
- Initialize the perturbation δ from a uniform distribution between -ε and ε.
- Add the perturbation to the input x, calculate the gradient of the loss with respect to δ, and update δ as below: δ ← clamp(δ + α·sign(∇δ ℓ(f(x+δ), y)), -ε, ε)
- Update the model weights θ with some optimizer, e.g. SGD: θ ← θ − τ·∇θ ℓ(f(x+δ), y)
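As a minimal sketch (not the repository's actual code), the FGSM step above might look like the following in PyTorch, assuming `model` consumes embedded inputs directly and `epsilon`, `alpha` are hyperparameters:

```python
import torch
import torch.nn as nn

def fgsm_train_step(model, x_embed, y, optimizer, criterion,
                    epsilon=0.1, alpha=0.12):
    """One FGSM training step on a batch of embedded inputs (illustrative sketch)."""
    # 1) Initialize the perturbation uniformly in [-epsilon, epsilon]
    delta = torch.zeros_like(x_embed).uniform_(-epsilon, epsilon)
    delta.requires_grad_(True)

    # 2) One forward/backward pass to obtain the gradient w.r.t. delta,
    #    then take a signed step and project back into the epsilon-ball
    loss = criterion(model(x_embed + delta), y)
    loss.backward()
    delta = (delta + alpha * delta.grad.sign()).clamp(-epsilon, epsilon).detach()

    # 3) Update the model weights on the adversarial example
    optimizer.zero_grad()
    loss = criterion(model(x_embed + delta), y)
    loss.backward()
    optimizer.step()
    return loss.item()
```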
The pseudo code of "Free" is as follows:
Free adversarial training can be regarded as repeating several FGSM attacks on one batch of x:
- For each minibatch x, the Free adversary performs the FGSM adversarial attack N times in a row.
- In every FGSM attack, it computes the gradients for the perturbation and the model weights simultaneously, in a single backward pass: δ ← clamp(δ + ε·sign(∇δ ℓ(f(x+δ), y)), -ε, ε)
This formula is similar to FGSM's; the only difference is the coefficient of the sign function (ε instead of α).
- Update the model weights θ with some optimizer, e.g. SGD: θ ← θ − τ·∇θ ℓ(f(x+δ), y)
Note: because the Free adversary attacks N times for each batch, the number of training epochs can be decreased to T/N.
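The Free steps above can be sketched as follows (an illustrative sketch, not the repository's actual code; `delta` is kept across minibatches, and the weight update reuses the gradients from the same backward pass):

```python
import torch
import torch.nn as nn

def free_train_step(model, x_embed, y, optimizer, criterion, delta,
                    epsilon=0.1, n_repeats=4):
    """N "free" replays of one minibatch; delta persists across batches (sketch)."""
    for _ in range(n_repeats):
        delta.requires_grad_(True)
        loss = criterion(model(x_embed + delta), y)
        optimizer.zero_grad()
        loss.backward()  # one backward pass: gradients for delta AND the weights
        # Perturbation step: the coefficient of sign() is epsilon (vs. alpha in FGSM)
        delta = (delta + epsilon * delta.grad.sign()).clamp(-epsilon, epsilon).detach()
        optimizer.step()
    return loss.item(), delta
```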
PGD adversarial training updates the perturbation N times before updating the model weights:
The steps are as follows:
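A minimal sketch of a PGD training step (illustrative only; the inner loop refines the perturbation with `torch.autograd.grad` so the model weights are untouched until the final step):

```python
import torch
import torch.nn as nn

def pgd_train_step(model, x_embed, y, optimizer, criterion,
                   epsilon=0.1, alpha=0.02, n_steps=5):
    """PGD: refine the perturbation n_steps times, then one weight update (sketch)."""
    delta = torch.zeros_like(x_embed).uniform_(-epsilon, epsilon)
    # Inner loop: only the perturbation is updated
    for _ in range(n_steps):
        delta.requires_grad_(True)
        loss = criterion(model(x_embed + delta), y)
        grad = torch.autograd.grad(loss, delta)[0]
        delta = (delta + alpha * grad.sign()).clamp(-epsilon, epsilon).detach()
    # Outer step: update the model weights on the final adversarial example
    optimizer.zero_grad()
    loss = criterion(model(x_embed + delta), y)
    loss.backward()
    optimizer.step()
    return loss.item()
```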
To implement the three adversarial training methods above, this repository includes two different implementations:
- encapsulate an adversarial training class that adds perturbations to the inputs during training, for methods such as FGSM, PGD, etc.
- create a textCNN class that is able to add perturbations when an instance calls the forward function, for methods such as Free, etc.
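The first approach could be sketched roughly as below (the class name `FGSMPerturb` and the `embed_name` parameter are illustrative, not the repository's actual API): perturb the embedding weights along the sign of their gradient for an extra adversarial backward pass, then restore them before the optimizer step.

```python
import torch
import torch.nn as nn

class FGSMPerturb:
    """Illustrative helper: perturb a model's embedding weights, then restore them."""

    def __init__(self, model, embed_name="embedding", epsilon=0.1):
        self.model = model
        self.embed_name = embed_name  # substring identifying the embedding parameter
        self.epsilon = epsilon
        self.backup = {}

    def attack(self):
        # Call after loss.backward(): step embedding weights along sign of gradient
        for name, param in self.model.named_parameters():
            if self.embed_name in name and param.grad is not None:
                self.backup[name] = param.data.clone()
                param.data.add_(self.epsilon * param.grad.sign())

    def restore(self):
        # Undo the perturbation before the optimizer updates the clean weights
        for name, param in self.model.named_parameters():
            if name in self.backup:
                param.data = self.backup[name]
        self.backup = {}
```

A typical usage pattern would be: `loss.backward()`, then `attack()`, a second backward pass on the perturbed embeddings to accumulate adversarial gradients, `restore()`, and finally `optimizer.step()`.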
- python=3.8.8
- joblib==1.0.1
- numpy==1.20.1
- pandas==1.2.4
- scikit_learn==1.0.2
- torch==1.11.0
- tqdm==4.59.0
The training data comes from this github site and consists of two hundred thousand news headlines from THUCNews. There are 10 classes (finance, realty, stocks, education, science, society, politics, sports, game and entertainment), with twenty thousand texts of length between 20 and 30 characters per class. The table below shows how the data set is divided:
| | Count |
|---|---|
| Training set | 180,000 |
| Validation set | 10,000 |
| Test set | 10,000 |
| Number of classes | 10 |
The model inputs are the characters of the texts, and the pre-trained character embeddings come from Sogou News. Click here to download them.
| | Precision | Recall | F1 | Accuracy |
|---|---|---|---|---|
| normal | 91.54 | 91.54 | 1.83 | 91.54 |
| FGSM | 92.05 | 91.98 | 1.84 | 91.97 |
| Free | 90.14 | 89.94 | 1.80 | 89.94 |
| PGD | 92.10 | 92.03 | 1.84 | 92.03 |
According to the metrics listed above, PGD adversarial training has the best performance, but it takes at least twice as long to train a model. The model trained with FGSM performs nearly as well as the PGD one while requiring only about half of PGD's training time, though that is still roughly twice the time of training a model in the ordinary way.