Augmented Random Forest

Simple Post-Training Robustness Using Test Time Augmentations and Random Forest

This repo reproduces all the reuslts shown in our paper.

Init project

Run in the project dir:

source ./init_project.sh

Create validation set and test set indices for all dataset by running:

python src/scripts/set_val_test_inds.py

This generates the 'test' and 'test-val' subsets (as explained in the paper) for each dataset

Train

Train Resnet networks for cifar10, cifar100, svhn, and tiny_imagenet using src/train.py.

For example, for CIFAR-10 run:

Regular network:

python src/scripts/train.py --dataset cifar10 --net resnet34 --checkpoint_dir /tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_00

TRADES:

python src/scripts/train.py --dataset cifar10 --net resnet34 --checkpoint_dir /tmp/adversarial_robustness/cifar10/resnet34/adv_robust_trades --adv_trades True

VAT:

python src/scripts/train.py --dataset cifar10 --net resnet34 --checkpoint_dir /tmp/adversarial_robustness/cifar10/resnet34/adv_robust_vat --adv_vat True

If you wish also to reproduce results for the ensemble, train 9 more networks in:

/tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_01
/tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_02
/tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_03
/tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_04
/tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_05
/tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_06
/tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_07
/tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_08
/tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_09

Attack

For attacking a network, use src/attack.py.

For example, to attack CIFAR-10 with the $FGSM^2$ attack (defined in the paper), run:

python src/scripts/attack.py --checkpoint_dir /tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_00 --attack fgsm --targeted True --attack_dir fgsm2 --eps 0.031

Prior to training the Random Forest classifier, one has to generate all the non-adapted attacks in section 4 in the paper: fgsm1, fgsm2, jsma, pgd1, pgd2, cw, cw_Linf, square, and boundary. The complete set of attacks one must run is given here:

[$FGSM^1$]:

python src/scripts/attack.py --checkpoint_dir /tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_00 --attack fgsm --targeted True --attack_dir fgsm1 --eps 0.01

[$FGSM^2$]:

python src/scripts/attack.py --checkpoint_dir /tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_00 --attack fgsm --targeted True --attack_dir fgsm2 --eps 0.031

[JSMA]:

python src/scripts/attack.py --checkpoint_dir /tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_00 --attack jsma --targeted True --attack_dir jsma

[$PGD^1$]:

python src/scripts/attack.py --checkpoint_dir /tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_00 --attack pgd --targeted True --attack_dir pgd1 --eps 0.01 --eps_step 0.003

[$PGD^2$]:

python src/scripts/attack.py --checkpoint_dir /tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_00 --attack pgd --targeted True --attack_dir pgd2 --eps 0.031 --eps_step 0.003

[Deepfool]:

python src/scripts/attack.py --checkpoint_dir /tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_00 --attack deepfool --targeted False --attack_dir deepfool

[$CW_{L_2}$]:

python src/scripts/attack.py --checkpoint_dir /tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_00 --attack cw --targeted True --attack_dir cw

[$CW_{L_\infty}$]:

python src/scripts/attack.py --checkpoint_dir /tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_00 --attack cw_Linf --targeted True --attack_dir cw_Linf --eps 0.031

[Square]:

python src/scripts/attack.py --checkpoint_dir /tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_00 --attack square --targeted False --attack_dir square --eps 0.031

[Boundary]:

python src/scripts/attack.py --checkpoint_dir /tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_00 --attack boundary --targeted True --attack_dir boundary

Fit the random forest

After attacking a network with the above 10 attack, train the random forest by running:

python src/scripts/train_random_forest.py --checkpoint_dir /tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_00

The random forest parameters will be saved under: /tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_00/random_forest/random_forest_classifier.pkl

Adaptive white-box BPDA attack:

After saving the random forest weights, you can attack the ARF defense.

First, create a substitute model using:

python src/scripts/train_random_forest_sub.py --checkpoint_dir /tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_00

Second, call the BPDA attack:

python src/scripts/attack.py --checkpoint_dir /tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_00 --attack bpda --targeted True --eps 0.031 --eps_step 0.007 --max_iters 10

Evaluation

Use src/scrips/eval.py to evaluate the defenses.

For evaluation a plain model without any defense, run:

python src/scripts/eval.py --checkpoint_dir /tmp/adversarial_robustness/cifar10/resnet34/regular/resnet34_00 --method simple --attack_dir <YOUR_SELECTED_ATTACK> --dump_dir simple

For calculating accuracy on the Ensemble, TTA, or ARF, replace the "simple" above with "ensemble", "tta", or "random_forest", respectively.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
TRADES		TRADES
VAT_pytorch		VAT_pytorch
adversarial_robustness_toolbox/art		adversarial_robustness_toolbox/art
src		src
README.md		README.md
init_project.sh		init_project.sh
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Augmented Random Forest

Init project

Train

Attack

Fit the random forest

Adaptive white-box BPDA attack:

Evaluation

About

Releases

Packages

Languages

giladcohen/ARF

Folders and files

Latest commit

History

Repository files navigation

Augmented Random Forest

Init project

Train

Attack

Fit the random forest

Adaptive white-box BPDA attack:

Evaluation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages