
Adversarial pNML

This is the official implementation of the paper "A Universal Learning Approach for Adversarial Defense".

Figure (from the paper): CIFAR10 l-infinity PGD accuracy as a function of attack strength, for our method and the previous state of the art.

Requirements:

  1. Clone the repository.

  2. Create the conda environment: conda env create -f environment.yml

  3. Activate the conda environment: conda activate pnml_adv3

  4. Download the CIFAR10 model from https://github.com/yaircarmon/semisup-adv and the ImageNet model from https://github.com/locuslab/fast_adversarial.

  5. Edit the json parameter files within ./src/parameters: inside the "model" block, set "ckpt_path" to the path of the downloaded model (see the sketch after this list).

  6. Download the ImageNet validation set from http://www.image-net.org/challenges/LSVRC/2012/downloads into the ./data/imagenet/val directory.
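
As referenced in step 5, the "ckpt_path" field lives inside the "model" block of each parameter file. The snippet below is only a rough sketch of that block: the surrounding file layout and the example path are assumptions, and only the "ckpt_path" and "pnml_active" keys are taken from this README.

"model": {
    "ckpt_path": "/path/to/downloaded/checkpoint.pt",
    "pnml_active": false
}

Here "pnml_active" is the flag that is switched to true later on to enable the adversarial pNML scheme.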

Evaluate the model:

White-box attack

Base model

The following command evaluates the robustness of the base model against a PGD attack:

python src/eval.py -o <output_dir_path> -t <experiment_type> -p <params_path> --general.save <bool>
# For MNIST:
python src/eval.py -o ./output -t mnist_adversarial -p ./src/parameters/mnist_params.json
# For CIFAR10:
python src/eval.py -o ./output -t cifar_adversarial -p ./src/parameters/cifar_params.json
# For ImageNet:
python src/eval.py -o ./output -t imagenet_adversarial -p ./src/parameters/imagenet_params.json
  • output_dir_path - the directory where the script writes its output.
  • experiment_type - Type of experiment, must be one of the following: 'mnist_adversarial', 'cifar_adversarial', 'imagenet_adversarial'.
  • params_path - A path to a json parameter file that contains the experiment parameters.
  • general.save - Whether to save the generated adversarial samples. Default value is False.

The specific details of the experiment are given in the json parameter file.

To evaluate natural accuracy, use the adv_attack_test.attack_type argument. For example, for MNIST:

python src/eval.py -o ./output -t mnist_adversarial -p ./src/parameters/mnist_params.json --adv_attack_test.attack_type natural
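
Judging from the examples in this README, json parameters can be overridden from the command line with dotted flags (at least general.save and adv_attack_test.attack_type are shown this way). Combining the two in one call, as below, is an assumption based on that pattern rather than a documented feature:

python src/eval.py -o ./output -t mnist_adversarial -p ./src/parameters/mnist_params.json --adv_attack_test.attack_type natural --general.save false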

Adversarial pNML

To evaluate adversarial pNML (with any base model), set the field "model"->"pnml_active" to true in the json parameter file. For example, to evaluate the robustness of adversarial pNML against an adaptive attack, run:

python src/eval.py -o ./output -t mnist_adversarial -p ./src/parameters/mnist_params_pnml_adaptive.json

To evaluate adversarial pNML against a PGD attack, first generate PGD adversarial samples with the base model and save them to an adversarials.t file; then evaluate those samples under the adversarial pNML scheme. To generate the adversarials.t file, set general.save to true and run eval.py:

python src/eval.py -o ./output -t mnist_adversarial -p ./src/parameters/mnist_params.json --general.save true

Then, locate adversarials.t in the output folder and update the json parameter file:

  • Set adv_attack_test->black_box_adv_path to the path of adversarials.t.
  • Set adv_attack_test->attack_type to "natural".
  • Make sure model->pnml_active is true.

Then run eval.py with the updated parameter file:
python src/eval.py -o ./output -t mnist_adversarial -p ./src/parameters/mnist_params_pnml_pgd.json
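
For reference, here is a shell sketch of the full two-step flow. It is an illustration, not part of the repository: it assumes the jq tool is installed, that the parameter file nests the fields as adv_attack_test.black_box_adv_path, adv_attack_test.attack_type, and model.pnml_active, and that adversarials.t ends up directly under ./output (the actual location may differ).

# Step 1: attack the base model with PGD and save the adversarial samples.
python src/eval.py -o ./output -t mnist_adversarial -p ./src/parameters/mnist_params.json --general.save true

# Step 2: patch a copy of the pNML parameter file to point at the saved samples
# (field layout and file location are assumed; adjust to match your run).
jq '.adv_attack_test.black_box_adv_path = "./output/adversarials.t"
    | .adv_attack_test.attack_type = "natural"
    | .model.pnml_active = true' \
    ./src/parameters/mnist_params_pnml_pgd.json > ./output/mnist_params_pnml_pgd.json

# Step 3: evaluate adversarial pNML on the saved samples.
python src/eval.py -o ./output -t mnist_adversarial -p ./output/mnist_params_pnml_pgd.json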

Black-box attack

To evaluate the HSJA (HopSkipJump) black-box attack, run:

python src/eval_hsj.py -o <output_dir_path> -t <experiment_type> -p <params_path>
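
For example, for MNIST (assuming eval_hsj.py accepts the same parameter file as the white-box evaluation; this pairing is illustrative, not documented here):

python src/eval_hsj.py -o ./output -t mnist_adversarial -p ./src/parameters/mnist_params.json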

Training:

python src/train.py -o <output_dir_path> -t <experiment_type> -p <params_path>
# For toy dataset:
python src/train.py -o ./output -t synthetic -p ./src/parameters/synthetic_params.json
# For MNIST:
python src/train.py -o ./output -t mnist_adversarial -p ./src/parameters/mnist_params.json
# For CIFAR10:
python src/train.py -o ./output -t cifar_adversarial -p ./src/parameters/cifar_params.json
