TrojDRL: Evaluation of Backdoor Attacks on Deep Reinforcement Learning

This repository is the official open source implementation of the paper: TrojDRL: Evaluation of Backdoor Attacks on Deep Reinforcement Learning accepted at DAC 2020.

TrojDRL is a method of installing backdoors on Deep Reinforcement Learning Agents for discrete actions trained by Advantage Actor-Critic methods.

The implementation is based on the paac (Parallel Advantage Actor-Critic) method from the Efficient Parallel Methods for Deep Reinforcement Learning that uses Tensorflow 1.13.1.
We recommend installing the dependencies using the env.yml
- Install anaconda
- Open env.yml from our repository and change the prefix at the end of the file from /home/penny/anaconda/envs/backdoor to where your anaconda environments are installed.
- Run conda env create -f env.yml

train: $ python3 train.py --game=breakout --debugging_folder=data/strong_targeted/breakout/ --poison --color=100 --attack_method=targeted --pixels_to_poison_h=3 --pixels_to_poison_v=3 --target_action=2 --start_position="0,0"
test without attack: $ python3 test.py --folder=data/strong_targeted/breakout/ --no-poison --index=80000000 --gif_name=breakout
test with attack: $ python3 test.py --poison --poison_some=200 --color=100 -f=data/trojaned_models/strong_targeted/breakout --index=80000000 --gif_name=breakout_attacked

breakout: The target action is move to the right. The trigger is a gray square on the top left.
Strong Targeted-Attacked Agent

Untargeted-Attacked Agent
seaquest:
Weak Targeted-Attacked Agent
(More results under pretrained_models)

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
atari_roms		atari_roms
paac		paac
pretrained/trojaned_models		pretrained/trojaned_models
.gitignore		.gitignore
README.md		README.md
actor_learner.py		actor_learner.py
adversary.py		adversary.py
atari_emulator.py		atari_emulator.py
emulator_runner.py		emulator_runner.py
env.yml		env.yml
environment.py		environment.py
environment_creator.py		environment_creator.py
evaluator.py		evaluator.py
logger_utils.py		logger_utils.py
networks.py		networks.py
paac.py		paac.py
policy_v_network.py		policy_v_network.py
runners.py		runners.py
test.py		test.py
train.py		train.py

Provide feedback