DeepState: Selecting Massive Test Suites to Enhance the Robustness of Recurrent Neural Networks

DeepState has conducted experiments on the combination of 6 models and data sets. To illustrate the usage of the code of this project, the example of testing LSTM model trained with MNIST data set is illustrated as follows.

Environment

python=3.6

pip install -r requirements.txt

Preparing an RNN model as the test object

cd RNNModels/mnist_demo
python3 mnist_lstm.py -type "train"

After the training is completed, the output is as follows, and the trained model will be saved in the ./RNNModels/mnist_demo/models/mnist_lstm.h5.

Epoch 20/20
54000/54000 [==============================] - 10s 188us/step - loss: 0.0112 - accuracy: 0.9963 - val_loss: 0.0548 - val_accuracy: 0.9878

Preparing the data set for selection

For evaluating RQ1, we generate 30 different dataset for selection:

# generate the augmented data for selection
cd ../../gen_data/gen_test_dataset
python3 dau_mnist.py
python3 gen_toselect_dataset.py -dataset "mnist"   # for RQ1 & RQ2

For evaluation RQ3, we generate the dataset for selection and retraining and the test set for evaluation:

# generate the augmented data for selection and retraining
cd ../gen_retrain_dataset
python3 dau_mnist.py
python3 gen_retrain.py -dataset "mnist"   # for RQ3

If you feel that the steps to generate the augmented data are troublesome, or you just want to reproduce the results, you can download the data we generated here.

Generating the abstract model used for calculating the DeepStellar-coverage

The coverage calculation of DeepStellar requires an abstract model to be generated in advance. This part of the code comes from DeepStellar 's open source code.

cd ../..
python3 ./abstraction_runner.py -test_obj "mnist_lstm"

RQ1

python3 rq1.py -dl_model "./RNNModels/mnist_demo/models/mnist_lstm.h5" -model_type "lstm" -dataset "mnist"

The results will be saved in ./exp_results/rq1 .

RQ2

python3 rq2.py -dl_model "./RNNModels/mnist_demo/models/mnist_lstm.h5" -model_type "lstm" -dataset "mnist"

The results will be saved in ./exp_results/rq2 .

RQ3

First, we need to prepare the model before retraining:

cd RNNModels/mnist_demo
python3 mnist_lstm.py -type "retrain"

Then, we can evaluate the RQ3:

python3 rq3.py -dl_model "./RNNModels/mnist_demo/models/mnist_lstm_ori.h5" -model_type "lstm" -dataset "mnist"

The results will be saved in ./exp_results/rq3 .

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
.idea		.idea
RNNModels		RNNModels
deepstellar		deepstellar
exp_results		exp_results
gen_data		gen_data
README.md		README.md
abstraction_runner.py		abstraction_runner.py
requirements.txt		requirements.txt
rq1.py		rq1.py
rq2.py		rq2.py
rq3.py		rq3.py
rq3_100tests.py		rq3_100tests.py
selection_tools.py		selection_tools.py
statics.py		statics.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DeepState: Selecting Massive Test Suites to Enhance the Robustness of Recurrent Neural Networks

Environment

Preparing an RNN model as the test object

Preparing the data set for selection

Generating the abstract model used for calculating the DeepStellar-coverage

RQ1

RQ2

RQ3

About

Uh oh!

Releases

Packages

Languages

SSCT-Lab/DeepState

Folders and files

Latest commit

History

Repository files navigation

DeepState: Selecting Massive Test Suites to Enhance the Robustness of Recurrent Neural Networks

Environment

Preparing an RNN model as the test object

Preparing the data set for selection

Generating the abstract model used for calculating the DeepStellar-coverage

RQ1

RQ2

RQ3

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages