-
Notifications
You must be signed in to change notification settings - Fork 7
Description
Your 2019 target tracking paper, "Learning Q-network for Active Information Acquisition," is really interesting work. I'd like to build on and cite it, so I'm trying to reproduce the results. I've had success running the code, but I've been unsuccessful in getting results that match Fig. 2.
The version of the code that appears to most closely match the description in the paper is a19dd2a. I've started with the simplest case: the "ADFQ, TS" method for the "Single Target - Empty Environment" problem. I'm running the following in Google Colaboratory; I believe this matches all the available information provided in the paper:
%tensorflow_version 1.x
!git clone https://github.com/coco66/ADFQ.git
!cd ADFQ; git checkout -b reproduce; git reset --hard a19dd2a2e5c012e3bd7f6129c7beab51313b6f39; cd ..
!pip install filterpy
python_paths = ['ADFQ', 'ADFQ/classic_adfq', 'ADFQ/deep_adfq']
import sys
sys.path.extend(python_paths)
import os
os.environ['PYTHONPATH'] += ':' + ':'.join(python_paths)
!echo -e '\n' | python ADFQ/deep_adfq/run_tracking.py --nb_train_steps 3000 --act_policy bayesian --map empty
The versions are:
- Python 3.6.9
- TensorFlow 1.15.2
- filterpy 1.4.5
This is the image batch_0/mean_nlogdetcov.png in the results directory:
and this is the moving average with window 4 of mean_nlogdetcov:
which doesn't match Fig. 2 of the paper.
What am I missing? Was a different version of the code used to generate the results in the paper? Are there other options/parameters that I need to explicitly set to match those used for the paper?

