Releases · NickleDave/thrillington · GitHub

04 Feb 12:58

NickleDave

Ramón Pre-release

Pre-release

Added

ability to use only a subset of MNIST training data and get a validation set from it
ability to shuffle dataset on each epoch
ability to train replicates (experiment repeated n times with same training data,
only random initialization / shuffling different)
ability to specify other modules to use to load other datasets
- an example is provided to load the kanji MNIST dataset
logging of runs/experiments, with option to dump to a text file
tests for MNIST module in datasets
a CHANGELOG (this one)

Changed

change argparser to use positional arguments command and config
- before all arguments were "optional" (although the program would crash without them)
many changes to training, in attempt to reproduce original paper + reconcile different versions
- currently: use pdf of Gaussian for policy gradient of location network, and
  normalize both baseline, target of baseline, and advantage to decrease variance and
  to keep gradient from exploding

Fixed

fix action network and glimpse network, did not have correct number of layers

Assets 2

04 Feb 12:57

NickleDave

Ramone Latest

Latest

Added

original version, probably closest to the one from Kevin Zakka
but in Tensorflow (Eager) instead of PyTorch

Assets 2