This repository tests various recurrent neural network architectures on baseline datasets SeqMNIST and pMNIST.
The network architectures chosen were those deemed to be the most effective currently available.
Architectures tested include:
- RNN
- Peephole LSTM - https://ieeexplore.ieee.org/document/861302/
- Intersection RNN - https://arxiv.org/pdf/1611.09913.pdf
The following results are were generated using the architectures listed above.
Hyperparameters used: layers 3, num neurons 50, optimizer Adam, learning rate .0001 and batch 64.
python train.py --model-type=irnn --task=seqmnist --layers=2 --batch-size=64 --epochs=10
Update BASE_DIR in config.ini with the absolute path to the current directory.
Packages needed to run the code include:
- numpy
- python
- PyToch
- argparse
- configparser