Project on Meta-Learning for the "Advanced Topics on Computational Semantics" course (UvA, 2020)
- Single-task training can be started by running the following line in your Command Line Interface:
python3 train.py
. Possible command-line options include:batch_size
to set the number of elements in a batchrandom_seed
to set a seed for reproducibilityepochs
to set the number of epochs to run fordataset
, e.g.NLI
orIBM
to set the dataset you want to use (for single task only)- More parameters can be found by executing
train.py --help
or in the filetrain.py
itself.
Similarly Multi-task training, training of Prototypical networks and Proto-MAML can be started with the python scripts multitask.py
, prototypes.py
and protomaml.py
.
.data/multinli
for the NLI dataset- Downloading is taken care of by PyTorch
.data/ibm
for the IBM dataset- Download dataset 3.1 from here.
.data/MRPC
for the MRPC dataset- Download dataset from here.
.data/pdb
for the Penn Discourse Bank- Contact us for an appropriatly pre-processed version; or
- Download the raw dataset from here.
.data/SICK
for SICK- Download dataset from here.