Download `data.zip` from this Google Drive directory and place the `./data` folder in the project directory. You can also place the data in a different location, which must then be specified using the `--data_dir` argument.
This directory contains:
- Wikipedia training data as a parquet (pyarrow) table. Each row is a training example with an uttered s-v-o triple and its corresponding pseudo-negative. Specifically, the columns are `(subject verb object negative_subject negative_verb negative_object subject_synset object_synset negative_subject_synset negative_object_synset)`, where the synsets are those disambiguated using BERT-WSD.
- Plausibility judgements for evaluation (PEP-3K and Twenty Questions) in `.tsv` format.
- Filtered WordNet saved as `.tsv` files: `lemma2synsets` is a mapping from lemmas to corresponding synsets, `synset2hc` is a mapping from synset to hypernym chain, and `synset2lemma` is a mapping from synset to lemma. A sketch of loading these files is shown after this list.
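As a minimal sketch of loading these files with pandas (the file names `wiki_train.parquet`, `lemma2synsets.tsv`, `synset2hc.tsv`, and `synset2lemma.tsv` are assumptions; check the actual names in `./data`):

```python
import pandas as pd

# Training table of s-v-o triples and their pseudo-negatives
# (file name is an assumption; reading parquet requires pyarrow).
train = pd.read_parquet("./data/wiki_train.parquet")
print(train.columns.tolist())

# Filtered WordNet mappings (file names and column layout are assumptions).
lemma2synsets = pd.read_csv("./data/lemma2synsets.tsv", sep="\t")
synset2hc = pd.read_csv("./data/synset2hc.tsv", sep="\t")
synset2lemma = pd.read_csv("./data/synset2lemma.tsv", sep="\t")
```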
First, make sure you have the requirements specified in the `requirements.txt` file. For example, create a new virtual environment and install them:
```
virtualenv ./env
source ./env/bin/activate
pip install -r requirements.txt
```
To train a model, run the training script, specifying the model type (i.e. `roberta` or `conceptmax`):
```
python src/train.py \
    --model_type roberta
```
The training dataset and WordNet data will be cached the first time training is run.
You can override the default directories and also resume training from an existing checkpoint:
```
python src/train.py \
    --model_name_or_path $MODEL_PATH \
    --data_dir $DATA_DIR \
    --output_dir $OUTPUT_DIR \
    --cache_dir $CACHE_DIR \
    --model_type conceptmax \
    --ckpt_path $CHECKPOINT_DIR/last.ckpt \
    --stage train
```
I haven't merged in `conceptinject`, as its performance is similar to `roberta`. Please let me know if you need this model.
You can test a model by specifying `test` as the `--stage` and pointing to the PyTorch Lightning checkpoint to be evaluated, e.g.:
```
python src/train.py \
    --model_type conceptmax \
    --ckpt_path $CHECKPOINT_DIR/last.ckpt \
    --stage test
```
The Google Drive directory also contains PyTorch Lightning checkpoints for the trained models. You can, for example, evaluate these models by downloading the relevant `.ckpt` file and then running the test stage:
```
python src/train.py \
    --model_type roberta \
    --ckpt_path ./roberta-plausibility.ckpt \
    --stage test
```
The AUC results of these models are higher than those reported in the paper. I think this might be due to a smaller Wikipedia validation split (and thus a larger training set):
| Model | PEP-3K Valid | PEP-3K Test | 20 Questions Valid | 20 Questions Test |
| --- | --- | --- | --- | --- |
| RoBERTa | 0.702 | 0.678 | 0.692 | 0.688 |
| ConceptMax | 0.679 | 0.698 | 0.746 | 0.757 |
Models can be run using the Slurm Workload Manager. See `./jobs`.
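As a rough sketch of what such a job might look like (the resource requests and paths below are assumptions; the actual scripts in `./jobs` take precedence):

```bash
#!/bin/bash
#SBATCH --job-name=plausibility-train
#SBATCH --gres=gpu:1          # assumption: one GPU is sufficient
#SBATCH --mem=32G             # assumption: adjust to your cluster
#SBATCH --time=24:00:00

# Activate the environment created above and start training.
source ./env/bin/activate
python src/train.py \
    --model_type conceptmax \
    --data_dir ./data \
    --output_dir ./output \
    --stage train
```

Such a script would be submitted with `sbatch`.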