
QACNNnet

This repository contains a project whose goal is to solve an interesting and challenging NLP task: question answering.

Branches

Currently there are two branches:

  • main: this branch contains the latest features required for the PROJECT WORK IN NLP, and it includes new files purposely created for applying data augmentation to the original SQuAD dataset.
  • releases/QACNNnet-v1: this branch contains the first version of the model, which was released for the NLP exam.

Data augmentation

To support data augmentation, a new package called augmentation has been added to the project. This package contains modules that can be conveniently used to generate new augmented datasets. However, the datasets previously agreed upon with the professors have already been pushed to the repository, so it should not be necessary to rerun these modules unless new augmented datasets are required.
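
The exact module names inside the augmentation package are not listed in this README, so the following is only a minimal, hypothetical sketch of one common SQuAD augmentation strategy: synonym replacement applied to questions only, which leaves contexts and answer character offsets valid. The NLTK-based approach and the file paths are assumptions, not the project's actual implementation.

    import json
    import random

    import nltk
    from nltk.corpus import wordnet

    nltk.download("wordnet", quiet=True)
    nltk.download("omw-1.4", quiet=True)

    def synonym_replace(text, p=0.1):
        # Replace a fraction of tokens with a random WordNet synonym.
        out = []
        for tok in text.split():
            syns = wordnet.synsets(tok)
            if syns and random.random() < p:
                out.append(syns[0].lemmas()[0].name().replace("_", " "))
            else:
                out.append(tok)
        return " ".join(out)

    # Hypothetical input/output paths: the repository's real file names may differ.
    with open("data/training_set.json") as f:
        squad = json.load(f)

    # Augment only the questions, so the original answer spans stay untouched.
    for article in squad["data"]:
        for paragraph in article["paragraphs"]:
            for qa in paragraph["qas"]:
                qa["question"] = synonym_replace(qa["question"])

    with open("data/training_set_augmented.json", "w") as f:
        json.dump(squad, f)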

Moreover, the embedding layer of the model has been updated to support fastText (in place of GloVe).
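
As an illustration of what a fastText-backed embedding layer typically looks like, here is a minimal sketch that loads a fastText .vec file into a frozen Keras Embedding layer. The vector file name, the toy vocabulary, and the 300-dimensional size are assumptions, not details taken from the project code.

    import numpy as np
    import tensorflow as tf

    EMBEDDING_DIM = 300  # standard dimensionality of the released fastText vectors

    def load_fasttext(path, vocab):
        # Build an embedding matrix from a fastText .vec text file,
        # keeping only the words present in the model vocabulary.
        matrix = np.zeros((len(vocab), EMBEDDING_DIM), dtype=np.float32)
        with open(path, encoding="utf-8") as f:
            next(f)  # skip the "<n_words> <dim>" header line
            for line in f:
                word, *values = line.rstrip().split(" ")
                if word in vocab:
                    matrix[vocab[word]] = np.asarray(values, dtype=np.float32)
        return matrix

    # vocab maps word -> row index; out-of-vocabulary rows stay zero.
    vocab = {"<pad>": 0, "question": 1, "answer": 2}
    weights = load_fasttext("data/crawl-300d-2M.vec", vocab)  # assumed file name
    embedding_layer = tf.keras.layers.Embedding(
        input_dim=weights.shape[0],
        output_dim=EMBEDDING_DIM,
        weights=[weights],
        trainable=False,  # keep the pre-trained vectors frozen
    )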

Finally, note that all the theoretical information about the applied data augmentation can be found in the docs folder.

Description

QACNNnet is the neural network created to solve the question answering task; it is mainly composed of CNNs combined with attention mechanisms.

The report that accompanies the project contains all the details about the network, from the architecture specifications to the various results obtained during our testing.

For convenience, the architecture overview and the instructions for running the evaluation scripts are reported here.

Architecture Overview

The following picture depicts an overview of the implemented model architecture.

[Figure: overview of the model architecture]
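
The exact layer configuration lives in the project code and in the report; since the description above combines CNNs with attention, the block below is a speculative, QANet-style encoder sketch in Keras. All hyperparameters are illustrative, not the project's actual values.

    import tensorflow as tf
    from tensorflow.keras import layers

    def encoder_block(x, filters=128, kernel_size=7, num_heads=8):
        # Depthwise-separable convolution with a residual connection.
        res = x
        y = layers.LayerNormalization()(x)
        y = layers.SeparableConv1D(filters, kernel_size,
                                   padding="same", activation="relu")(y)
        x = layers.Add()([res, y])

        # Multi-head self-attention with a residual connection.
        res = x
        y = layers.LayerNormalization()(x)
        y = layers.MultiHeadAttention(num_heads=num_heads,
                                      key_dim=filters // num_heads)(y, y)
        x = layers.Add()([res, y])

        # Position-wise feed-forward layer with a residual connection.
        res = x
        y = layers.LayerNormalization()(x)
        y = layers.Dense(filters, activation="relu")(y)
        return layers.Add()([res, y])

    inputs = tf.keras.Input(shape=(400, 128))  # (sequence length, model dim)
    model = tf.keras.Model(inputs, encoder_block(inputs))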

The data folder

It is mandatory to copy the content of the data folder into the project files; it can be downloaded from the following link: https://drive.google.com/drive/folders/1HAOte5VlHcmSgkMv4-G6JlXyFRLHqGME?usp=sharing.
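
A possible way to fetch the folder programmatically, assuming the third-party gdown package has been installed (pip install gdown):

    import gdown

    # Download the shared Google Drive folder into the project's data directory.
    url = "https://drive.google.com/drive/folders/1HAOte5VlHcmSgkMv4-G6JlXyFRLHqGME?usp=sharing"
    gdown.download_folder(url=url, output="data", quiet=False)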

The final result that you should see inside the project data folder is:

[Screenshot: expected contents of the data folder]

Evaluation Script

To evaluate the model, it is necessary to execute the following two scripts.

  • compute_answers.py: this script is responsible for creating the predictions.json file, which is required by the following script (the expected format of this file is sketched after this list). As an argument, it requires the path of the dataset to be tested. Usage example:

      python compute_answers.py "data/dev_set.json"
    

    It is important that the model weights are correctly loaded before the script proceeds to compute the answers. If everything is OK (e.g., the data folder has been correctly imported into the project), then you should see the following logs:

      ....
      Loading model's weights...
      Model's weights successfully loaded!
      Model succesfully built!  
      ...
    
  • evaluate.py: after executing the previous script, it is possible to proceed with the actual evaluation of the model. The script requires two arguments: the path of the dataset to be tested (the same one used in the previous script) and the path of the predictions.json file, which is automatically created in the project's predictions folder. Usage example:

      python evaluate.py "data/dev_set.json" "predictions/predictions.json"
    
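For reference, predictions.json presumably follows the standard SQuAD prediction format: a flat JSON object mapping each question id to the predicted answer string. A minimal sketch of inspecting it (the id in the comment is illustrative):

    import json

    # predictions.json maps question ids to answer strings,
    # e.g. {"56be4db0acb8001400a502ec": "Denver Broncos", ...}
    with open("predictions/predictions.json") as f:
        predictions = json.load(f)

    for question_id, answer in list(predictions.items())[:5]:
        print(question_id, "->", answer)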

Error Analysis Scripts

error_analyzer.py: to launch a statistical error analysis of the model based on its predictions, run the module error_analyzer.py with the same parameters used for the evaluation script evaluate.py. Usage example:

    python error_analyzer.py "data/dev_set.json" "predictions/predictions.json"

question_classifier.py: to launch the question classifier based on the model's predictions (see the official report for details), run the module question_classifier.py with the same parameters as above. Usage example:

    python question_classifier.py "data/dev_set.json" "predictions/predictions.json"

Contributors

  • Gaetano Signorelli
  • Daniele Sirocchi
