Repository containing the results of the Natural Language Understanding (NLU) course project on Language Modelling.
The Language Modelling (LM) task proposed for the NLU course required me to:
- implement a Language Model using one of the RNN architectures (e.g. vanilla RNN, LSTM, GRU);
- train it and evaluate its performance on the word-level Penn Treebank (PTB) dataset;
- reach a baseline perplexity (PP) of 140 using a vanilla RNN, or 90.7 using an LSTM (see the perplexity sketch after this list).
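For reference, perplexity is the exponential of the model's average per-token cross-entropy loss (in nats). A minimal sketch of the computation (the helper name and the numbers are illustrative):

```python
import math

def perplexity(total_cross_entropy: float, num_tokens: int) -> float:
    """Perplexity = exp of the average per-token cross-entropy (in nats)."""
    return math.exp(total_cross_entropy / num_tokens)

# Example: an average loss of about 4.51 nats per token matches the
# LSTM baseline above, since exp(4.5076) ~= 90.7 PP.
print(perplexity(total_cross_entropy=4.5076 * 1000, num_tokens=1000))
```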
As a starting point, I decided to implement a very basic model made of:
- a neural embedding layer;
- an LSTM, to capture context information;
- a fully connected layer, for the final word prediction (see the sketch after this list).

This baseline model obtained 137 PP.
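A minimal PyTorch sketch of this baseline (the class name, layer sizes, and hyperparameters are illustrative, not the exact configuration behind the 137 PP result):

```python
import torch
import torch.nn as nn

class BaselineLM(nn.Module):
    """Embedding -> LSTM -> fully connected layer over the vocabulary."""

    def __init__(self, vocab_size: int, emb_dim: int = 400, hidden_dim: int = 400):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, emb_dim)          # neural embedding layer
        self.lstm = nn.LSTM(emb_dim, hidden_dim, batch_first=True)  # captures context information
        self.fc = nn.Linear(hidden_dim, vocab_size)                 # final word prediction

    def forward(self, token_ids: torch.Tensor) -> torch.Tensor:
        # token_ids: (batch, seq_len) -> logits: (batch, seq_len, vocab_size)
        embedded = self.embedding(token_ids)
        output, _ = self.lstm(embedded)
        return self.fc(output)

# Usage: the logits feed a cross-entropy loss against the next token at each position.
model = BaselineLM(vocab_size=10000)  # word-level PTB has a 10k-word vocabulary
logits = model(torch.randint(0, 10000, (20, 35)))  # dummy (batch, seq_len) input
```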
To improve on this result, I considered the techniques described by Merity et al., reaching 81.43 PP; one of them is sketched below.
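One of the techniques described in that paper is weight tying, which shares the embedding matrix with the output layer's weights. A minimal sketch of how it could be applied to the baseline above (this is only one of the paper's techniques; the exact combination used in this project is not restated here):

```python
import torch
import torch.nn as nn

class TiedLM(nn.Module):
    """Embedding -> LSTM -> output layer tied to the embedding weights."""

    def __init__(self, vocab_size: int, dim: int = 400):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, dim)
        # Weight tying requires the embedding and hidden sizes to match,
        # so that both weight matrices are (vocab_size, dim).
        self.lstm = nn.LSTM(dim, dim, batch_first=True)
        self.fc = nn.Linear(dim, vocab_size)
        self.fc.weight = self.embedding.weight  # share one parameter tensor

    def forward(self, token_ids: torch.Tensor) -> torch.Tensor:
        output, _ = self.lstm(self.embedding(token_ids))
        return self.fc(output)
```

Tying roughly halves the embedding/softmax parameter count and acts as a regularizer.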
The Examination Board awarded the project full marks (30 cum laude).