Mark IMDB reviews as positive or negative using stacked LSTMs.


maciek-pioro/imdb-sentiment-analysis

imdb-sentiment-analysis

Mark IMDB reviews as either positive or negative using stacked LSTMs. Training and testing are handled with PyTorch Lightning; experiment data is logged with ClearML.
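A rough sketch of the architecture described above. The hyperparameters, vocabulary size, and layer sizes are illustrative assumptions, and in the actual project a PyTorch Lightning module would wrap this model for training:

```python
import torch
import torch.nn as nn

class SentimentLSTM(nn.Module):
    """Stacked-LSTM binary sentiment classifier (illustrative sketch)."""

    def __init__(self, vocab_size=20_000, embed_dim=128, hidden_dim=256, num_layers=2):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        # num_layers > 1 stacks LSTM layers on top of each other
        self.lstm = nn.LSTM(embed_dim, hidden_dim, num_layers=num_layers, batch_first=True)
        self.classifier = nn.Linear(hidden_dim, 1)  # single logit: positive vs. negative

    def forward(self, token_ids):
        embedded = self.embedding(token_ids)            # (batch, seq, embed_dim)
        _, (hidden, _) = self.lstm(embedded)            # hidden: (num_layers, batch, hidden_dim)
        return self.classifier(hidden[-1]).squeeze(-1)  # one logit per review

model = SentimentLSTM()
logits = model(torch.randint(1, 20_000, (4, 50)))  # batch of 4 reviews, 50 tokens each
```

Only the final layer's last hidden state feeds the classifier; the review's label is then `logits > 0` after a sigmoid-based loss such as `BCEWithLogitsLoss`.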

Examples of sentences from the test set

Model performance

Future improvement directions

There are a number of possible approaches that could both raise the test accuracy and speed up the training process:

  • Use of pre-trained word embeddings: this is a standard approach in NLP. The current model learns word embeddings from scratch, which is time-consuming, and the IMDB dataset alone is too small to reach SOTA performance this way. Various pre-trained embeddings, such as Google's Word2Vec or Stanford's GloVe, are available under permissive licenses.
  • Elimination of rare tokens: words that occur only rarely (e.g. hapax legomena) appear too infrequently for the model to learn meaningful embeddings for them, so they mainly inflate the vocabulary. The usefulness of this step would be diminished if pre-trained embeddings were used.
  • Use of Transformers: the LSTM is no longer regarded as a SOTA architecture. The most powerful NLP models today are Transformers, which rely on self-attention, i.e. they learn to focus on the parts of the input that seem most relevant to the task at hand.
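The first suggestion could look like the following sketch. The file path, the GloVe-style text format (a word followed by its vector components on each line), and the vocabulary layout are assumptions for illustration:

```python
import numpy as np
import torch
import torch.nn as nn

def load_pretrained_embeddings(path, vocab, embed_dim=100):
    """Build an embedding layer from a GloVe-style text file.

    Words missing from the file keep a small random vector.
    """
    matrix = np.random.normal(scale=0.1, size=(len(vocab), embed_dim))
    with open(path, encoding="utf-8") as f:
        for line in f:
            word, *values = line.rstrip().split(" ")
            if word in vocab:
                matrix[vocab[word]] = np.asarray(values, dtype=np.float32)
    # freeze=False lets the pretrained vectors be fine-tuned on IMDB
    return nn.Embedding.from_pretrained(
        torch.tensor(matrix, dtype=torch.float32), freeze=False
    )
```

The returned layer would replace the randomly initialized `nn.Embedding` in the model, so training only has to adapt the vectors rather than learn them from scratch.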
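A minimal sketch of rare-token elimination, assuming a simple frequency threshold and `<pad>`/`<unk>` special tokens (both the threshold and the token names are illustrative choices):

```python
from collections import Counter

def build_vocab(tokenized_reviews, min_freq=2):
    """Keep only tokens seen at least `min_freq` times; the rest map to <unk>."""
    counts = Counter(tok for review in tokenized_reviews for tok in review)
    vocab = {"<pad>": 0, "<unk>": 1}
    for token, count in counts.most_common():
        if count >= min_freq:
            vocab[token] = len(vocab)
    return vocab

reviews = [["great", "movie"], ["great", "plot"], ["mediocre", "movie"]]
vocab = build_vocab(reviews, min_freq=2)
# "plot" and "mediocre" are hapax legomena here, so they fall back to <unk>
```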
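The self-attention mechanism mentioned in the last bullet can be sketched as scaled dot-product attention (the single-head form and the shapes are simplifications of a full Transformer layer):

```python
import math
import torch

def scaled_dot_product_attention(q, k, v):
    """Core of the Transformer's self-attention.

    Each position's output is a weighted average of all value vectors,
    with weights given by query-key similarity.
    """
    scores = q @ k.transpose(-2, -1) / math.sqrt(q.size(-1))  # (batch, seq, seq)
    weights = torch.softmax(scores, dim=-1)                   # each row sums to 1
    return weights @ v

x = torch.randn(1, 5, 16)                    # one sequence of 5 token vectors
out = scaled_dot_product_attention(x, x, x)  # self-attention: q = k = v = x
```

Unlike an LSTM, every position attends to every other position in one step, so long-range dependencies do not have to survive a recurrent state.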
