Disaster Tweets

Summary

This project aims to leverage advanced natural language processing (NLP) techniques using DistilBert, GRU (Gated Recurrent Unit), LSTM (Long Short-Term Memory), and RNN (Recurrent Neural Network) to analyze the Disaster Tweets dataset. The primary goal is to develop a model that can accurately classify tweets as either related to a disaster or not. The project utilizes state-of-the-art (SOTA) deep learning models and explores their performance in handling text classification tasks.

Goals

Implement and compare the performance of DistilBert, GRU, LSTM, and RNN for disaster tweet classification.
Develop a robust and accurate model for identifying tweets related to disasters.
Explore the strengths and weaknesses of different architectures in the context of natural language processing.

About the data

The Kaggle dataset used in this project consists of tweets labeled as either disaster-related or non-disaster-related. Each tweet is associated with a binary label indicating whether it is relevant to a disaster or not. The data exploration process will involve understanding the distribution of classes, preprocessing text data, and preparing it for model training.

Approach

Data Preprocessing:

Tokenization, padding, and leveraging GloVe embeddings for word representation.
Removal of frequent words.

Model Architecture:

DistilBert: Utilizing a pre-trained transformer model for contextualized embeddings.
GRU, LSTM, and RNN: Employing recurrent neural network architectures for sequence modeling.

Model Evaluation

Model evaluation using metrics such as accuracy, precision, recall, and F1 score

Observations

Simple Neural Networks (e.g. Single LSTM, GRU or Recurrent layer) have a similar performance than more complex models like DistilBert

Future Works:

Implement it using cloud computing. This will allow me to test more complex models and possibly achieve better performance. E.g increase embedding length, more complex NN architectures.
Try other embeddings. E.g. fasttext
Use Optuna to perform hyperparameter tunning in the NN, including how many hidden layers and neurons.
Embeddings preprocessing within the model artifact. Similar to sklearn pipelines.

References:

Fake news classification: Definition Several models https://github.com/Isoken00/-Fake-News-Classification-in-Python/tree/main

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
figures		figures
mlruns		mlruns
models		models
notebooks		notebooks
outcome		outcome
scr		scr
.gitignore		.gitignore
20_RunNLP.ipynb		20_RunNLP.ipynb
README.md		README.md
testing.ipynb		testing.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Disaster Tweets

Summary

Goals

About the data

Approach

Data Preprocessing:

Model Architecture:

Model Evaluation

Observations

Future Works:

References:

About

Releases

Packages

Languages

JPonsa/nlp_disaster_tweets

Folders and files

Latest commit

History

Repository files navigation

Disaster Tweets

Summary

Goals

About the data

Approach

Data Preprocessing:

Model Architecture:

Model Evaluation

Observations

Future Works:

References:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages