Multi-Class Text Emotion Analysis

Social_Sentify is a project to develop rule-based and deep learning algorithms with an aim to first appropriately detect the different types of emotions contained in a collection of English sentences or a large paragraph and then accurately predict the overall emotion of the paragraph.

I have two training and validation dataset:

emotion_data.csv in which basic pre-processing of tweets in done (no lemmatization, no removal of stopwords).
This dataset is comprised of 55,774 tweets from Twitter with labelled emotions of five classes: Neutral, Happy, Sad, Love, Anger.
emotion_data_prep.csv in which more deep pre-processing of tweets in done (lemmatization, removal of stopwords, etc).
This dataset is comprised of 62,015 tweets from Twitter with labelled emotions of five classes: Neutral, Happy, Sad, Love, Anger.

Comparison of DL and ML models:

DL:

The DLModel using emotion_data.csv gave me 64.80% accuracy.

Confusion Matrix:

The DLModel-Prep using emotion_data_prep.csv gave me 63.47% accuracy.

Confusion Matrix (Prep):

ML:

The ML Algorithms used for prediction are listed as follows:

Building models using different classifiers (Count vectorizer):

Model 1: Multinomial Naive Bayes Classifier - Accuracy 58.46%
Model 2: Linear SVM - Accuracy 62.00%
Model 3: Logistic Regression - Accuracy 62.47%

Building models using different classifiers (TF-IDF vectorizer):

Model 1: Multinomial Naive Bayes Classifier - Accuracy 38.37%
Model 2: Linear SVM - Accuracy 38.49%
Model 3: Logistic Regression - Accuracy 40.13%

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
cleaned_data		cleaned_data
crawled_csv		crawled_csv
dataset		dataset
images		images
ClassMerge.ipynb		ClassMerge.ipynb
CleanData-Crawled.ipynb		CleanData-Crawled.ipynb
CleanData.ipynb		CleanData.ipynb
DLModel-Prep.ipynb		DLModel-Prep.ipynb
DLModel.ipynb		DLModel.ipynb
ExtraFunctions.ipynb		ExtraFunctions.ipynb
LICENSE		LICENSE
MLModels.ipynb		MLModels.ipynb
README.md		README.md
Setup-Prep.ipynb		Setup-Prep.ipynb
Setup.ipynb		Setup.ipynb
history-balance1.csv		history-balance1.csv
history-balance2.csv		history-balance2.csv
history-balance3.csv		history-balance3.csv
tokenizer.pickle		tokenizer.pickle
twitter_crawl.py		twitter_crawl.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-Class Text Emotion Analysis

I have two training and validation dataset:

Comparison of DL and ML models:

DL:

Confusion Matrix:

Confusion Matrix (Prep):

ML:

Building models using different classifiers (Count vectorizer):

Building models using different classifiers (TF-IDF vectorizer):

Prediction of emotions from paragraphs and sentences (DL Model):

About

Releases

Packages

Languages

License

ariesiitr/Social_Sentify

Folders and files

Latest commit

History

Repository files navigation

Multi-Class Text Emotion Analysis

I have two training and validation dataset:

Comparison of DL and ML models:

DL:

Confusion Matrix:

Confusion Matrix (Prep):

ML:

Building models using different classifiers (Count vectorizer):

Building models using different classifiers (TF-IDF vectorizer):

Prediction of emotions from paragraphs and sentences (DL Model):

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages