This thesis aims to contribute to research in affective computing and to provide a holistic analysis of text-based emotion recognition from the perspective of Applied and Computational Linguistics. We examine linguistic features, annotation schemes, categorical and dimensional emotion models, and commonly used research datasets with different linguistic styles. Deep neural network architectures serve as the main prediction systems, since deep learning has achieved major breakthroughs and state-of-the-art results on a large number of Natural Language Processing tasks (Young et al. 2018).

The schematic thesis overview spans analyses, tasks and implications for (1) datasets, (2) emotion models and (3) algorithms.

Overview

Emotion Models

Categorical emotion models – emotions are represented as distinct, mutually exclusive categories (e.g. the basic emotions anger, fear, joy, ...)
Dimensional emotion models – emotions are represented in a two- or multidimensional space (e.g. valence and arousal)

Model demo: Predicting basic emotions

In this demo you can try out a fine-tuned checkpoint of DistilRoBERTa-base by Jochen Hartmann directly in the browser. The model was trained on 6 diverse datasets (see Appendix below) and predicts Ekman's 6 basic emotions plus a neutral class:

anger 🤬
disgust 🤢
fear 😨
joy 😀
neutral 😐
sadness 😭
surprise 😲

Demo by Suzana Ilic

Model reference: Jochen Hartmann, "Emotion English DistilRoBERTa-base". https://huggingface.co/j-hartmann/emotion-english-distilroberta-base/, 2022.
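
Outside the browser demo, the same checkpoint can also be queried locally. The following is a minimal sketch, assuming the Hugging Face `transformers` library is installed and the model can be downloaded from the Hub. The helper names `load_classifier`, `top_emotion` and `demo` are hypothetical and not part of this repository.

```python
MODEL_ID = "j-hartmann/emotion-english-distilroberta-base"


def load_classifier(model_id=MODEL_ID):
    """Build a text-classification pipeline that returns a score for every class."""
    # Imported lazily so top_emotion() can be used without transformers installed.
    from transformers import pipeline

    # top_k=None asks the pipeline to return all labels, not only the best one.
    return pipeline("text-classification", model=model_id, top_k=None)


def top_emotion(scores):
    """Pick the highest-scoring entry from a list of {"label", "score"} dicts."""
    best = max(scores, key=lambda item: item["score"])
    return best["label"], best["score"]


def demo(text="I love this!"):
    """Classify one sentence; downloads the model on first use."""
    classifier = load_classifier()
    # Assumption: for a list input, recent transformers versions return
    # one list of {"label", "score"} dicts per input text.
    results = classifier([text])
    return top_emotion(results[0])
```

Calling `demo()` should return one of the seven labels listed above together with its score; the exact scores depend on the installed library and model versions.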

Datasets

Dataset I – Facebook posts
Dataset II – Media headlines (SemEval 2007)
Dataset III – Dialogue (SemEval 2019)

Notebooks

EDA
Transformers

Exploratory data analysis for emotion datasets (text)

The goal of exploratory data analysis for emotion datasets is to understand the corpus: its linguistic style, lexical elements and syntax, as well as the annotation scheme and the distribution of classes, including a check for class imbalance (or, where emotions are annotated with scores, an analysis of those scores).
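
The class distribution and imbalance check mentioned above can be sketched in a few lines. This is an illustrative example using only the Python standard library; the function names and the toy labels are assumptions made for this sketch and are not taken from the notebooks or datasets.

```python
from collections import Counter


def class_distribution(labels):
    """Return {label: (count, share)} ordered from most to least frequent."""
    counts = Counter(labels)
    total = sum(counts.values())
    return {label: (n, n / total) for label, n in counts.most_common()}


def imbalance_ratio(labels):
    """Ratio of the most frequent to the least frequent class (1.0 = balanced)."""
    counts = Counter(labels).values()
    return max(counts) / min(counts)


# Toy labels for illustration only; not drawn from any of the datasets.
toy_labels = ["joy", "joy", "joy", "anger", "sadness", "joy"]
dist = class_distribution(toy_labels)
ratio = imbalance_ratio(toy_labels)
```

A ratio far above 1.0 suggests that plain accuracy may be misleading and that per-class metrics or resampling are worth considering.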

Dataset I

Dataset: 2,894 Facebook posts annotated with scores for valence and arousal, each on an integer scale from 1 to 9
EDA
Model (BERT, RoBERTa) using Simple Transformers

Task: Regression
Paper: Modelling valence and arousal in facebook posts (2016)
References:
Preoţiuc-Pietro, D., Schwartz, H. A., Park, G., Eichstaedt, J., Kern, M., Ungar, L., & Shulman, E. (2016): Modelling valence and arousal in facebook posts. In Proceedings of the 7th workshop on computational approaches to subjectivity, sentiment and social media analysis (pp. 9-15).

Dimensional emotion model based on the circumplex model (valence and arousal) by James A. Russell (1980): A Circumplex Model of Affect. Journal of Personality and Social Psychology, 39(6), 1161–1178.
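
As a rough illustration of the regression task for Dataset I, the sketch below prepares text and score pairs and sets up a Simple Transformers model with a single continuous output. It is not the notebook's actual code: the helper names `to_regression_rows` and `build_regressor` are hypothetical, the `roberta-base` checkpoint is an assumed default, and it assumes one model is trained per dimension (one for valence, one for arousal).

```python
def to_regression_rows(posts, scores, low=1.0, high=9.0):
    """Pair each post with a float label, rejecting scores outside [low, high]."""
    if len(posts) != len(scores):
        raise ValueError("posts and scores must have the same length")
    rows = []
    for text, score in zip(posts, scores):
        value = float(score)
        if not low <= value <= high:
            raise ValueError(f"score {value} outside [{low}, {high}]")
        rows.append([text, value])
    return rows


def build_regressor(model_type="roberta", model_name="roberta-base"):
    """Create a Simple Transformers model with one regression output."""
    # Lazy import: requires the simpletransformers package.
    from simpletransformers.classification import (
        ClassificationArgs,
        ClassificationModel,
    )

    model_args = ClassificationArgs()
    model_args.regression = True
    # num_labels=1 together with regression=True yields one continuous output.
    return ClassificationModel(
        model_type, model_name, num_labels=1, args=model_args, use_cuda=False
    )
```

The rows can be turned into a pandas DataFrame with the columns `text` and `labels` and passed to the model's `train_model` method.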
