This thesis aims to contribute to research efforts in the field of affective computing and to provide a holistic analysis of text-based emotion recognition from the perspective of Applied and Computational Linguistics. We will examine linguistic features, annotation schemes, categorical and dimensional emotion models, as well as commonly used research datasets with different linguistic styles, and focus on deep neural network architectures as the main prediction systems, since deep learning has achieved major breakthroughs and state-of-the-art results for a large number of tasks in the field of Natural Language Processing (Young et al. 2018). Schematic thesis overview that spans analyses, tasks and implications for (1) datasets, (2) emotion models and (3) algorithms:
- Categorical emotion models – emotions are represented as distinct, mutually exclusive categories (e.g. the basic emotions anger, fear, joy, ...)
- Dimensional emotion models – emotions are represented in a two- or multidimensional space (e.g. valence and arousal)
In this demo you can try out directly in the browser a fine-tuned checkpoint of DistilRoBERTa-base by Jochen Hartmann. The model was trained on 6 diverse datasets (see Appendix below) and predicts Ekman's 6 basic emotions, plus a neutral class:
- anger 🤬
- disgust 🤢
- fear 😨
- joy 😀
- neutral 😐
- sadness 😭
- surprise 😲
Demo by Suzana Ilic
Model reference: Jochen Hartmann, "Emotion English DistilRoBERTa-base". https://huggingface.co/j-hartmann/emotion-english-distilroberta-base/, 2022.
- Dataset I – Facebook posts
- Dataset II – Media headlines (SemEval 2007)
- Dataset III – Dialogue (SemEval 2019)
- EDA
- Transformers
The goal of exploratory data analyses for emotion datasets is to get an understanding of the corpus, the linguistic style, lexical elements, syntax as well as the annotation scheme, distribution and imbalance check of classes (or analyses of scores).
- Dataset: 2,894 Facebook posts annotated with scores for valence and arousal on an integer scale from 1-9 repsectively
- EDA
- Model (BERT, RoBERTa) using Simple Transformers
Task: Regression
Paper: Modelling valence and arousal in facebook posts (2016)
References:
Preoţiuc-Pietro, D., Schwartz, H. A., Park, G., Eichstaedt, J., Kern, M., Ungar, L., & Shulman, E. (2016): Modelling valence and arousal in facebook posts. In Proceedings of the 7th workshop on computational approaches to subjectivity, sentiment and social media analysis (pp. 9-15).
Dimensonal Emotion model based on the circumplex model (valence and arousal) by James A. Russell (1980): A Circumplex Model of Affect. Journal of Personality and Social Psychology (39,6:1161–1178).