A Diagnostic and Predictive Exploration of Crowdsourced Emotional Labels
This project explores emotional patterns, rater bias, and label reliability in the GoEmotions dataset using Python and data science tools.
The EDA investigates:
- Annotation inconsistency and “neutral spamming” (per-rater check sketched below)
- Emotion co-occurrence and correlation (heatmap sketch below)
- Contradictions between labels and textual signals
- Predictive performance using TF-IDF + Logistic Regression (baseline sketch below)
- Semantic structure using Word2Vec + t-SNE (projection sketch below)
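A minimal sketch of the per-rater “neutral spamming” check. It assumes the full Kaggle export (e.g. `goemotions_1.csv`) with a `rater_id` column and a binary `neutral` column; the file name and the 100-label threshold are illustrative, not the notebook's exact settings.

```python
import pandas as pd

# Assumption: the full Kaggle GoEmotions export, which includes a 'rater_id'
# column and a binary 'neutral' column; adjust names if your copy differs.
df = pd.read_csv("goemotions_1.csv")

# Share of each rater's annotations labelled neutral, for raters with enough volume.
per_rater = (
    df.groupby("rater_id")["neutral"]
      .agg(neutral_rate="mean", n_labels="count")
      .query("n_labels >= 100")          # illustrative volume threshold
      .sort_values("neutral_rate", ascending=False)
)

print(per_rater.head(10))  # candidates for "neutral spamming"
```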
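A sketch of the emotion co-occurrence analysis, assuming the same export with one binary 0/1 column per emotion; the subset of emotion columns shown is illustrative.

```python
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt

df = pd.read_csv("goemotions_1.csv")  # path is an assumption

# Illustrative subset of the 27 emotions + neutral.
emotions = ["admiration", "amusement", "anger", "annoyance", "approval",
            "gratitude", "joy", "sadness", "surprise", "neutral"]

# Pearson correlation between co-assigned binary labels.
corr = df[emotions].corr()

plt.figure(figsize=(8, 6))
sns.heatmap(corr, cmap="coolwarm", center=0)
plt.title("Emotion label correlation")
plt.tight_layout()
plt.show()
```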
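A hedged single-emotion baseline for the TF-IDF + Logistic Regression step. The notebook's actual setup may differ (for example, a multi-label One-vs-Rest wrapper over all 28 labels); the `joy` column is used here only as an example target.

```python
import pandas as pd
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import classification_report
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline

df = pd.read_csv("goemotions_1.csv")  # path is an assumption

# Binary classification for a single example emotion.
X_train, X_test, y_train, y_test = train_test_split(
    df["text"], df["joy"], test_size=0.2, random_state=42, stratify=df["joy"])

clf = make_pipeline(
    TfidfVectorizer(max_features=20_000, ngram_range=(1, 2), stop_words="english"),
    LogisticRegression(max_iter=1000, class_weight="balanced"),
)
clf.fit(X_train, y_train)
print(classification_report(y_test, clf.predict(X_test)))
```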
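A sketch of the Word2Vec + t-SNE step: train a small embedding model on the comments themselves, then project the most frequent words to 2-D. Hyperparameters are illustrative, not the notebook's exact values.

```python
import pandas as pd
import matplotlib.pyplot as plt
from gensim.models import Word2Vec
from gensim.utils import simple_preprocess
from sklearn.manifold import TSNE

df = pd.read_csv("goemotions_1.csv")  # path is an assumption
sentences = [simple_preprocess(t) for t in df["text"].astype(str)]

# Small Word2Vec model trained on the comment corpus.
w2v = Word2Vec(sentences, vector_size=100, window=5, min_count=10, workers=4, epochs=5)

# Project the 500 most frequent words into 2-D with t-SNE.
words = w2v.wv.index_to_key[:500]
coords = TSNE(n_components=2, perplexity=30, init="pca",
              random_state=42).fit_transform(w2v.wv[words])

plt.figure(figsize=(10, 8))
plt.scatter(coords[:, 0], coords[:, 1], s=5, alpha=0.5)
for i, word in enumerate(words[:100]):  # label only a subset for readability
    plt.annotate(word, (coords[i, 0], coords[i, 1]), fontsize=7)
plt.title("t-SNE projection of Word2Vec embeddings")
plt.tight_layout()
plt.show()
```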
This project was presented as the final EDA project at TovTech.
- `emotion-architecture-in-reddit-comments-by-d.ipynb` — contains the full annotated analysis, visuals, and conclusions
As part of this EDA, I studied statistics, data science, and modeling using the DataCamp platform. I learned to:
- Apply diagnostic thinking to real-world datasets
- Identify annotation bias and contradictions
- Use statistical tools to interpret NLP structures
- Communicate findings clearly under pressure
Coming from a nontraditional background, I found that this project showed me the power of clean logic, ethical scrutiny, and practical data skills.
- Source: GoEmotions on Kaggle
Feel free to reach out or fork this repo if you're interested in:
- Emotion AI quality control
- Annotator profiling
- Ethical NLP systems