Text classification for SemEval 2014 task 9

Complete problem description can be found here: http://alt.qcri.org/semeval2014/task9/
Complete data is dowloaded from here: http://alt.qcri.org/semeval2017/task4/?id=download-the-full-training-data-for-semeval-2017-task-4

Description of each notebook:

prepare-data-csv.ipynb: Using raw data txt files create pandas dataframe
Data cleaning and EDA.ipyb: Cleaning raw text and some Exploratory data analysis on our data
Modelling.ipynb: CNN model for text classification task

Dataset details:

	Positive	Negative	Nuetral
Total	3640	1458	4586
Train	2919	1166	3662
Test	721	292	924

Model architecture for text classification task:

Model inspired from here: https://arxiv.org/abs/1610.08815

We've used GloVe embeddings trained on twitter dataset downloaded from here: https://nlp.stanford.edu/projects/glove/

Model Results

Classification report:

Metric Plot:

Loss Plot:

Conclusion

We've achieved an F1-score of 0.6205 on test dataset

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
data		data
images		images
models		models
Data cleaning and EDA.ipynb		Data cleaning and EDA.ipynb
Modelling.ipynb		Modelling.ipynb
README.md		README.md
emo_unicode.py		emo_unicode.py
prepare-data-csv.ipynb		prepare-data-csv.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Text classification for SemEval 2014 task 9

Description of each notebook:

Dataset details:

Model architecture for text classification task:

Model Results

Conclusion

About

Releases

Packages

Languages

NamanJain2050/semeval-2014-task-9

Folders and files

Latest commit

History

Repository files navigation

Text classification for SemEval 2014 task 9

Description of each notebook:

Dataset details:

Model architecture for text classification task:

Model Results

Conclusion

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages