Here we provide a data set of tweets which have been annotated for hate speech.
We provide the ID and the annotation in a tab seperated file (annotation.tsv
). To obtain the individual tweets, use the Twitter API of your choice and query for the ID's provided.
If using NAACL_SRW_2016.csv
please cite using:
@InProceedings{waseem-hovy:2016:N16-2,
author = {Waseem, Zeerak and Hovy, Dirk},
title = {Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter},
booktitle = {Proceedings of the NAACL Student Research Workshop},
month = {June},
year = {2016},
address = {San Diego, California},
publisher = {Association for Computational Linguistics},
pages = {88--93},
url = {http://www.aclweb.org/anthology/N16-2013}
}
If using NLP+CSS_2016.csv
please cite using:
@InProceedings{waseem:2016:NLPandCSS,
author = {Waseem, Zeerak},
title = {Are You a Racist or Am I Seeing Things? Annotator Influence on Hate Speech Detection on Twitter},
booktitle = {Proceedings of the First Workshop on NLP and Computational Social Science},
month = {November},
year = {2016},
address = {Austin, Texas},
publisher = {Association for Computational Linguistics},
pages = {138--142},
url = {http://aclweb.org/anthology/W16-5618}
}