Roman Urdu Hate-speech and Offensive Language Detection

Embeddings

The embeddings could be found at: https://drive.google.com/drive/folders/1_ZeoYMyBTb2sROeKmxS0quVrbvJBrct2?usp=sharing

Note that ver1 is trained on 0.3 million tweets only while ver2 is trained on 4.7 million tweets.

label_definitions.txt contains the mapping for the labels for both tasks (i.e., coarsegrained and finegrained labels).

Reference

@inproceedings{rizwan2020hate,
  title={Hate-speech and offensive language detection in roman Urdu},
  author={Rizwan, Hammad and Shakeel, Muhammad Haroon and Karim, Asim},
  booktitle={Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)},
  pages={2512--2522},
  year={2020}
}

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
Hate_Words_RomanUrdu_Lower.txt		Hate_Words_RomanUrdu_Lower.txt
LICENSE		LICENSE
README.md		README.md
label_definitions.txt		label_definitions.txt
task_1_test.tsv		task_1_test.tsv
task_1_train.tsv		task_1_train.tsv
task_1_validation.tsv		task_1_validation.tsv
task_2_test.tsv		task_2_test.tsv
task_2_train.tsv		task_2_train.tsv
task_2_validation.tsv		task_2_validation.tsv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Roman Urdu Hate-speech and Offensive Language Detection

Embeddings

Reference

About

Releases

Packages

License

haroonshakeel/roman_urdu_hate_speech

Folders and files

Latest commit

History

Repository files navigation

Roman Urdu Hate-speech and Offensive Language Detection

Embeddings

Reference

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages