GeorgeFloydTopicDetection

In this repository, you can find the tweet ids and features extracted for the hand labeled 5k dataset.

In the Obfuscated Data zip file, you can find the xlsx, json and csv formats of the obfuscated data. In these files, ID means the tweet ids; which are also in the ids folder, which means the user can only crawl the needed dataset.

In order to reference and learn more about the dataset, you can use the following bibtex and link to read the paper:

@article{kemik2021blm,
  title={BLM-17m: A Large-Scale Dataset for Black Lives Matter Topic Detection on Twitter},
  author={Kemik, Hasan and {\"O}zate{\c{s}}, Nusret and Asgari-Chenaghlu, Meysam and Cambria, Erik},
  journal={arXiv preprint arXiv:2105.01331},
  year={2021}
}

Link to the paper.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
ids		ids
labels		labels
.gitignore		.gitignore
LICENSE		LICENSE
Obfuscated_Data.zip		Obfuscated_Data.zip
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GeorgeFloydTopicDetection

Contributors

About

Releases

Packages

License

SenticNet/BLM

Folders and files

Latest commit

History

Repository files navigation

GeorgeFloydTopicDetection

Contributors

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages