This repository contain the data of the paper entitled "A Corpus of the Sorani Kurdish Folkloric Lyrics".
The corpus is available in TEI format in KurdishLyricsCorpus.xml
. For the sake of convenience, the corpus is also provided in JSON in KurdishLyricsCorpus.json
. A live version of the corpus can be consulted at https://kurdishblark.github.io/KurdishLyricsCorpus/ with basic query functions.
If you’re using any part of this corpus, please don’t forget to cite our paper:
@inproceedings{ahmadi2020folklyrics,
title={A Corpus of the Sorani Kurdish Folkloric Lyrics},
author={Ahmadi, Sina and Hassani, Hossein and Abedi, Kamaladdin},
booktitle={Proceedings of the 1st Joint Spoken Language Technologies for Under-resourced languages ({SLTU}) and Collaboration and Computing for Under-Resourced Languages (CCURL) Workshop at the 12th International Conference on Language Resources and Evaluation (LREC)},
date="2020-05-11",
year={2020},
address= "Marseille, France"
}