Kurdish Folkloric Lyrics Corpus

This repository contains the data for the paper entitled "A Corpus of the Sorani Kurdish Folkloric Lyrics".

The corpus is available in TEI format in KurdishLyricsCorpus.xml and, for convenience, in JSON in KurdishLyricsCorpus.json. A live version of the corpus with basic query functions can be consulted at https://kurdishblark.github.io/KurdishLyricsCorpus/.
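
As a starting point for working with the JSON file, the following is a minimal Python sketch that loads it and reports its top-level shape. The schema of KurdishLyricsCorpus.json is not described here, so the code assumes nothing about field names. The helper `describe_json` is a hypothetical name introduced for illustration, and the file is assumed to sit in the current working directory.

```python
import json
from pathlib import Path


def describe_json(path):
    """Load a JSON file and summarize its top-level structure.

    Hypothetical helper: it makes no assumption about the corpus schema
    and only reports what it finds at the top level.
    """
    with Path(path).open(encoding="utf-8") as handle:
        data = json.load(handle)
    if isinstance(data, dict):
        # Top-level object: report the key count and up to ten keys.
        return {"type": "object", "size": len(data), "keys": list(data)[:10]}
    if isinstance(data, list):
        # Top-level array: report the number of entries.
        return {"type": "array", "size": len(data), "keys": []}
    # Any other JSON value (string, number, boolean, null).
    return {"type": type(data).__name__, "size": 1, "keys": []}


# Assumption: the corpus file has been downloaded next to this script.
corpus_path = Path("KurdishLyricsCorpus.json")
if corpus_path.exists():
    print(describe_json(corpus_path))
```

Inspecting the reported keys or entry count first makes it easier to decide how to iterate over the lyrics without guessing at the structure.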

If you use any part of this corpus, please cite our paper:

@inproceedings{ahmadi2020folklyrics,
  title={A Corpus of the Sorani Kurdish Folkloric Lyrics},
  author={Ahmadi, Sina and Hassani, Hossein and Abedi, Kamaladdin},
  booktitle={Proceedings of the 1st Joint Spoken Language Technologies for Under-resourced languages ({SLTU}) and Collaboration and Computing for Under-Resourced Languages (CCURL) Workshop at the 12th International Conference on Language Resources and Evaluation (LREC)},
  date={2020-05-11},
  year={2020},
  address={Marseille, France}
}