Skip to content

Latest commit

 

History

History
107 lines (93 loc) · 3.41 KB

README.md

File metadata and controls

107 lines (93 loc) · 3.41 KB

syc

31,972 lexemes in Classical Syriac and their inflectional forms annotated according to Sylak-Glassman (2016)

License

https://creativecommons.org/licenses/by-sa/3.0/

Caveats

  • Includes 2,740 unvocalised entries from 2019 UniMorph dataset, plus an additional 29,232 vocalised entries
  • Contractions (e.g., 'can not' > 'can't') excluded
  • Alienable (ALN) and inalienable (NALN) possession not marked
  • Clitics included as independent particles. (En/Pro)clitics not marked
  • Homomorphs/homonyms included
  • Syriac adjective participles marked as Verbal Participles (V.PTCP) following Sylak-Glassman (2016)
  • Syriac denominatives excluded (UniMorph unclassified)
  • Words may contain unmarked prefixes (e.g., prepositional b-) and particles (e.g., relative marker d-)
  • Historical linguistic developments (e.g., redundancy of DEF, fossilisation of PSSD, etc) disregarded
  • All entries unique

Author

Charbel El-Khaissi (Australian National University)

Acknowledgements

Special thank-you extended to Dr George Kiraz and Beth Mardutho: The Syriac Institute for making the raw data from SEDRA database available.

Shared Tasks

2021: SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages

References


@inproceedings{pimentel-ryskina-etal-2021-sigmorphon,
    title = "SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages",
    author = "Pimentel, Tiago  and
      Ryskina, Maria  and
      Mielke, Sabrina J.  and
      Wu, Shijie  and
      Chodroff, Eleanor  and
      Leonard, Brian  and
      Nicolai, Garrett  and
      Ghanggo Ate, Yustinus  and
      Khalifa, Salam  and
      Habash, Nizar  and
      El-Khaissi, Charbel  and
      Goldman, Omer  and
      Gasser, Michael  and
      Lane, William  and
      Coler, Matt  and
      Oncevay, Arturo  and
      Montoya Samame, Jaime Rafael  and
      Silva Villegas, Gema Celeste  and
      Ek, Adam  and
      Bernardy, Jean-Philippe  and
      Shcherbakov, Andrey  and
      Bayyr-ool, Aziyana  and
      Sheifer, Karina  and
      Ganieva, Sofya  and
      Plugaryov, Matvey  and
      Klyachko, Elena  and
      Salehi, Ali  and
      Krizhanovsky, Andrew  and
      Krizhanovsky, Natalia  and
      Vania, Clara  and
      Ivanova, Sardana  and
      Salchak, Aelita  and
      Straughn, Christopher  and
      Liu, Zoey  and
      Washington, Jonathan North  and
      Ataman, Duygu  and
      Kiera{\'s}, Witold  and
      Woli{\'n}ski, Marcin  and
      Suhardijanto, Totok  and
      Stoehr, Niklas  and
      Nuriah, Zahroh  and
      Ratan, Shyam  and
      Tyers, Francis M.  and
      Ponti, Edoardo M.  and
      Aiton, Grant  and
      Hatcher, Richard J.  and
      Prud'hommeaux, Emily  and
      Kumar, Ritesh  and
      Hulden, Mans  and
      Barta, Botond  and
      Lakatos, Dorina  and
      Szolnok, G{\'a}bor  and
      {\'A}cs, Judit  and
      Raj, Mohit  and
      Yarowsky, David  and
      Cotterell, Ryan  and
      Ambridge, Ben  and
      Vylomova, Ekaterina",
    booktitle = "Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology",
    month = aug,
    year = "2021",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.sigmorphon-1.25",
    doi = "10.18653/v1/2021.sigmorphon-1.25",
    pages = "229--259"
}