Skip to content

cl-tohoku/J-UniMorph

Repository files navigation

J-UniMorph

Dataset of UniMorph in Japanese. The file jpn is the data we created for UniMorph, and jpn_with_hits.txt is the data before filtering with the number of hits for exact match search.

これは,UniMorphの日本語版データセットです. ファイルjpnはUniMorph用に作成したデータであり,jpn_with_hits.txtは完全一致検索のヒット数でフィルタリングする前のデータです.

Citation

We have published a description paper on arXiv.

arXivに説明論文を投稿しています.

@article{matsuzaki2024junimorph,
      title={J-UniMorph: Japanese Morphological Annotation through the Universal Feature Schema}, 
      author={Kosuke Matsuzaki and Masaya Taniguchi and Kentaro Inui and Keisuke Sakaguchi},
      year={2024},
      eprint={2402.14411},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

J-UniMorph Visualizer

We developed J-UniMorph Visualizer, which takes an inflected form as the input and provides the UniMorph labels of its form. This makes manual analysis of J-UniMorph easier. We hope it will be useful for Japanese learners. You can use it from the following link.

https://matsukosuke.github.io/inflection_tool/inflection.html?lang=en

意味分析・語形変換ツールを作成しました.これは,日本語のさまざまな語形の意味を調べられるツールです. 以下のリンクから利用できます. 日本語学習者にとって有益なものであることを期待しております.

https://matsukosuke.github.io/inflection_tool/inflection.html

Copyright and License

This dataset is published under the CC BY 4.0.

このデータセットは CC BY 4.0 の下で公開されています.

Contact

E-mail: matsuzaki.kosuke.r7 (at) dc.tohoku.ac.jp
※Replace the (at) above with "@". / 上記の (at) を「@」に置き換えてください.

(c) 2024 Kosuke Matsuzaki All Rights Reserved.

About

Dataset of UniMorph in Japanese

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published