Make Semitic mode compatible with inflection operators R, C and D #3

Open
martinec opened this issue Apr 2, 2016 · 0 comments
martinec commented Apr 2, 2016

Submitted by @eric-laporte

Feature Request

In the Unitex/GramLab code for inflection of simple words, the Semitic mode (user manual, section 3.5.4) is currently incompatible with R, C and D, three of the inflection operators described in the manual (section 3.5.1).

Current work with Prihantoro (Diponegoro University) suggests that, for the description of Indonesian, it would be useful to combine the Semitic mode and the R operator in the same inflection graphs: the Semitic mode would handle reduplication, and R would handle morphological variations of the stem. Some Indonesian words undergo both reduplication and a morphological variation of the stem.

Example

  • DELAS entry: balik
  • DELAF entry: balak-balik
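
To make the expected result concrete, here is a minimal Python sketch of the derivation in the example above. It is only an illustration of the target output, not Unitex code, and it does not model the semantics of the Semitic mode or of the R, C and D operators. The helper names `vary_stem` and `reduplicate_with_variation`, and the choice of passing the changed position as an index, are assumptions made for this sketch.

```python
def vary_stem(stem: str, index: int, replacement: str) -> str:
    """Return a copy of `stem` with the character at `index` replaced.

    Hypothetical helper: stands in for the morphological variation of the stem.
    """
    return stem[:index] + replacement + stem[index + 1:]


def reduplicate_with_variation(lemma: str, index: int, replacement: str) -> str:
    """Build a reduplicated form whose first copy carries the stem variation.

    Hypothetical helper: stands in for reduplication combined with variation.
    """
    variant = vary_stem(lemma, index, replacement)
    return f"{variant}-{lemma}"


# DELAS entry "balik": replacing the vowel at index 3 gives the variant "balak",
# which is then joined to the unchanged lemma.
print(reduplicate_with_variation("balik", 3, "a"))  # balak-balik
```

The point of the sketch is that the inflected form needs both steps at once: the variant alone (balak) or the reduplication alone is not enough to produce balak-balik.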

The C and D operators are likely to be useful in the same conditions.

I suggest the Semitic mode should be made compatible with the R, C and D operators.

It is already compatible with the L, <R=?>, <I=?> and <X=n> operators.

In the attached logs, the dictionary invokes 2 inflection graphs:

  • V232LLak-ref.grf without the Semitic mode,
  • V232LLak.grf in the Semitic mode.

The first graph produces the morphological variants (e.g. balak) but not the reduplication.
The second graph produces the reduplication, but not all of the morphological variants.

Thanks,

Eric
