Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Many compound parts with "-ungs" suffix #3

Open
NoZomIBK opened this issue Mar 21, 2019 · 1 comment
Open

Many compound parts with "-ungs" suffix #3

NoZomIBK opened this issue Mar 21, 2019 · 1 comment

Comments

@NoZomIBK
Copy link

i tried your dictionary and found many words with the "ungs" ending do not exist as "ung" in the dictionary (already in the original by Björn Jacke)

the problem:
"Abrechnugsverlauf" will be properly devided into "Abrechnungs","verlauf"
but
"Betragsabrechnung" will not be properly devided, because the dictionary does not contain "abrechnung".
in fact the only word i found is "fälschung/fälschungs" which has both forms.

this problem may apply on other words, too, like "arbeitens" exists, bot no "arbeiten" which would result in a wrong decompounding of "wartungsarbeiten"

@micha-heigl
Copy link

I have the same problem. Example: "Erfahrungsbericht" decompounds well to "erfahrung" and "bericht" thanks to the "erfahrungs" word in the list. But "Berufserfahrung" misses the "erfahrung"-part :(

Even stemming doesn't help as it cannot be applied to the word list but only to the decompounder-results.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants