Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Conversion from duffus to tlo does not handle diacritics correctly #9

Open
2 of 5 tasks
kbseah opened this issue Apr 27, 2024 · 0 comments
Open
2 of 5 tasks
Labels
bug Something isn't working

Comments

@kbseah
Copy link
Member

kbseah commented Apr 27, 2024

parsetc version
parsetc 0.4.1
lark 1.1.9
unicodedata unidata_version 13.0.0

How did you install parsetc?

  • Using pip install from PyPI
  • Using pip install . from cloned repository, if so, which branch and commit? master 75b10b4
  • Other: please describe

Describe the error
cat examples/teochew.duffus.txt | parsetc -l Teochew -i duffus --all

  • parsetc command crashed with an error:
  • Output romanization is wrong
tlo	lâi-sî pêh-be2 kua3 kim-uann1
duffus	lâi-sî pėh-be2 kua3 kim-uaⁿ1

Input text

lai5-si5 peh8-be2 kua3 kim1-uaⁿ1

Expected output

lâi-sî pêh-bé kùa kim-uann

Additional information

@kbseah kbseah added the bug Something isn't working label Apr 27, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant