[Feature Request]: Add Korean morphological analyzer to support Korean for full text search #1228

Closed
1 task done
yingfeng opened this issue May 20, 2024 · 1 comment
Labels
feature request New feature or request

Comments

yingfeng commented May 20, 2024

Is there an existing issue for the same feature request?

  • I have checked the existing issues.

Describe the feature you'd like

nori-clone is a good candidate for the Korean morphological analyzer (KMA) to be added.

Describe implementation you've considered

However, it has several drawbacks and needs to be refactored:

  1. Remove the dependency on abseil.
  2. Remove the dependency on protobuf.
  3. Remove the dependency on snappy, and replace the snappy-compressed dictionary with a zlib-compressed one, to stay consistent with JMA.
  4. Upload the raw dictionary files to the resource repo, which already contains the dictionary files for Chinese and Japanese.
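
As a rough illustration of item 3, the following is a minimal sketch of a zlib round trip for a dictionary blob, using Python's standard `zlib` module. The function names `pack_dictionary` and `unpack_dictionary` and the sample rows are hypothetical; this is not the on-disk format used by JMA or nori-clone, only the general idea of compressing raw dictionary bytes with zlib instead of snappy.

```python
import zlib


def pack_dictionary(raw: bytes, level: int = 9) -> bytes:
    """Compress raw dictionary bytes with zlib (hypothetical helper)."""
    return zlib.compress(raw, level)


def unpack_dictionary(blob: bytes) -> bytes:
    """Restore the raw dictionary bytes from a zlib-compressed blob."""
    return zlib.decompress(blob)


# Made-up sample rows, repeated so the compression effect is visible.
# They do not follow the real mecab-ko-dic or nori-clone layout.
raw = "가게,NNG\n가격,NNG\n".encode("utf-8") * 100

blob = pack_dictionary(raw)
restored = unpack_dictionary(blob)

print(len(raw), len(blob), restored == raw)
```

The round trip is lossless, so the loader only needs zlib at read time and the snappy dependency can be dropped.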

Documentation, adoption, use case

No response

Additional information

No response

@yingfeng yingfeng added feature request New feature or request good first issue Good for newcomers labels May 20, 2024
@yingfeng yingfeng removed the good first issue Good for newcomers label Sep 11, 2024
@yingfeng
Member Author

MeCab can be used directly to support Korean. Build the dictionary resources according to https://bitbucket.org/eunjeon/mecab-ko-dic
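
For reference, here is a minimal sketch of how MeCab's default output (one token per line as surface, a tab, then comma-separated features, terminated by `EOS`) could be turned into search tokens. The `Token` type, the `parse_mecab_output` helper, and the sample lines are assumptions for illustration: the sample is hand-written rather than real analyzer output, and the code assumes the first feature field is the POS tag.

```python
from typing import List, NamedTuple


class Token(NamedTuple):
    surface: str  # the token text as it appears in the input
    pos: str      # first feature field, assumed to be the POS tag


def parse_mecab_output(text: str) -> List[Token]:
    """Parse MeCab default-format output into (surface, pos) tokens."""
    tokens = []
    for line in text.splitlines():
        # Skip blank lines and the end-of-sentence marker.
        if not line or line == "EOS":
            continue
        surface, _, features = line.partition("\t")
        pos = features.split(",", 1)[0]
        tokens.append(Token(surface, pos))
    return tokens


# Hand-written, illustrative sample in the MeCab default layout.
SAMPLE = (
    "한국어\tNNG,*,F,한국어,*,*,*,*\n"
    "검색\tNNG,*,T,검색,*,*,*,*\n"
    "EOS\n"
)

tokens = parse_mecab_output(SAMPLE)
print([t.surface for t in tokens])
```

A full-text search pipeline would typically index the surface forms and could use the POS tag to drop particles or endings, though which tags to keep is a design choice not settled in this issue.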

yingfeng added a commit that referenced this issue Sep 11, 2024
### What problem does this PR solve?

Use the libmecab embedded within ijma. The dictionary is generated
according to the instructions at
https://bitbucket.org/eunjeon/mecab-ko-dic

Issue link: #1228

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
@yingfeng yingfeng changed the title [Feature Request]: Add Korea morphological analyzer to support Korea for full text search [Feature Request]: Add Korean morphological analyzer to support Korea for full text search Sep 24, 2024
@yingfeng yingfeng changed the title [Feature Request]: Add Korean morphological analyzer to support Korea for full text search [Feature Request]: Add Korean morphological analyzer to support Korean for full text search Sep 24, 2024