-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add nori_number token filter in analysis-nori and discard_punctuation to filter #4591
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Left a comment about filters
/// For example: "nori_number" - token filter normalizes Korean numbers to regular Arabic decimal numbers in half-width characters. | ||
/// </summary> | ||
[DataMember(Name = "filter")] | ||
IEnumerable<string> Filters { get; set; } |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this should be removed. The token filters are applied at the analyzer
level (level above tokenizer
)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
++ thanks, please check 0cb682b
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM 👍
Add discard_punctuation to nori token filter
Add discard_punctuation to nori token filter
Implements elastic/elasticsearch/pull/53583