-
Notifications
You must be signed in to change notification settings - Fork 25.6k
Correct rebuilding of Persian analyzer #38724
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Correct rebuilding of Persian analyzer #38724
Conversation
Make substitution of \u200C with a space explicit Closes elastic#38188
cbuescher
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks, LGTM
cbuescher
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Actually, looking at #38188 (comment) which was just added it seems other analyzers in this documentation page might have similar issues. Can you check if that failure is related and maybe fix it in the same PR?
cbuescher
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry for the noise, just doule-checked and its the same error on 7.x. I missread the line-numbers and thought they are different.
|
@cbuescher thanks a lot |
Make substitution of \u200C with a space explicit The problem with this symbol `\u200C` in a test string, that **SHOULD** be substituted with space in the rebuilt Persian analyzer, but it is not. Correcting this line `"mappings": [ "\\u200C=> "] <1>` to `"mappings": [ "\\u200C=>\\u0020"] <1>` in solves the problem. This change explicitly says to substitute ZWNJ with a space. Closes #38188
Make substitution of \u200C with a space explicit The problem with this symbol `\u200C` in a test string, that **SHOULD** be substituted with space in the rebuilt Persian analyzer, but it is not. Correcting this line `"mappings": [ "\\u200C=> "] <1>` to `"mappings": [ "\\u200C=>\\u0020"] <1>` in solves the problem. This change explicitly says to substitute ZWNJ with a space. Closes elastic#38188
Make substitution of \u200C with a space explicit The problem with this symbol `\u200C` in a test string, that **SHOULD** be substituted with space in the rebuilt Persian analyzer, but it is not. Correcting this line `"mappings": [ "\\u200C=> "] <1>` to `"mappings": [ "\\u200C=>\\u0020"] <1>` in solves the problem. This change explicitly says to substitute ZWNJ with a space. Closes #38188
Make substitution of \u200C with a space explicit The problem with this symbol `\u200C` in a test string, that **SHOULD** be substituted with space in the rebuilt Persian analyzer, but it is not. Correcting this line `"mappings": [ "\\u200C=> "] <1>` to `"mappings": [ "\\u200C=>\\u0020"] <1>` in solves the problem. This change explicitly says to substitute ZWNJ with a space. Closes elastic#38188
Make substitution of \u200C with a space explicit The problem with this symbol `\u200C` in a test string, that **SHOULD** be substituted with space in the rebuilt Persian analyzer, but it is not. Correcting this line `"mappings": [ "\\u200C=> "] <1>` to `"mappings": [ "\\u200C=>\\u0020"] <1>` in solves the problem. This change explicitly says to substitute ZWNJ with a space. Closes #38188
Make substitution of \u200C with a space explicit
The problem with this symbol
\u200Cin a test string, that SHOULD be substituted with space in the rebuilt Persian analyzer, but it is not.Correcting this line
"mappings": [ "\\u200C=> "] <1>to"mappings": [ "\\u200C=>\\u0020"] <1>inlang-analyzer solves the problem.
This change explicitly says to substitute ZWNJ with a space.
Closes #38188