Normalizing Japanese punctuation per issue #2337. #2648

Merged (1 commit), Jan 14, 2025

Conversation

martindholmes
Contributor

Not sure who to choose to review this -- I don't see any native Japanese speakers in the list of people -- but it's very basic stuff and I can explain it to anyone.

@martindholmes martindholmes self-assigned this Jan 3, 2025
@ebeshero ebeshero added this to the Guidelines 4.9.0 milestone Jan 11, 2025
Member

@sydb sydb left a comment

According to Wikipedia, the IDEOGRAPHIC COMMA (U+3001) is used in Japanese where we use COMMA (U+002C) as a separator of items in a list. All of these appear to be that kind of case.

Furthermore, in all 5 cases of `//*[@xml:lang eq 'ja'][contains( ., ',')]` the U+002C does not seem to be a list item separator.

(That said, I think an xml:lang="en" should be added to #MEASUREMENT-egXML-cw. This need not be part of this PR, of course.)
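
For anyone who wants to repeat that check without an XPath 2.0 processor, here is a rough Python sketch of the same test: elements that carry xml:lang="ja" directly and whose string value contains U+002C. The function name and the sample document are made up for illustration and are not taken from the Guidelines source; the standard-library ElementTree does not support `eq` or `contains()`, so the predicate is written out by hand.

```python
import xml.etree.ElementTree as ET

# Clark-notation name that ElementTree uses for the xml:lang attribute.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# Hypothetical sample data, NOT taken from the TEI Guidelines source.
SAMPLE = """<doc>
  <p xml:lang="ja">りんご、みかん、ぶどう</p>
  <p xml:lang="ja">価格は1,000円です</p>
  <p xml:lang="en">apples, oranges, grapes</p>
</doc>"""


def ja_elements_with_ascii_comma(xml_text):
    """Return elements with xml:lang="ja" whose text contains U+002C.

    Only an xml:lang attribute set on the element itself is tested
    (no inheritance), and the text of descendants is included,
    which approximates //*[@xml:lang eq 'ja'][contains(., ',')].
    """
    root = ET.fromstring(xml_text)
    hits = []
    for elem in root.iter():
        if elem.get(XML_LANG) == "ja" and "\u002C" in "".join(elem.itertext()):
            hits.append(elem)
    return hits


# List the candidates so a human can decide whether each comma
# is a list separator or something else (e.g. a digit grouping).
for elem in ja_elements_with_ascii_comma(SAMPLE):
    print(elem.tag, "".join(elem.itertext()))
```

Each hit still needs a human look, since the script cannot tell a list separator from, say, a thousands separator.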

@sydb sydb merged commit ba43ca5 into dev Jan 14, 2025
3 checks passed
@sydb sydb deleted the issue-2337-ja-punc branch January 14, 2025 20:01