Further subdivide punctuation types based on the W3C Requirements for Chinese Text Layout #129

Closed
Jinjiang opened this issue Dec 17, 2023 · 0 comments · Fixed by #131

Jinjiang commented Dec 17, 2023

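Punctuation marks can be further subdivided into:

  1. Pause marks (点号: comma, period, enumeration comma, colon, semicolon, exclamation mark, question mark): only these marks are matched by the noSpaceBeforePunctuation / spaceAfterHalfWidthPunctuation / noSpaceAfterFullWidthPunctuation rules
    • Stacked exclamation and question marks need to be handled, e.g. “!!!”, “???”, “!?”; this also covers forms encoded as a single Unicode character, e.g. “⁇”
  2. Indicator marks (标号)
    1. Quotation marks and brackets: matched by spaceOutsideHalfQuote / noSpaceOutsideFullQuote / noSpaceInsideQuote and spaceOutsideHalfBracket / noSpaceOutsideFullBracket / noSpaceInsideBracket respectively
      • Quotation marks: single/double curly quotes and single/double corner quotes
      • Brackets: Type A brackets (甲式括号, i.e. round parentheses), lenticular brackets (方头括号), white lenticular brackets (空心方头括号), tortoise shell brackets (六角括号), and curly braces (花括号); Type B brackets (乙式括号) can technically be handled as dashes
    2. Other indicator marks: dash, ellipsis, emphasis mark (着重号), connector mark (连接号), interpunct (间隔号), book title mark (书名号), proper noun mark (专名号), solidus (分隔号). For these marks:
      • No special handling for now. This is open to further discussion, e.g. whether to remove the spaces on both sides, or whether book title marks should be handled like quotation marks;
      • Some of these marks do not produce an actual character and may instead exist in the content as a kind of styling. For example, the emphasis mark is a solid round dot placed below horizontal text. Usage and format of such marks are to be discussed later
      • One additional, atypical punctuation mark: the 示亡号 (a mark indicating that a named person is deceased); it likewise produces no actual character and is to be discussed later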
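
To make the proposed split concrete, here is a minimal sketch of how the categories above could map onto the rule names. Everything except the rule names is an assumption for illustration: the `PunctuationKind` type, the character tables, `classifyPunctuation`, and `findStackedMarks` are hypothetical and not taken from the existing codebase, and the character tables are not exhaustive.

```typescript
// Hypothetical category names for the split proposed in this issue.
type PunctuationKind =
  | "pause"
  | "quotation"
  | "bracket"
  | "other-indicator"
  | "not-punctuation";

// Illustrative, non-exhaustive character tables (full-width and half-width).
// The last four pause marks are single-code-point stacked forms: U+203C, U+2047, U+2048, U+2049.
const PAUSE_MARKS = ",。、:;!?,.:;!?\u203C\u2047\u2048\u2049";
const QUOTATION_MARKS = "“”‘’「」『』";
const BRACKETS = "()()【】〖〗〔〕{}";
// Dash (U+2014), ellipsis (U+2026), interpunct (U+00B7), book title marks, solidus.
const OTHER_INDICATORS = "\u2014\u2026\u00B7《》〈〉/";

// Rule names are the ones listed in this issue; "other" marks get no rules for now.
const RULES_BY_KIND: Record<PunctuationKind, string[]> = {
  pause: [
    "noSpaceBeforePunctuation",
    "spaceAfterHalfWidthPunctuation",
    "noSpaceAfterFullWidthPunctuation",
  ],
  quotation: ["spaceOutsideHalfQuote", "noSpaceOutsideFullQuote", "noSpaceInsideQuote"],
  bracket: ["spaceOutsideHalfBracket", "noSpaceOutsideFullBracket", "noSpaceInsideBracket"],
  "other-indicator": [],
  "not-punctuation": [],
};

// True only when `char` is exactly one character found in `table`.
function isIn(table: string, char: string): boolean {
  return char.length === 1 && table.indexOf(char) !== -1;
}

// Classify a single character into one of the proposed categories.
function classifyPunctuation(char: string): PunctuationKind {
  if (isIn(PAUSE_MARKS, char)) return "pause";
  if (isIn(QUOTATION_MARKS, char)) return "quotation";
  if (isIn(BRACKETS, char)) return "bracket";
  if (isIn(OTHER_INDICATORS, char)) return "other-indicator";
  return "not-punctuation";
}

// Find runs of exclamation/question marks so that a stacked sequence
// can be treated as one pause mark instead of several.
function findStackedMarks(text: string): string[] {
  return text.match(/[!?!?\u203C\u2047\u2048\u2049]+/g) || [];
}
```

For example, `classifyPunctuation("、")` returns `"pause"`, `classifyPunctuation("《")` returns `"other-indicator"`, and `findStackedMarks("真的吗???太好了!!!什么!?")` returns `["???", "!!!", "!?"]`. Marks that exist only as styling (emphasis mark, proper noun mark, 示亡号) are deliberately left out, since they have no character to classify.
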
Jinjiang linked a pull request on Dec 24, 2023 that will close this issue
Jinjiang mentioned this issue on Jan 2, 2024