Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update .NET 8 Unicode data to version 15.0.0 #75711

Closed
Tracked by #78518
GrabYourPitchforks opened this issue Sep 15, 2022 · 1 comment · Fixed by #80804
Closed
Tracked by #78518

Update .NET 8 Unicode data to version 15.0.0 #75711

GrabYourPitchforks opened this issue Sep 15, 2022 · 1 comment · Fixed by #80804
Labels
area-System.Globalization enhancement Product code improvement that does NOT require public API changes/additions
Milestone

Comments

@GrabYourPitchforks
Copy link
Member

GrabYourPitchforks commented Sep 15, 2022

The Unicode Standard version 15.0.0 was released this week. As per usual, since the .NET runtime carries a copy of Unicode-derived data, we should update our data files to match this version.

This will affect the following APIs:

  • System.Globalization.StringInfo
  • System.Globalization.CharUnicodeInfo
  • System.Text.Encodings.Web.*
  • System.Text.Json.* (since it depends on System.Text.Encodings.Web)

For instructions on how to update the runtime-carried Unicode data files, consult the GenUnicodeProp docs and the STEW docs. Also update the UnicodeUcdVersion data throughout our .csproj files (see samples).

See #44423 for the changes we made to support Unicode 14.0.0 in .NET 7.

There are no interesting changes to UAX #29 for this version of Unicode. Our tools will automatically pick up any changes to a code point's Grapheme_Cluster_Break property, but there should be no changes required to the core logic within TextSegmentationUtility.cs.

@GrabYourPitchforks GrabYourPitchforks added enhancement Product code improvement that does NOT require public API changes/additions area-System.Globalization labels Sep 15, 2022
@GrabYourPitchforks GrabYourPitchforks added this to the 8.0.0 milestone Sep 15, 2022
@ghost
Copy link

ghost commented Sep 15, 2022

Tagging subscribers to this area: @dotnet/area-system-globalization
See info in area-owners.md if you want to be subscribed.

Issue Details

The Unicode Standard version 15.0.0 was released this week. As per usual, since the .NET runtime carries a copy of Unicode-derived data, we should update our data files to match version 14.0.0 when it's released.

This will affect the following APIs:

  • System.Globalization.StringInfo
  • System.Globalization.CharUnicodeInfo
  • System.Text.Encodings.Web.*
  • System.Text.Json.* (since it depends on System.Text.Encodings.Web)

For instructions on how to update the runtime-carried Unicode data files, consult the GenUnicodeProp docs and the STEW docs. Also update the UnicodeUcdVersion data throughout our .csproj files (see samples).

See #44423 for the changes we made to support Unicode 14.0.0 in .NET 7.

There are no interesting changes to UAX #29 for this version of Unicode. Our tools will automatically pick up any changes to a code point's Grapheme_Cluster_Break property, but there should be no changes required to the core logic within TextSegmentationUtility.cs.

Author: GrabYourPitchforks
Assignees: -
Labels:

enhancement, area-System.Globalization

Milestone: 8.0.0

@ghost ghost added the in-pr There is an active PR which will close this issue when it is merged label Jan 18, 2023
@ghost ghost removed the in-pr There is an active PR which will close this issue when it is merged label Jan 20, 2023
@ghost ghost locked as resolved and limited conversation to collaborators Feb 19, 2023
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
area-System.Globalization enhancement Product code improvement that does NOT require public API changes/additions
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant