introduce encoding_is_utf8() internal #2781
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
apparently _nl_normalize_charset() of glibc removes everything but alnums, and converts to lowercase.
replace strcmp() against "UTF-8" with encoding_is_utf8(), which matches this behavior, and supports the optional "cs" ("character set") prefix.
Closes #2780, reported by @barracuda156.
See https://github.com/bminor/glibc/blob/5f72f9800b250410cad3abfeeb09469ef12b2438/intl/l10nflist.c#L330-L334 and https://serverfault.com/questions/616744/is-there-a-difference-between-en-us-utf8-and-en-us-utf-8