Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RFC libicu Unicode support #722

Closed
wants to merge 2 commits into from
Closed

Conversation

rolandwalker
Copy link
Contributor

As discussed in #679, add configure --with-icu to compile with support for libicu, and as an example, change contains_uppercase() to Unicode-aware string_contains_uppercase().

The pros of libicu are that it is deep, complete, well-tested, and up-to-date.

The cons are that it contains multiple overlapping/conflicting APIs, and that UTF8 is generally treated as second-class compared to UTF16.

In the example function we are able to use one of the UTF8-specific APIs cleanly without recoding.

Linking with libpcre might also be required to get utf8-aware search. libicu knows about regular expressions but only exposes an interface for C++ and Java.

@rolandwalker rolandwalker force-pushed the libicu4c branch 6 times, most recently from b8685fc to 6855d4c Compare August 16, 2017 12:51
jonas added a commit that referenced this pull request May 21, 2018
jonas added a commit that referenced this pull request May 21, 2018
@jonas jonas closed this in #827 May 21, 2018
jonas added a commit that referenced this pull request May 21, 2018
@rolandwalker rolandwalker deleted the libicu4c branch May 21, 2018 13:54
rolandwalker pushed a commit to rolandwalker/tig that referenced this pull request May 21, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant