You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[gagolews@zeus tmp]$ LC_ALL="pl_PL.iso-8859-2" R
R Under development (unstable) (2016-04-14 r70486) -- "Unsuffered Consequences"
Copyright (C) 2016 The R Foundation for Statistical Computing
Platform: x86_64-pc-linux-gnu (64-bit)
> library("stringi")
> x <- stri_conv("a\u0105bc", "UTF-8", "")
> library(re2r)
> re2_match("\u0105", x)
[1] FALSE
> re2_match(x, "\u0105")
B��D: invalid UTF-8 in regexp:
> stri_extract_all_regex(x, "\u0105") # this is OK
[[1]]
[1] "�"
consider converting all input strings to utf8, preferably with `stringi::stri_enc_toutf8``
gagolews
changed the title
TO DO: test when LC_COLLATE is not UTF-8
Match failure when LC_COLLATE is not UTF-8
Apr 23, 2016
e.g., Windows does not have a UTF-8 locale set by default
The text was updated successfully, but these errors were encountered: