What would you suggest, though? Is there a standard API for getting the encoding of a string? Should we convert it to UTF-8? Or maybe, if it's not in the encoding of the rest of the array, we reject it?
AFAIK there's no API to get the encoding of a string, but that would be a logical complement to codeunit/codeunits. BTW, there's no guarantee that you can call pointer on an AbstractString and get a pointer to the data: one would need to use codeunits anyway, even if the encoding matched.
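To illustrate the point about codeunit/codeunits, here is a minimal sketch using only Base functions. It shows what these calls report for a String; the example string is arbitrary. Note that the code unit type alone does not identify the encoding: a UInt8 code unit is compatible with UTF-8, Latin-1, and others.

```julia
s = "héllo"

# codeunit(s) returns the code unit *type*, not the encoding.
T = codeunit(s)

# codeunits(s) wraps the string's code units as an AbstractVector,
# without requiring a raw pointer to the underlying data.
units = codeunits(s)

# Number of code units; this differs from length(s) when the string
# contains multi-code-unit characters such as 'é' in UTF-8.
n = ncodeunits(s)

# Materialize an independent copy of the code units.
bytes = Vector{UInt8}(units)
```

For a String, T is UInt8 and the code units are the UTF-8 bytes, so 'é' accounts for two of them.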
Until there is a better API, I guess the only solution is to have a fast method for String with StringArray{<:Union{Missing, String}}, and a slower method iterating over characters for other cases.
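The two-method approach described above could look roughly like the following sketch. The function name append_utf8! and the plain Vector{UInt8} buffer are hypothetical stand-ins for illustration; this is not the actual WeakRefStrings.jl code, and it ignores Missing handling.

```julia
# Hypothetical helper, not part of WeakRefStrings.jl.

# Fast path: String is known to be UTF-8, so its code units can be
# copied into the buffer directly.
function append_utf8!(buf::Vector{UInt8}, s::String)
    append!(buf, codeunits(s))
    return buf
end

# Slow path for any other AbstractString: iterate over characters and
# re-encode each one, since the source encoding is unknown.
function append_utf8!(buf::Vector{UInt8}, s::AbstractString)
    io = IOBuffer()
    for c in s
        write(io, c)  # writing a Char to an IO emits its UTF-8 bytes
    end
    append!(buf, take!(io))
    return buf
end
```

With this arrangement, dispatch picks the fast method only when the argument is exactly a String; a SubString{String} or any custom string type falls through to the character-by-character method, which is slower but does not assume anything about the source encoding.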
These two lines don't seem correct to me for non-UTF-8 AbstractString types:
WeakRefStrings.jl/src/WeakRefStrings.jl, lines 369 to 370 in caf4ed4
Indeed this will copy the contents of the string even if it uses a different encoding from existing data.