Relative Date Time Format fails with locale en-US and non-Latn numbering system #5204

sven-oly · 2024-07-09T01:09:36Z

In ICU conformance testing with this data, the expected result has Arabic digits. This appears to be a problem in inheritance for en-US

{"test_type": "rdt_fmt", "label":"704","unit":"second","count":"-100","locale":"en-US","options":{"numberingSystem":"arab"}}

Actual result: {"actual_options":"DataLocale{en-US-u-nu-arab}","label":"704","result":"100 seconds ago"}

Expected "١٠٠ seconds ago"

Similar with Bengali digits:
{"test_type": "rdt_fmt", "label":"1056","unit":"second","count":"-100","locale":"en-US","options":{"numberingSystem":"beng"}}

Actual result: {"actual_options":"DataLocale{en-US-u-nu-beng}","label":"1056","result":"100 seconds ago"}

Expected: "১০০ seconds ago"

Similar with Adlam digits:
{"test_type": "rdt_fmt", "label":"1408","unit":"second","count":"-100","locale":"en-US","options":{"numberingSystem":"adlm"}}

Actual result: {"actual_options":"DataLocale{en-US-u-nu-adlm}","label":"1408","result":"100 seconds ago"}

Expected: "𞥑𞥐𞥐 seconds ago"

sffc · 2024-07-09T03:34:39Z

I think it's FixedDecimalFormatter in general, not just RelativeTimeFormatter.

-u-nu-arab only loads data for locales that have a -u-nu-arab. However, not all locales have it, and datagen only exports it when it is in the CLDR data for that locale.

It does raise the question, do we really want/need to support en-u-nu-arab if CLDR doesn't provide data for it? ICU and others support arbitrary numbering systems with Latin symbols. But does that make sense?

sven-oly · 2024-07-09T23:46:13Z

I agree that we should verify if using any numbering system with any locale is useful. The tests show that Arabic, Adlam, and Bengali digits do appear with conforming locales.

sffc added T-bug Type: Bad behavior, security, privacy C-numbers Component: Numbers, units, currencies labels Jul 9, 2024

sffc added the discuss Discuss at a future ICU4X-SC meeting label Jul 9, 2024

sffc added this to the 2.x Priority ⟨P2⟩ milestone Jul 23, 2024

Manishearth added the U-ecma402 User: ECMA-402 compatibility label Nov 20, 2024

Manishearth self-assigned this Nov 20, 2024

Manishearth mentioned this issue Dec 18, 2024

Allow overriding the numbering system during decimal format #5914

Merged

sffc mentioned this issue Dec 18, 2024

Initial check-in of ICU4X 2.0-Beta1 unicode-org/conformance#363

Merged

Manishearth closed this as completed in #5914 Dec 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Relative Date Time Format fails with locale en-US and non-Latn numbering system #5204

Relative Date Time Format fails with locale en-US and non-Latn numbering system #5204

sven-oly commented Jul 9, 2024

sffc commented Jul 9, 2024

sven-oly commented Jul 9, 2024

Relative Date Time Format fails with locale en-US and non-Latn numbering system #5204

Relative Date Time Format fails with locale en-US and non-Latn numbering system #5204

Comments

sven-oly commented Jul 9, 2024

sffc commented Jul 9, 2024

sven-oly commented Jul 9, 2024