fix: Add "iw" variant of Hebrew language code #7752

joeyparrish · 2024-12-11T20:48:28Z

We already mapped Hebrew's "heb" to "he", but not to the alternative 2-letter code "iw".

Also reformats the map for cleaner future diffs.

joeyparrish · 2024-12-11T20:49:30Z

See also shaka-project/shaka-packager#1457 and shaka-project/shaka-packager#1458

This started as a sync against Packager, where in C++, double-quotes are required.

shaka-bot · 2024-12-11T22:08:38Z

Incremental code coverage: 100.00%

tykus160 · 2024-12-12T08:29:30Z

lib/util/language_utils.js

+  ['heb', 'he'],
+  ['heb', 'iw'],


It's a map after all, so only heb -> iw mapping will be used. Is that what we want?

Nope! Not at all. Good catch.

I'll restructure this.

Oh, wait a sec, I see the root of my confusion now.

In Shaka Player, we only ever map from 3-letter codes to 2-letter codes, to canonicalize inputs for comparison. So this extra mapping is not needed.

In Shaka Packager, we initially copied the map from Player. But Packager uses it in both directions, because to create an mp4, you need the 3-letter code in all cases. So one input with "he" and another with "iw" both need to map to "heb" when Packager creates the MP4. But then when Packager creates the MPD, it needs to follow BCP-47 and map down to the shortest possible tag.

There's an edge case here for the player, where one stream says "he" and another says "iw", and the player would currently treat them as different. This could also happen for other languages with multiple 2-letter codes. However, only one 2-letter code is canonical. In this case, "he" is the official ISO-639-2 code for Hebrew, whereas "iw" was used before standardization and still appears in some places. So I would argue that we could ignore this edge case, and leave it to packaging software to create DASH manifests with canonical language codes.

With that, I'm closing this PR.

fix: Add "iw" variant of Hebrew language code

6793207

Also reformats the map for cleaner future diffs.

joeyparrish requested a review from avelad December 11, 2024 20:48

Actual addition, in separate commit, for clarity

9d38ab3

fix lint

88fa192

This started as a sync against Packager, where in C++, double-quotes are required.

tykus160 requested changes Dec 12, 2024

View reviewed changes

joeyparrish closed this Dec 12, 2024

joeyparrish deleted the hebrew-language-code branch December 17, 2024 23:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: Add "iw" variant of Hebrew language code #7752

fix: Add "iw" variant of Hebrew language code #7752

joeyparrish commented Dec 11, 2024 •

edited

Loading

joeyparrish commented Dec 11, 2024

shaka-bot commented Dec 11, 2024

tykus160 Dec 12, 2024

joeyparrish Dec 12, 2024

joeyparrish Dec 12, 2024

fix: Add "iw" variant of Hebrew language code #7752

fix: Add "iw" variant of Hebrew language code #7752

Conversation

joeyparrish commented Dec 11, 2024 • edited Loading

joeyparrish commented Dec 11, 2024

shaka-bot commented Dec 11, 2024

tykus160 Dec 12, 2024

Choose a reason for hiding this comment

joeyparrish Dec 12, 2024

Choose a reason for hiding this comment

joeyparrish Dec 12, 2024

Choose a reason for hiding this comment

joeyparrish commented Dec 11, 2024 •

edited

Loading