[FEEDBACK] bidi clarifications #958

macchiati · 2024-11-22T23:41:03Z

We had a discussion around the implications of bidi isolation, in formatting.md. I'm capturing some items here for post 46.1

General issue. While it is clear that Default Bidi Strategy is the correct strategy in the general case, the algorithm seems to preclude various optimizations. It should make it clear that an implementation is conformant to the DBS if the bidi ordering it produces for any formatted pattern is identical to the bidi ordering produced for that pattern by the DBS.
It should also make clear than an implementation may generate equivalent results for other environments. Eg, when generating HTML, it could produce bidi markdown instead of the Unicode bidi control characters. This might be covered by #formatting, but best to have it specifically called out.
Details
1. In #handling-bidirectional-text we have "Let msgdir be the directionality of the whole message," but it is not defined.
2. In #formatting-context we have "Information on the base directionality of the message and its text tokens. This will be used by strategies for bidirectional isolation, and can be used to set the base direction of the message upon display." That sounds like it is supposed to define msgdir — or at least the connection between them needs to be clear.
3. "Let fmt be the formatted string representation of the resolved value of exp." should be "Let fmt be the formatted string representation of exp."

A related issue: in #formatting it should make it clear that callers of implementations cannot rely on the literal text in a pattern being preserved in the formatted pattern. That is, an implementation could change the literal text, such as improving the result of {{You have an {$item} in your basket.}} based on the value of $item, eg "You have an apple in your basket." vs "You have a pear in your basket.".

eemeli · 2024-11-25T09:37:53Z

It's quite intentional that the default bidi strategy does not allow for optimizations, as it's meant to produce the same output in different implementations. This enables e.g. rehydration to work well by having server and client code produce the exact same output. We do allow for other strategies or variants to be provided, which may perform any such optimizations:

message-format-wg/spec/formatting.md

Lines 924 to 927 in 849db9c

    
           Implementations MUST provide the _Default Bidi Strategy_ as one of the  
        
           _bidirectional isolation strategies_. 
        
           Implementations MAY provide other _bidirectional isolation strategies_.

Note that we're not defining any explicit HTML or other non-string formatting output in the spec. We got somewhat close to defining a formatted-parts output, but ultimately decided not to define it here (it is defined in the JS spec, though). Therefore, to enable properly isolated HTML to be produced from MF2, we have at least this:

message-format-wg/spec/formatting.md

Lines 894 to 896 in 849db9c

    
           If an implementation supports formatting to something other than a string 
        
           (such as a sequence of parts), 
        
           the directionality of each formatted _placeholder_ needs to be available to the caller.

Agreed, some of these references should be clarified a bit.

Re: changing literal text, that sounds like something that ought to be done as post-processing to the MF2 output. After this was discussed earlier, I ended up implementing a PoC hackyFixArticles function in the JS messageformat test suite that applies this correction, to show how it could be done with formatted parts.

macchiati · 2024-11-25T17:32:01Z

It's quite intentional that the default bidi strategy does not allow for optimizations, as it's meant to produce the same output in different implementations.

But there is no guarantee that two different implementations will produce the same result for almost any placeholder with a function. So bidi "compatibility" would not at all guarantee "the same output in different implementations". So that forces implementations to have an option to produce the 'heavy' version of bidi control insertion, even if what most clients will want is the 'light' version (which produces the same results).

As for the HTML and literal text, the main point is that an implementation's MF2 APIs should be able to have options for those. So we need to make sure that the spec doesn't exclude that.

eemeli · 2024-11-26T09:17:23Z

We're not looking to guarantee the same results, but to enable them. If there's a way for a user to get the exact same bidi isolation with two different implementations and at least one of them allows for its function handlers to be user-customizable, it becomes possible to have the same function handler behaviour in both implementations, and for the outputs to match.

Also, we do include this directive:

message-format-wg/spec/formatting.md

Lines 828 to 829 in ec9089d

    
           Implementations SHOULD encourage users to consider a formatted localised string 
        
           as an opaque data structure, suitable only for presentation.

Following that, it should not matter if the output includes more isolation than strictly necessary.

As for the HTML and literal text, the main point is that an implementation's MF2 APIs should be able to have options for those. So we need to make sure that the spec doesn't exclude that.

The spec does not exclude those possibilities. We explicitly call out potential support for not only HTML syntax, but also DOM fragments, and we do not establish any upper bound for what the formatted output might look like or what transforms could be applied to it.

macchiati added the Preview-Feedback Feedback gathered during the technical preview label Nov 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEEDBACK] bidi clarifications #958

[FEEDBACK] bidi clarifications #958

macchiati commented Nov 22, 2024 •

edited

Loading

eemeli commented Nov 25, 2024

macchiati commented Nov 25, 2024

eemeli commented Nov 26, 2024

[FEEDBACK] bidi clarifications #958

[FEEDBACK] bidi clarifications #958

Comments

macchiati commented Nov 22, 2024 • edited Loading

eemeli commented Nov 25, 2024

macchiati commented Nov 25, 2024

eemeli commented Nov 26, 2024

macchiati commented Nov 22, 2024 •

edited

Loading