Call chain formatting in fluent style #6151

konstin · 2023-07-28T13:19:20Z

Implement fluent style/call chains. See the call_chains.py formatting for examples.
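
For readers unfamiliar with the term, here is a minimal illustration of fluent style. The `Query` class and its methods are hypothetical and exist only to give the example something to chain on; the layout shown is the general idea (one set of outer parentheses, a break before each dot), not output copied from this PR's test file.

```python
class Query:
    """Hypothetical builder, used only to have something to chain on."""

    def __init__(self, steps=()):
        self.steps = tuple(steps)

    def filter(self, condition):
        return Query(self.steps + (("filter", condition),))

    def order_by(self, key):
        return Query(self.steps + (("order_by", key),))

    def limit(self, n):
        return Query(self.steps + (("limit", n),))


# Everything on one line (short here; imagine the chain exceeding the line width).
one_line = Query().filter("age > 18").order_by("name").limit(10)

# Fluent style: one set of outer parentheses, one call per line, break before each dot.
fluent = (
    Query()
    .filter("age > 18")
    .order_by("name")
    .limit(10)
)

# Both spellings build the same object; only the layout differs.
assert one_line.steps == fluent.steps
```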

This doesn't fully match black: in raise A from B, black lets a break in A influence the formatting of B even if B is already multiline.

Similarity index:

project	main	PR
build	???	0.753
django	0.991	0.998
transformers	0.993	0.994
typeshed	0.723	0.723
warehouse	0.978	0.994
zulip	0.992	0.994

Call chain formatting is affected by #627, but i'm cutting scope here.

Closes #5343

Test Plan:

Added a dedicated call chains test file
The ecosystem checks found some bugs
I manually checked django and zulip formatting

konstin · 2023-07-28T13:19:33Z

Current dependencies on/for this PR:

main
- PR Call chain formatting in fluent style #6151 👈

This comment was auto-generated by Graphite.

github-actions · 2023-07-28T13:51:43Z

PR Check Results

Benchmark

Linux

group                                      main                                   pr
-----                                      ----                                   --
formatter/large/dataset.py                 1.00      9.3±0.02ms     4.4 MB/sec    1.11     10.3±0.09ms     3.9 MB/sec
formatter/numpy/ctypeslib.py               1.00   1885.3±8.59µs     8.8 MB/sec    1.06      2.0±0.01ms     8.3 MB/sec
formatter/numpy/globals.py                 1.02   231.6±14.37µs    12.7 MB/sec    1.00    226.2±7.00µs    13.0 MB/sec
formatter/pydantic/types.py                1.00      4.0±0.03ms     6.3 MB/sec    1.06      4.3±0.04ms     6.0 MB/sec
linter/all-rules/large/dataset.py          1.00     13.1±0.10ms     3.1 MB/sec    1.02     13.3±0.14ms     3.1 MB/sec
linter/all-rules/numpy/ctypeslib.py        1.00      3.3±0.02ms     5.0 MB/sec    1.01      3.4±0.01ms     5.0 MB/sec
linter/all-rules/numpy/globals.py          1.00    458.6±0.76µs     6.4 MB/sec    1.00    457.8±1.70µs     6.4 MB/sec
linter/all-rules/pydantic/types.py         1.00      6.0±0.05ms     4.3 MB/sec    1.01      6.0±0.02ms     4.2 MB/sec
linter/default-rules/large/dataset.py      1.00      6.5±0.24ms     6.3 MB/sec    1.05      6.8±0.04ms     6.0 MB/sec
linter/default-rules/numpy/ctypeslib.py    1.00   1356.7±7.79µs    12.3 MB/sec    1.05   1428.3±6.01µs    11.7 MB/sec
linter/default-rules/numpy/globals.py      1.01    159.5±7.98µs    18.5 MB/sec    1.00    157.3±0.61µs    18.8 MB/sec
linter/default-rules/pydantic/types.py     1.00      2.8±0.02ms     9.0 MB/sec    1.05      3.0±0.03ms     8.5 MB/sec

Windows

group                                      main                                   pr
-----                                      ----                                   --
formatter/large/dataset.py                 1.00     10.4±0.13ms     3.9 MB/sec    1.06     11.1±0.11ms     3.7 MB/sec
formatter/numpy/ctypeslib.py               1.00  1974.8±38.30µs     8.4 MB/sec    1.04      2.0±0.04ms     8.1 MB/sec
formatter/numpy/globals.py                 1.00   222.4±10.07µs    13.3 MB/sec    1.02    225.9±8.00µs    13.1 MB/sec
formatter/pydantic/types.py                1.00      4.3±0.07ms     5.9 MB/sec    1.03      4.5±0.06ms     5.7 MB/sec
linter/all-rules/large/dataset.py          1.00     14.7±0.16ms     2.8 MB/sec    1.01     14.8±0.16ms     2.8 MB/sec
linter/all-rules/numpy/ctypeslib.py        1.00      3.8±0.06ms     4.4 MB/sec    1.03      3.9±0.06ms     4.3 MB/sec
linter/all-rules/numpy/globals.py          1.00    452.9±8.58µs     6.5 MB/sec    1.02    462.0±6.85µs     6.4 MB/sec
linter/all-rules/pydantic/types.py         1.00      6.7±0.09ms     3.8 MB/sec    1.01      6.8±0.10ms     3.8 MB/sec
linter/default-rules/large/dataset.py      1.00      7.3±0.11ms     5.6 MB/sec    1.08      7.9±0.15ms     5.2 MB/sec
linter/default-rules/numpy/ctypeslib.py    1.00  1470.6±26.20µs    11.3 MB/sec    1.07  1568.9±21.62µs    10.6 MB/sec
linter/default-rules/numpy/globals.py      1.00    165.7±3.23µs    17.8 MB/sec    1.08    178.2±5.09µs    16.6 MB/sec
linter/default-rules/pydantic/types.py     1.00      3.2±0.08ms     7.9 MB/sec    1.07      3.4±0.08ms     7.4 MB/sec

konstin · 2023-07-31T07:54:17Z

Blocked on unstable formatting of

y = (
    x.a()  #
    .b()
)

y = x.a().b()  #

y = (
    x.aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa()  #
    .b()
)

y = x.aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa().b()  #
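
"Unstable" here means the formatter is not idempotent: running it on its own output changes the text again. A minimal sketch of such a check follows, assuming the formatter is available as a plain `str -> str` function. `strip_one_paren_layer` is a deliberately broken toy stand-in, not ruff's formatter, and `is_stable` is a hypothetical helper name.

```python
from typing import Callable


def is_stable(format_source: Callable[[str], str], source: str) -> bool:
    """Return True if a second formatting pass leaves the first pass's output unchanged."""
    first = format_source(source)
    second = format_source(first)
    return first == second


def strip_one_paren_layer(source: str) -> str:
    """Toy formatter (illustration only): removes a single layer of outer parentheses."""
    if source.startswith("(") and source.endswith(")"):
        return source[1:-1]
    return source


# Two layers: the first pass yields "(x)", the second yields "x", so the passes disagree.
print(is_stable(strip_one_paren_layer, "((x))"))  # False
# One layer: the first pass yields "x" and the second pass keeps it.
print(is_stable(strip_one_paren_layer, "(x)"))  # True
```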

MichaReiser

Woah, nice improvement on the compatibility!

The fluent style formatting now requires routing through the fluent style in many positions (which I like more than setting it on the context).

Have you considered instead "unrolling" the call chain in the CallExpression formatting? Meaning, we would have a single formatting that owns the whole call chain formatting without calling into format attribute and format subscript (or maybe only parts of them). I'm asking because I find it difficult to "unroll" the recursion in my head and wonder if it would be easier if the whole call chain formatting were in its own file.

MichaReiser · 2023-08-03T08:54:32Z

crates/ruff_python_formatter/src/expression/parentheses.rs

+
+ /// Switch call chain and attribute formatting to fluent style. This is otherwise identical to
+ /// `Never`, fluent style implies a set of outer parentheses
+ FluentStyle { outermost: bool },


FluentStyle doesn't fit well into the Parentheses concept, which is intended to be generally applicable to all expressions.

konstin · 2023-08-03T12:14:54Z

Good point, i made fluent style a bool that gets passed through the call chain formatting. It's still recursive but i think that's better than unrolling

konstin · 2023-08-03T13:01:10Z

This took some rotations but now it's just an is_fluent_style_call_chain function we call when formatting an expression. There is still a case we miss (not a().b().c()), but i'm happy with how it looks now.
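
To make the idea of such a predicate concrete, here is a rough sketch in Python using the standard `ast` module. It is not the PR's Rust implementation: the function names and the threshold of two attribute accesses that follow a call or subscript are assumptions made for illustration, and the actual rule in the PR may differ.

```python
import ast


def count_chained_attributes(expr: ast.expr) -> int:
    """Count `.name` accesses whose receiver is a call or a subscript."""
    count = 0
    node = expr
    while True:
        if isinstance(node, ast.Call):
            node = node.func
        elif isinstance(node, ast.Subscript):
            node = node.value
        elif isinstance(node, ast.Attribute):
            if isinstance(node.value, (ast.Call, ast.Subscript)):
                count += 1
            node = node.value
        else:
            # Anything else (a name, an operator, ...) ends the chain.
            return count


def looks_like_fluent_call_chain(expr: ast.expr) -> bool:
    # Assumed threshold: at least two attribute accesses that follow a call/subscript.
    return count_chained_attributes(expr) >= 2


def parse_expr(source: str) -> ast.expr:
    """Parse a single Python expression into its AST node."""
    return ast.parse(source, mode="eval").body


print(looks_like_fluent_call_chain(parse_expr("a().b().c()")))  # True
print(looks_like_fluent_call_chain(parse_expr("x.a().b()")))  # False
print(looks_like_fluent_call_chain(parse_expr("not a().b().c()")))  # False
```

In this sketch the last expression is not detected because the top-level node is the unary operator rather than the call. Whether that is the same reason the PR misses it is not stated in the thread.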

MichaReiser · 2023-08-03T13:04:22Z

> Good point, i made fluent style a bool that gets passed through the call chain formatting. It's still recursive but i think that's better than unrolling

Could you explain your reasoning of why?

konstin · 2023-08-03T13:29:59Z

> > Good point, i made fluent style a bool that gets passed through the call chain formatting. It's still recursive but i think that's better than unrolling
>
> Could you explain your reasoning of why?

I think consistency, mainly: every other expression is formatted from outermost to innermost, so this is no different. The main thing we do differently is to put a newline between the closing parenthesis and the dot.
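
The recursive shape described here can be sketched in Python with the standard `ast` module. In the syntax tree of a call chain the outermost node is the last call, so a recursive formatter visits the chain from outermost to innermost and only has to decide locally whether a dot follows a closing parenthesis. The function names are hypothetical, only positional arguments are rendered, and the sketch always wraps in parentheses, which the real formatter would not do.

```python
import ast


def format_chain(node: ast.expr) -> str:
    """Recursively render a call chain, breaking before a dot that follows `)`."""
    if isinstance(node, ast.Call):
        # Only positional arguments are handled, to keep the sketch short.
        args = ", ".join(ast.unparse(arg) for arg in node.args)
        return f"{format_chain(node.func)}({args})"
    if isinstance(node, ast.Attribute):
        receiver = format_chain(node.value)
        # Newline between a closing parenthesis and the following dot.
        separator = "\n    " if isinstance(node.value, ast.Call) else ""
        return f"{receiver}{separator}.{node.attr}"
    return ast.unparse(node)


def format_fluent(source: str) -> str:
    """Wrap the rendered chain in one set of outer parentheses."""
    expr = ast.parse(source, mode="eval").body
    return "(\n    " + format_chain(expr) + "\n)"


print(format_fluent("x.a().b(1).c()"))
# (
#     x.a()
#     .b(1)
#     .c()
# )
```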