Avoid error handling duplication for starred, yield, lambda expressions #10809

dhruvmanila · 2024-04-07T02:23:46Z

Summary

This PR updates the error handling logic for certain expressions in a way to either perform it automatically or provide an option for the user. The expression in discussion here are lambda, starred and yield expression.

Problem

The current parser allows these expressions at arbitrary context. This is because the mentioned expressions are parsed using parse_lhs_expression which is part of other higher level grammar rules. This means that the caller needs to validate the parsed expression and report an error if it isn't allowed in that context. This can get quite cumbersome to do so as it needs to be done for all of the call sites for following methods:

parse_expression_list: 14 references
parse_star_expression_list: 4 references
parse_star_expression_or_higher: 8 references
parse_named_expression_or_higher: 10 references
parse_conditional_expression_or_higher: 25 references
parse_simple_expression: 4 references

The numbers corresponding to the methods are the number of references as of today. This list is also in the correct hierarchy of grammar precedence. For example, parse_expression_list calls into parse_conditional_expression_or_higher but not the other way around.

Solution

We'll take the above expression one at a time to understand the solution:

Lambda expression

Lambda expressions are only allowed in expression grammar rule which corresponds to parse_conditional_expression_or_higher. This means that this expression is only allowed when using either of the following functions:

parse_expression_list
parse_star_expression_list
parse_star_expression_or_higher
parse_named_expression_or_higher
parse_conditional_expression_or_higher

The solution is to move the error handling in parse_simple_expression and parameterize it where any of the above listed function would always use AllowLambdaExpression::Yes.

Starred expression

There are two grammar rules related to starred expression:

star_expression which corresponds to parse_star_expression_or_higher
starred_expression which is parsed in LHS parsing

Remember that LHS parsing isn't accessed directly but only via any of the above listed functions in the problem section. Now, starred expressions are allowed in a lot of places but sometimes in a limited capacity. For example, an assignment target can have a starred expression but only if it is a name node (*x).

The solution here is to adopt the one used in star pattern matching which is to use a parameter. The following functions are parameterized:

parse_expression_list
parse_named_expression_or_higher
parse_conditional_expression_or_higher

Now, parse_star_expression_list and parse_star_expression_or_higher aren't parameterized because they handle the star_expression grammar which means that the caller wants to parse a starred expression but with a limited precedence.

Yield expression

Yield expressions are only allowed in the following context:

Top level as yield statement
Parenthesized
F-string expression
Assignment (including annotated and augmented) value

We could parameterize it similar to starred expression but that seems like a waste given the limited number of locations they're allowed.

The solution is to add a parse_yield_expression_or_else method which parses a yield expression if the parser is at yield token or else calls the given method to parse the expression. The call site would like:

// (yield_expr | named_expression)
self.try_parse_yield_expression()
  .unwrap_or_else(|| self.parse_named_expression_or_higher())

// (yield_expr | star_expressions)
self.try_parse_yield_expression()
  .unwrap_or_else(|| self.parse_star_expression_list())

An added benefit for this is that the call site looks exactly like the grammar.

Review

The reviewer would mainly just look at the de-duplication logic.
The reviewer doesn't really need to verify the call sites as they're verified by existing test cases. For nodes which aren't yet tested, they will be done so in their own PR.

Test Plan

Run existing test cases and verify the snapshot updates.

Additional test cases will be added when working on specific nodes.

github-actions · 2024-04-07T04:13:39Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

Formatter (stable)

✅ ecosystem check detected no format changes.