
Conversation

@dcreager (Member) commented:
This saga began with a regression in how we handle constraint sets where a typevar is constrained by another typevar, which #21068 first added support for:

def mutually_constrained[T, U]():
    # If [T = U ∧ U ≤ int], then [T ≤ int] must be true as well.
    given_int = ConstraintSet.range(U, T, U) & ConstraintSet.range(Never, U, int)
    static_assert(given_int.implies_subtype_of(T, int))

While working on #21414, I saw a regression in this test, which was strange, since that PR has nothing to do with this logic! The issue is that something in that PR made us instantiate the typevars T and U in a different order, giving them differently ordered salsa IDs. And importantly, we use these salsa IDs to define the variable ordering that is used in our constraint set BDDs. This showed that our "mutually constrained" logic only worked for one of the two possible orderings. (We can — and now do — test this in a brute-force way by copy/pasting the test with both typevar orderings.)

The underlying bug was in our ConstraintSet::simplify_and_domain method. It would correctly detect (U ≤ T ≤ U) ∧ (U ≤ int), because those two constraints affect different typevars, and from that, infer T ≤ int. But it wouldn't detect the equivalent pattern in (T ≤ U ≤ T) ∧ (U ≤ int), since those constraints affect the same typevar. At first I tried adding that as yet more pattern-match logic in the ever-growing simplify_and_domain method. But doing so caused other tests to start failing.
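To make the missed pattern concrete, here is a toy saturation pass in Python (all names are hypothetical; the real logic lives in Rust inside ty's `simplify_and_domain`). It models a constraint as a `(lower, var, upper)` triple and propagates bounds across an equivalence in *both* directions: the direction the original pattern match handled, and the one it missed.

```python
# Toy illustration only: the real constraint representation is ty-internal.
TYPEVARS = {"T", "U"}

def derive_transitive(facts):
    """Saturate a set of (lower, var, upper) constraints.

    A constraint (X <= var <= X) where X is a typevar means var = X,
    so any bound on X also applies to var, and vice versa. The bug was
    handling only one of these two directions.
    """
    facts = set(facts)
    changed = True
    while changed:
        changed = False
        for (lo, var, hi) in list(facts):
            if lo == hi and lo in TYPEVARS:
                eq = lo  # var = eq
                for (lo2, v2, hi2) in list(facts):
                    candidates = []
                    if v2 == eq:
                        candidates.append((lo2, var, hi2))  # bound on eq -> var
                    if v2 == var:
                        candidates.append((lo2, eq, hi2))   # bound on var -> eq
                    for new in candidates:
                        if new not in facts:
                            facts.add(new)
                            changed = True
    return facts

# Both orderings of the pattern now yield T <= int:
f1 = derive_transitive({("U", "T", "U"), ("Never", "U", "int")})
f2 = derive_transitive({("T", "U", "T"), ("Never", "U", "int")})
```

With this symmetric rule, `("Never", "T", "int")` (i.e. T ≤ int) appears in both `f1` and `f2`, which is exactly the behavior the one-sided pattern match failed to provide for the second ordering.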

At that point, I realized that simplify_and_domain had gotten to the point where it was trying to do too much, and for conflicting consumers. It was first written as part of our display logic, where the goal is to remove redundant information from a BDD to make its string rendering simpler. But we also started using it to add "derived facts" to a BDD. A derived fact is a constraint that doesn't appear in the BDD directly, but which we can still infer to be true. Our failing test relies on derived facts — being able to infer that T ≤ int even though that particular constraint doesn't appear in the original BDD. Before, simplify_and_domain would trace through all of the constraints in a BDD, figure out the full set of derived facts, and add those derived facts to the BDD structure. This is brittle, because those derived facts are not universally true! In our example, T ≤ int only holds along the BDD paths where both T = U and U ≤ int. Other paths will test the negations of those constraints, and on those, we shouldn't infer T ≤ int. In theory it's possible (and we were trying) to use BDD operators to express that dependency...but that runs afoul of how we were simultaneously trying to remove information to make our displays simpler.

So, I ripped off the band-aid: simplify_and_domain is now only used for display purposes. Apart from removing some logic that our Display impl definitely doesn't use, I've left it alone; I didn't want to disturb that house of cards for now, since the display logic is not load-bearing for any type inference logic.

For all non-display callers, we have a new sequent map data type, which tracks exactly the same derived information. But it does so (a) without trying to remove anything from the BDD, and (b) lazily, without updating the BDD structure.
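As a rough mental model (field and method names here are hypothetical; the real sequent map is a Rust type inside ty), a sequent map can be pictured as a mapping from a set of given facts to the consequents they justify, consulted lazily rather than baked into the BDD structure:

```python
from collections import defaultdict

class SequentMap:
    """Toy sketch of a sequent map (names hypothetical).

    Maps a frozenset of antecedent constraints ("given facts") to the
    consequents derivable from them. Sequents with the same lhs merge,
    which is how multi-consequent sequents arise.
    """

    def __init__(self):
        self.implications = defaultdict(list)

    def add(self, givens, consequent):
        # Each sequent we construct has a single consequent; merging
        # by lhs is what can produce multiple consequents per key.
        key = frozenset(givens)
        if consequent not in self.implications[key]:
            self.implications[key].append(consequent)

    def derived(self, facts):
        """Lazily report consequents whose givens all hold, without
        mutating any underlying structure."""
        facts = set(facts)
        out = []
        for givens, consequents in self.implications.items():
            if givens <= facts:
                out.extend(consequents)
        return out

m = SequentMap()
m.add({"T = U", "U <= int"}, "T <= int")
```

Note how this captures the path-dependence described above: `m.derived({"T = U", "U <= int"})` yields `T <= int`, but along a path where only `T = U` holds (or where either given is negated), nothing is derived.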

So the end result is that all of the tests (including the new regression tests) pass, via a more efficient (and hopefully better structured and documented) implementation, at the cost of hanging onto a pile of display-related tech debt that we'll want to clean up at some point.

@dcreager added the internal (an internal refactor or improvement) and ty (multi-file analysis & type inference) labels on Nov 14, 2025
astral-sh-bot bot commented Nov 14, 2025

Diagnostic diff on typing conformance tests

No changes detected when running ty on typing conformance tests ✅

astral-sh-bot bot commented Nov 14, 2025

mypy_primer results

No ecosystem changes detected ✅

No memory usage changes detected ✅

@sharkdp (Contributor) left a comment:

Not a full review yet, but I might not have enough time to review the rest today, so sending two initial questions.

/// pruned from the search), and new constraints that we can assume to be true even if we haven't
/// seen them directly.
///
/// We support several kinds of sequent:
Contributor comment:

Just curious about the terminology here. The special thing about the equivalent term in logic seems to be that there can be multiple "consequents" on the right hand side. But none of the cases listed below seem to have that form?

@dcreager (Member, Author) replied:

The representation does allow for multiple consequents (there is a Vec in the single_implications and pair_implications maps). But we never construct one directly, since each sequent that we construct corresponds to an implication where we infer one fact from something else we already know. When that single-consequent sequent is folded into the rest of the SequentMap, though, it might get combined with other sequents that have the same lhs, producing a combined sequent with multiple consequents. (The consequents of a sequent are ORed, so if there are multiple derived facts that we can infer from the same set of given facts, we can choose which of them we need to use.)

Base automatically changed from dcreager/coolable to main November 17, 2025 18:43
@dcreager dcreager force-pushed the dcreager/ordering-bug branch from 6fe0caa to 1b94bbd Compare November 17, 2025 18:47
codspeed-hq bot commented Nov 17, 2025

CodSpeed Performance Report

Merging #21463 will not alter performance

Comparing dcreager/ordering-bug (a87c3d9) with main (e4a32ba)

Summary

✅ 22 untouched
⏩ 30 skipped¹

Footnotes

  1. 30 benchmarks were skipped, so the baseline results were used instead.

@sharkdp (Contributor) left a comment:

Fantastic in-code documentation and writeup. I have the feeling that I understood roughly what is going on and it seems to make a lot of sense to me, but I'll also admit that my understanding is most certainly not deep enough to provide any meaningful input on the overall design. But all of the individual pieces seem consistent to me.

@dcreager dcreager merged commit f67236b into main Nov 18, 2025
41 checks passed
@dcreager dcreager deleted the dcreager/ordering-bug branch November 18, 2025 17:02
dcreager added a commit that referenced this pull request Nov 19, 2025
We were previously normalizing the upper and lower bounds of each
constraint when constructing constraint sets. Like in #21463, this was
for conflated reasons: It made constraint set displays nicer, since we
wouldn't render multiple constraints with obviously equivalent bounds.
(Think `T ≤ A & B` and `T ≤ B & A`.) But it was also useful for
correctness, since prior to #21463 we were (trying to) add the full
transitive closure to a constraint set's BDD, and normalization gave a
useful reduction in the number of nodes in a typical BDD.

Now that we don't store the transitive closure explicitly, that second
reason is no longer relevant. Our sequent map can store that full
transitive closure much more efficiently than the expanded BDD would
have. This helps fix some false positives on #20933, where we're seeing
some assignability failures between a type and its normalization (those
failures are incorrect and need to be fixed, but ideally shouldn't block
this effort).

Normalization is still useful for display purposes, and so we do
normalize the upper/lower bounds before building up our display
representation of a constraint set BDD.
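As an illustration of the display-side normalization (a toy sketch; the real normalization operates on ty's internal type representation, not on strings), canonicalizing the member order of an intersection is enough to make bounds like `A & B` and `B & A` render identically:

```python
def normalize_intersection(bound: str) -> str:
    """Toy display-time normalization (hypothetical helper): sort
    intersection members so obviously equivalent bounds display the same."""
    members = [m.strip() for m in bound.split("&")]
    return " & ".join(sorted(members))

canonical = normalize_intersection("B & A")
```

Here `normalize_intersection("B & A")` and `normalize_intersection("A & B")` both produce `"A & B"`, so the display layer renders a single constraint instead of two visually distinct but equivalent ones.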

---------

Co-authored-by: David Peter <sharkdp@users.noreply.github.com>