Lockfiles: when checking for staleness, only use current context's interpreter constraints #12542

Eric-Arellano · 2021-08-10T21:10:46Z

Background

For tool lockfiles, #12448 uses the rules to get the PythonLockfileRequest for the tool to check if the current lockfile has become stale.

Some tool lockfile requests are tricky to set up because interpreter constraints depend on the user's code, such as how the ICs used for Flake8 and Pytest depend on what the user has. This has a severe performance impact: when running any of these tools, like running tests on a single file, we must first check if the lockfile is stale by consulting the entire repository to determine its interpreter constraints. That is slow to scan/compute, and means tests will be invalidated much more infrequently.

Proposal

Instead, when checking a lockfile for staleness, we can simply check that the current context's interpreter constraints are compatible with the lockfile's constraints, rather than identical. This would avoid entirely the performance concern.

Tips for implementation

The main challenge will be adding code to interpreter_constraints.py to compare if two interpreter constraints are compatible with each other or not: checking if one is a subset of the other.

Likely the easiest way to implement this is to leverage that we have users define for us already what the finite universe of Python interpreters should be, e.g. Pythons 3.5-3.10. You can use this function to generate all possible versions:

pants/src/python/pants/backend/python/util_rules/interpreter_constraints.py

Lines 179 to 183 in ecf85d7

    
           def _valid_patch_versions(self, major: int, minor: int) -> Iterator[int]: 
        
               for p in range(0, _EXPECTED_LAST_PATCH_VERSION + 1): 
        
                   for req in self: 
        
                       if req.specifier.contains(f"{major}.{minor}.{p}"):  # type: ignore[attr-defined] 
        
                           yield p

(Even better, a PR I'm about to put up to add InterpreterConstraints.flatten adds a PR that computes all valid Py2 and Py3 versions for the ICs. You can use that.)

Then, use set methods to check that one is a subset of the other.

--

We'll also need to refactor #12448 to no longer use the rule to determine the lockfile request, but instead compute what the request would be for that current context (e.g. lockfile for just that test).

The lockfile header will also probably need to start preserving the original interpreter_constraints string, and we'll need a way to serialize/deserialize that.

The text was updated successfully, but these errors were encountered:

chrisjrn · 2021-08-10T21:33:30Z

I'm trying to figure out what "compatible" means here.

The definition I'm thinking of says:

The user's constraints, and the lockfile's constraints are non-disjoint (i.e. lockable_constraints := (user_constraints & lockfile_constraints) != EMPTY)
lockable_constraints matches at least one interpreter that is available on the current machine

Does that line up with your understanding of the constraints?

Eric-Arellano · 2021-08-10T21:37:15Z

I was thinking the mathematical definition of subset: the context's ICs must be completely contained by the lockfiles's ICs. That is, every patch version in the context must be in the lockfile's possible patch versions.

It need not be a "proper subset", it's valid if the context ICs == lockfile ICs.

chrisjrn · 2021-08-12T15:33:02Z

@Eric-Arellano Thoughts on including platforms as part of the requirements here? Or is that for a later date?

Eric-Arellano · 2021-08-12T15:40:32Z

If we go with Poetry, then it doesn't matter what platform you generate from. The lockfile should handle windows, Linux, and macOS. No need to invalidate based on platform.

(Still thinking about John's question on handling --platform when set in a pex_binary. Will put up some thoughts, but tldr is I think this issue will not need to do anything.)

chrisjrn · 2021-08-12T15:40:48Z

Cool, I'll ignore that for now.

chrisjrn · 2021-08-12T17:28:00Z

(Completion of this task is blocked on merging #12448)

…r constraints, rather than global constraints (#12566) Part 2 of 3 for #12542. Still to do: update call sites to leverage this change so that they no longer compute their global constraints.

Eric-Arellano assigned chrisjrn Aug 10, 2021

Eric-Arellano mentioned this issue Aug 10, 2021

[internal] Add InterpreterConstraints.flatten() in preparation for Poetry lockfiles #12543

Merged

chrisjrn mentioned this issue Aug 12, 2021

Set default invalid lockfile behaviour to error #12552

Closed

Eric-Arellano mentioned this issue Aug 12, 2021

Lockfile invalidation consumption #12448

Merged

chrisjrn mentioned this issue Aug 13, 2021

Use interpreter constraints to verify lockfile environment, rather than invalidation inputs #12566

Merged

Eric-Arellano closed this as completed in #12566 Aug 17, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Lockfiles: when checking for staleness, only use current context's interpreter constraints #12542

Lockfiles: when checking for staleness, only use current context's interpreter constraints #12542

Eric-Arellano commented Aug 10, 2021

chrisjrn commented Aug 10, 2021

Eric-Arellano commented Aug 10, 2021

chrisjrn commented Aug 12, 2021

Eric-Arellano commented Aug 12, 2021

chrisjrn commented Aug 12, 2021

chrisjrn commented Aug 12, 2021

Lockfiles: when checking for staleness, only use current context's interpreter constraints #12542

Lockfiles: when checking for staleness, only use current context's interpreter constraints #12542

Comments

Eric-Arellano commented Aug 10, 2021

Background

Proposal

Tips for implementation

chrisjrn commented Aug 10, 2021

Eric-Arellano commented Aug 10, 2021

chrisjrn commented Aug 12, 2021

Eric-Arellano commented Aug 12, 2021

chrisjrn commented Aug 12, 2021

chrisjrn commented Aug 12, 2021