Download wheels in batch at the end of .prepare_linked_requirements_more() #8896

cosmicexplorer · 2020-09-22T09:34:17Z

This PR corresponds to the second work section described in #7819 (comment) from the much-too-large #8448:

move out "download entire file" out of prepare (and hence, Resolver.resolve)

With some modifications to that prompt, it now consists of:

Create a method RequirementPreparer._complete_partial_requirements() to fully download and prepare lazily-downloaded wheels.
Call that new method at the bottom of .prepare_linked_requirements_more(), separating the batch download from the rest of that method.
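As a rough illustration of the split described above (not the code in this PR), a minimal sketch might look like the following. `Requirement`, `Preparer`, and `fake_batch_download` are hypothetical stand-ins invented for the example; only the two method names come from the description.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List


@dataclass
class Requirement:
    """Hypothetical stand-in for a requirement object; not pip's real class."""
    name: str
    url: str
    needs_more_preparation: bool = False  # True when only metadata was fetched lazily
    local_path: str = ""


class Preparer:
    """Hypothetical, heavily simplified stand-in for RequirementPreparer."""

    def __init__(self, batch_download: Callable[[List[str]], Dict[str, str]]) -> None:
        # Assumed interface: takes a list of URLs, returns a URL -> local path mapping.
        self._batch_download = batch_download

    def _complete_partial_requirements(self, partial: List[Requirement]) -> None:
        # "Download" phase: fetch every lazily-downloaded wheel in one batch.
        paths = self._batch_download([req.url for req in partial])
        for req in partial:
            req.local_path = paths[req.url]
            req.needs_more_preparation = False

    def prepare_linked_requirements_more(self, reqs: Iterable[Requirement]) -> None:
        # "Collect" phase: gather the requirements that still need a full download.
        partial = [req for req in reqs if req.needs_more_preparation]
        # The batch download is the last step, kept separate from the collection.
        self._complete_partial_requirements(partial)


def fake_batch_download(urls: List[str]) -> Dict[str, str]:
    """Pretend downloader for the example; performs no network access."""
    return {url: "/tmp/" + url.rsplit("/", 1)[-1] for url in urls}
```

Keeping the download behind one method means a different download strategy (or none at all) only has to replace that single call.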

cosmicexplorer · 2020-09-23T07:23:52Z

cc @pradyunsg @McSinyx wrote a tweet about how much I loved your work on the lazy wheel fetching: https://twitter.com/hipsterelectron/status/1308306011409661954?s=20

McSinyx · 2020-09-23T10:39:48Z

I'm sorry to disappoint you, the lazy wheel isn't making download any faster at the moment. I suppose the other use-case (pip resolve) is possible with extra steps and this is what you're trying to do here.

Regarding the news file, since pip's internal API is internal, I think this should be flagged as trivial. Concerning the approach, I can recall discussions with @pradyunsg and @chrahunt resulting in keeping the late download in the resolver (at least until the legacy resolver is removed, to keep the consistency of output by the two resolvers), in particular, within RequirementPreparer (see GH-8685). IIRC @pradyunsg told me that it's preferable to pass a dry_run option to the resolver instead; the reason pip resolve has not been implemented is rather that we don't know exactly what output format would be most helpful.

Side comment: the only use of PartialRequirementDownloadCompleter is to initialize it and then call its only method, so I would love to see it changed into a function if it turns out that my memory of the general approach described above isn't correct.
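To make the side comment concrete, here is a small sketch of the general pattern, under the assumption that the class holds no state beyond its constructor arguments. `DownloadCompleter` and `complete_downloads` are hypothetical names for illustration and do not mirror the PR's actual class.

```python
from typing import Callable, Iterable, List


class DownloadCompleter:
    """Hypothetical class whose only use is: construct it, then call one method."""

    def __init__(self, download: Callable[[str], str]) -> None:
        self._download = download

    def complete(self, urls: Iterable[str]) -> List[str]:
        return [self._download(url) for url in urls]


def complete_downloads(download: Callable[[str], str], urls: Iterable[str]) -> List[str]:
    """Equivalent plain function: same inputs, same result, no object to build."""
    return [download(url) for url in urls]


def fake_download(url: str) -> str:
    """Pretend downloader for the example; performs no network access."""
    return "downloaded:" + url
```

Both forms produce the same result for the same arguments; the function simply drops the intermediate object.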

pradyunsg

Me likey!

news/8896.feature

src/pip/_internal/commands/download.py

That was the wrong shortcut, on the wrong window. :)

src/pip/_internal/commands/install.py

pradyunsg

Me likey the general clarity that factoring out the batch download logic brings.

As @McSinyx mentioned, until the older resolver dies, we'd want to keep the "download everything else" inside the resolver -- maintaining the abstraction of "everything is ready". While the "prepare more" logic is indeed a no-op with the old resolver, it'll be nice to keep it inside to be able to reason about these things more easily until that old one goes away.

Let's keep the download logic inside the resolver.resolve call, and add a flag to tell the resolver whether to fully-prepare partially prepared items. :)

pradyunsg · 2020-09-23T19:46:27Z

I made a mess of the review here -- but that's what I get for reviewing at 1am. :)

cosmicexplorer · 2020-09-24T05:47:18Z

I'm sorry to disappoint you, the lazy wheel isn't making download any faster at the moment.

You couldn't possibly disappoint me!!! Sounds like it's time for some profiling 🔍 . Sorry for disappearing off the face of the earth for so long! Your adaptation was much much cleaner than what I had and that is a much better place to start off than the other way around (at least, in my opinion, in this case).

I suppose the other use-case (pip resolve) is possible with extra steps and this is what you're trying to do here.

So my thought process was actually just that parallel downloading would be the key to actually improving the download performance!! And if not, I was thinking a further optimization would be to keep alive the connections used to download the metadata, in case requests wasn't doing that already. This is why I'm sorry I was so quiet for a few months 😅. This is definitely on me providing an unclear description of the goal and not handing off my work correctly!

Regarding the news file, since pip's internal API is internal, I think this should be flagged as trivial. Concerning the approach, I can recall discussions with @pradyunsg and @chrahunt resulting in keeping the late download in the resolver (at least until the legacy resolver is removed, to keep the consistency of output by the two resolvers), in particular, within RequirementPreparer (see GH-8685).

This is super super helpful especially with the link to the PR I missed!! Hardly gave @pradyunsg much else to review ^_^

IIRC @pradyunsg told me that it's preferable to pass a dry_run option to the resolver instead; the reason pip resolve has not been implemented is rather that we don't know exactly what output format would be most helpful.

Yes! dry_run seems much closer to the option I'd expect. Off the top of my head I was thinking something like a constraints.txt with pinned (==) requirements would be most interesting (I think whatever pip freeze produces would likely be super neat) -- but will follow up in #53!

Side comment: the only use of PartialRequirementDownloadCompleter is to initialize it and then call its only method, so I would love to see it changed into a function if it turns out that my memory of the general approach described above isn't correct.

Will do!

Let's keep the download logic inside the resolver.resolve call, and add a flag to tell the resolver whether to fully-prepare partially prepared items. :)

This makes perfect sense! It's also less code ^_^

I made a mess of the review here -- but that's what I get for reviewing at 1am. :)

You reviewed at approximately the same time I posted it, then! We'll call it even.

cosmicexplorer · 2020-09-24T06:53:42Z

Instead of doing this:

add a flag to tell the resolver whether to fully-prepare partially prepared items. :)

I instead just added an extra method .complete_partial_requirements() in operations/prepare.py (and updated the description), because it seemed that piping a PipSession instance into the resolver would risk it being used for other things that were previously nicely separated. So instead I made use of the existing fields of the RequirementPreparer to just ensure that resolver.resolve() could clearly mark which part did the batch downloading. I hope that seems reasonable -- totally willing to refactor if either of you two had something else in mind!

pradyunsg

Hmm... keeping things on RequirementPreparer itself... should get us nearly the same benefits (separation of "collect" vs "download" in prepare_more). I think we can get away without introducing changes on the resolver side?

We'd skip the entire "prepare_more" stage from the resolver during a dry run anyway, so... :)
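A toy sketch of that idea, assuming a dry_run flag existed (it is only being discussed here, and none of these names are pip's real API): the resolver would return its result and skip the "prepare more" stage entirely when the flag is set.

```python
from typing import Callable, Iterable, List


def resolve(
    requirements: Iterable[str],
    prepare_more: Callable[[List[str]], None],
    dry_run: bool = False,
) -> List[str]:
    """Hypothetical resolver entry point; sorting stands in for real resolution."""
    resolved = sorted(requirements)
    if not dry_run:
        # Only a real run completes the partially prepared requirements.
        prepare_more(resolved)
    return resolved


# Record which requirements would have been fully downloaded.
downloaded: List[str] = []
dry_result = resolve(["b==2.0", "a==1.0"], downloaded.extend, dry_run=True)
after_dry_run = list(downloaded)
real_result = resolve(["b==2.0", "a==1.0"], downloaded.extend)
```

With this shape, the dry run and the real run share the same resolution path and differ only in whether the download step executes.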

src/pip/_internal/resolution/resolvelib/resolver.py

src/pip/_internal/operations/prepare.py

cosmicexplorer · 2020-09-24T11:03:33Z

should get us nearly the same benefits (separation of "collect" vs "download" in prepare_more).

Yes! This is a much nicer way to think about it. Will do.

We'd skip the entire "prepare_more" stage from the resolver during a dry run anyway, so... :)

Great point!! (along with the rest)

cosmicexplorer · 2020-09-24T11:35:38Z

Done, I think! I think this gets us the separation you noted above, paving the way for other methods of downloading requirements, as well as not downloading them at all!

cosmicexplorer · 2020-09-28T00:50:47Z

ping!

xavfernandez

This seems clean enough but would need rebasing :)

create PartialRequirementDownloadCompleter, and use in wheel, install, and download
add NEWS entry
rename NEWS entry
rename NEWS entry
respond to review comments
move the partial requirement download completion to the bottom of the prepare_more method

cosmicexplorer · 2020-10-09T07:20:14Z

Rebased!

cosmicexplorer · 2020-10-17T04:10:27Z

Perhaps @xavfernandez is the right person to ping here?

cosmicexplorer · 2020-10-26T05:09:29Z

Ping once more! ^_^

cosmicexplorer · 2020-10-26T05:09:54Z

Let me know if there's someone specific I should contact!

pradyunsg · 2020-10-26T11:09:07Z

Nah -- I'll merge this once #8936 is done with (later this week). :)

cosmicexplorer · 2020-11-16T05:11:13Z

ping @pradyunsg! ❤️

cosmicexplorer · 2020-11-19T06:14:37Z

ping!

pradyunsg · 2020-11-19T10:17:00Z

Pong! pip 20.3 hasn't been released yet, because, well, 2020.

I will come back to this, once that's done.

cosmicexplorer · 2020-11-20T02:25:26Z

Not a problem!!! Thanks for the update. I will follow along with the 20.3 updates then!

uranusjr · 2021-02-18T10:10:08Z

Come on, report your status, Azure.

cosmicexplorer force-pushed the download-entire-file-outside-of-resolver branch from de18d0d to 4d02725 Compare September 22, 2020 09:35

pradyunsg previously approved these changes Sep 23, 2020

pradyunsg reviewed Sep 23, 2020

pradyunsg reviewed Sep 23, 2020

cosmicexplorer force-pushed the download-entire-file-outside-of-resolver branch from 0d7e9e8 to 15e61d6 Compare September 24, 2020 06:46

cosmicexplorer requested a review from pradyunsg September 24, 2020 06:50

pradyunsg reviewed Sep 24, 2020

cosmicexplorer force-pushed the download-entire-file-outside-of-resolver branch 4 times, most recently from 34d27a5 to 50c5e51 Compare September 24, 2020 11:31

cosmicexplorer requested a review from pradyunsg September 24, 2020 11:33

cosmicexplorer changed the title ~~Download wheels in batch outside of the resolver when --use-feature=fast-deps is on~~ Download wheels in batch at the end of .prepare_linked_requirements_more() Sep 25, 2020

cosmicexplorer force-pushed the download-entire-file-outside-of-resolver branch from 50c5e51 to e9ed0b5 Compare September 28, 2020 09:30

cosmicexplorer mentioned this pull request Sep 28, 2020

extend fast-deps to sdists (and allow downloading foreign sdists) (working prototype!) #8929

Closed

xavfernandez reviewed Oct 2, 2020

cosmicexplorer force-pushed the download-entire-file-outside-of-resolver branch from e9ed0b5 to 22406d4 Compare October 9, 2020 07:20

fix lint

24ad324

pradyunsg approved these changes Oct 9, 2020

xavfernandez approved these changes Oct 26, 2020

uranusjr closed this Feb 18, 2021

uranusjr reopened this Feb 18, 2021

uranusjr merged commit f03d71e into pypa:master Feb 18, 2021

github-actions bot locked as resolved and limited conversation to collaborators Oct 2, 2021
