Feat/generic pool #309

tdejager · 2023-09-04T12:24:09Z

Description

This PR makes the Rust libsolv solver generic over a couple of traits; the existing conda code now implements these traits. The traits you currently need are:

VersionTrait: defines what a version specification is. We might be able to get rid of some requirements on this later. For conda this is a PackageRecord.
VersionSet: the set of versions that you can use to match something that implements VersionTrait.
DependencyProvider: has a function that determines how candidates selected by the solver should be ordered. This is custom per ecosystem.
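
To make the split of responsibilities concrete, here is a minimal sketch of how three traits like these could fit together. The method names and signatures (`contains`, a `sort_candidates` that takes a `Vec` of versions) are simplified assumptions for illustration, not the signatures used in this PR. `ToyVersion`, `ToyRange`, `HighestFirst` and `select` are hypothetical stand-ins for a `PackageRecord`, a match spec, the conda ordering and the solver's candidate lookup.

```rust
use std::fmt::Debug;

/// Hypothetical stand-in for `VersionTrait`: one concrete candidate version.
trait VersionTrait: Clone + Debug + Ord {}

/// Hypothetical stand-in for `VersionSet`: a predicate over versions.
trait VersionSet {
    type V: VersionTrait;
    /// Returns true if `v` belongs to this set.
    fn contains(&self, v: &Self::V) -> bool;
}

/// Hypothetical stand-in for `DependencyProvider`: ecosystem-specific
/// ordering of the candidates the solver should try.
trait DependencyProvider<VS: VersionSet> {
    /// Reorders `candidates` so the most preferred one comes first.
    fn sort_candidates(&self, candidates: &mut Vec<VS::V>);
}

/// Toy version: a (major, minor) pair, ordered lexicographically.
#[derive(Clone, Debug, PartialEq, Eq, PartialOrd, Ord)]
struct ToyVersion(u32, u32);
impl VersionTrait for ToyVersion {}

/// Toy version set: the half-open range `[lower, upper)`.
struct ToyRange {
    lower: ToyVersion,
    upper: ToyVersion,
}

impl VersionSet for ToyRange {
    type V = ToyVersion;
    fn contains(&self, v: &ToyVersion) -> bool {
        *v >= self.lower && *v < self.upper
    }
}

/// Toy provider that prefers the highest version.
struct HighestFirst;

impl DependencyProvider<ToyRange> for HighestFirst {
    fn sort_candidates(&self, candidates: &mut Vec<ToyVersion>) {
        candidates.sort_by(|a, b| b.cmp(a));
    }
}

/// Generic helper: keep the versions matching `set`, then let the
/// provider decide the order. Nothing here knows about conda.
fn select<VS, D>(all: &[VS::V], set: &VS, provider: &D) -> Vec<VS::V>
where
    VS: VersionSet,
    D: DependencyProvider<VS>,
{
    let mut matching: Vec<VS::V> =
        all.iter().filter(|v| set.contains(*v)).cloned().collect();
    provider.sort_candidates(&mut matching);
    matching
}

fn main() {
    let all = vec![ToyVersion(1, 0), ToyVersion(1, 5), ToyVersion(2, 0)];
    let range = ToyRange { lower: ToyVersion(1, 0), upper: ToyVersion(2, 0) };
    let picked = select(&all, &range, &HighestFirst);
    println!("{:?}", picked);
}
```

Because `select` only depends on the traits, a different ecosystem would only need its own version, version set and provider types.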

The reason for doing this is to be able to integrate other ecosystems like PyPI into the solver and see if this can work. We now know which generic requirements the solver currently has.

All the conda-specific code has been moved to rattler_solve. I think that if all tests succeed, this is in a state where it could be merged. All the tests succeed locally, so I suppose the whole thing should be a drop-in replacement.

It does have a performance regression, namely that we are copying a lot of PackageRecords. Regressions range from 13% to 77% depending on the test, so that's quite a lot. However, I think it would be good to review and merge before it gets bigger :) Then we can look at the PackageRecord improvement.

In the solver tests I've made a bespoke version and matching spec that could potentially be a source of inspiration for an implementation.

Todo's in code

Some TODO's have been added to the code where we could potentially improve some aspects.
Some notable todo's:

The sorting method in conda requires a cache that maps candidates to their highest versions. Could we make this more generic? Maybe even a generic caching crate like lru would just work.
There is a Name type in the VersionTrait that might be able to move to Pool.
In the libsolv tests, I've made generic methods for adding packages and dependencies to the Pool. This puts more requirements on the traits, because what these methods need has to be pulled into the traits. This goes against the previous point, however, and we should see what we prefer. It's also totally possible to keep these methods ecosystem specific, with some potential duplication.
Can we avoid newtypes somehow when implementing the traits in rattler_solve, or is this something we actually want? :)
Can we make the arguments to sort_candidates type-safe? As of now they can be any SolvableId. It would be nice to be able to actually use the subset returned by the pool when matching on the name. We could introduce a new type for this that can only be constructed in such a situation.
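
The last point can be sketched as follows. This is only an illustration of the idea, under the assumption that the pool keeps a per-name list of solvable ids. `Candidates`, `NameId`, `Pool::add`, `Pool::candidates` and this `sort_candidates` signature are hypothetical and not the API in this PR. The private field is what stops callers from passing an arbitrary set of ids.

```rust
mod pool {
    use std::collections::HashMap;

    #[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
    pub struct SolvableId(pub u32);

    #[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
    pub struct NameId(pub u32);

    /// Solvables that all share one name. The field is private, so code
    /// outside this module cannot build a `Candidates` from arbitrary ids.
    #[derive(Debug)]
    pub struct Candidates<'p> {
        ids: &'p [SolvableId],
    }

    impl<'p> Candidates<'p> {
        pub fn ids(&self) -> &'p [SolvableId] {
            self.ids
        }
    }

    #[derive(Default)]
    pub struct Pool {
        by_name: HashMap<NameId, Vec<SolvableId>>,
    }

    impl Pool {
        pub fn add(&mut self, name: NameId, id: SolvableId) {
            self.by_name.entry(name).or_default().push(id);
        }

        /// The only way to obtain a `Candidates` value: a lookup by name.
        pub fn candidates(&self, name: NameId) -> Candidates<'_> {
            let ids: &[SolvableId] = match self.by_name.get(&name) {
                Some(v) => v.as_slice(),
                None => &[],
            };
            Candidates { ids }
        }
    }
}

use pool::{Candidates, NameId, Pool, SolvableId};

/// Hypothetical sorter: it can only receive ids that came from a name
/// lookup. Here a higher id simply counts as "more preferred".
fn sort_candidates(candidates: Candidates<'_>) -> Vec<SolvableId> {
    let mut sorted = candidates.ids().to_vec();
    sorted.sort_by(|a, b| b.0.cmp(&a.0));
    sorted
}

fn main() {
    let mut pool = Pool::default();
    pool.add(NameId(0), SolvableId(1));
    pool.add(NameId(0), SolvableId(3));
    pool.add(NameId(1), SolvableId(2));
    println!("{:?}", sort_candidates(pool.candidates(NameId(0))));
}
```

Writing `Candidates { ids: ... }` outside the `pool` module does not compile, which is the guarantee the todo asks for.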

What's next?

Making use of the &PackageRecord type instead of the PackageRecord value, avoiding copies.
Try writing the same for PyPI and see what we still need to change, while keeping the solver code generic.
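
As a rough illustration of the first item, a pool can hold borrowed records instead of owned copies. The sketch below assumes a heavily reduced, hypothetical `PackageRecord` with three fields and a hypothetical `Pool` with `add` and `resolve` methods; the real types in rattler are richer. The point is only that adding a record stores a reference, so no clone takes place.

```rust
/// Hypothetical, heavily reduced stand-in for a conda `PackageRecord`.
#[derive(Debug, Clone, PartialEq)]
struct PackageRecord {
    name: String,
    version: String,
    depends: Vec<String>,
}

/// A pool that borrows its records for the lifetime `'a` instead of
/// owning a copy of each one.
struct Pool<'a> {
    solvables: Vec<&'a PackageRecord>,
}

impl<'a> Pool<'a> {
    fn new() -> Self {
        Pool { solvables: Vec::new() }
    }

    /// Stores a reference to `record` and returns its index in the pool.
    fn add(&mut self, record: &'a PackageRecord) -> usize {
        self.solvables.push(record);
        self.solvables.len() - 1
    }

    /// Hands back the original record without cloning it.
    fn resolve(&self, id: usize) -> &'a PackageRecord {
        self.solvables[id]
    }
}

fn main() {
    let records = vec![
        PackageRecord {
            name: "numpy".to_string(),
            version: "1.25.2".to_string(),
            depends: vec!["python".to_string()],
        },
        PackageRecord {
            name: "python".to_string(),
            version: "3.11.5".to_string(),
            depends: vec![],
        },
    ];

    let mut pool = Pool::new();
    let ids: Vec<usize> = records.iter().map(|r| pool.add(r)).collect();

    for id in ids {
        let r = pool.resolve(id);
        println!("{} {} depends on {:?}", r.name, r.version, r.depends);
    }
}
```

The trade-off is that the records must outlive the pool, which the lifetime parameter makes explicit.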

The Record trait no longer has a `name()` function so I think it acts more like a version than a record.

wolfv · 2023-09-04T12:49:05Z

Wow, impressive that it's passing our test suite! Awesome!
I was wondering if the error message changes are really necessary or if we can keep the current way of displaying the "matchspec" form?
I haven't done a deeper review of the changes yet.

I am also not sure how we should proceed given the performance regression. That might be a little annoying to end-users since we already enabled the new solver in pixi.

Should we make a solver-development branch instead?

baszalmstra · 2023-09-04T12:59:49Z

I think I can solve the performance regressions tomorrow!

tdejager · 2023-09-04T13:08:44Z

@baszalmstra @wolfv Okay, so we'll keep this PR open until the performance regressions are fixed?

baszalmstra · 2023-09-04T13:09:45Z

Yes! should be fixed pretty soon!

tdejager · 2023-09-04T13:12:58Z

> Wow, impressive that it's passing our test suite! Awesome! I was wondering if the error message changes are really necessary or if we can keep the current way of displaying the "matchspec" form? I haven't done a deeper review of the changes yet.
>
> I am also not sure how we should proceed given the performance regression. That might be a little annoying to end-users since we already enabled the new solver in pixi.
>
> Should we make a solver-development branch instead?

It's for the internal tests anyway. I think the error message and format used there are a little more generic because they remove the ambiguity of the upper bound, which I hope helps a random person looking at the solver tests for the first time. It also decouples the tests more from conda at the same time.

wolfv · 2023-09-04T13:14:47Z

Ah, OK, so to confirm, the error messages we would see in pixi are the same?

tdejager · 2023-09-04T15:01:13Z

> Ah, OK, so to confirm, the error messages we would see in pixi are the same?

For sure!

wolfv · 2023-09-07T08:09:19Z

I was running the benchmark locally and it was only ~10% slower at most, so I'm fine with merging.

tdejager · 2023-09-07T09:09:10Z

After I added LTO, performance is the same again on my M1.

tdejager and others added 23 commits August 24, 2023 11:16

feat: now compiles with generic matchspec pool

e9c5715

Merge branch 'main' into feat/generic-pool

c6c14ae

Merge branch 'main' into feat/generic-pool

b2e16b6

fix: remove matchspec from SolveJobs

fa36cd4

feat: clause is now generic

6f4552f

feat: converted some more to generic types

c2b4940

fix: changed V to VS

aabe34a

feat: moved traits

1d27c91

feat: made more types in the solver generic

ba26928

feat: compiling of generic solver

9021376

fix: typo

81446d8

fmt

cf546ac

wip: extract matchspec

3b88b3f

fix: tests succeed again

4f325c7

refactor: removed Version::name()

8c58026

refactor: renamed Record back to Version

16bcfba

The Record trait no longer has a `name()` function so I think it acts more like a version than a record.

refactor: made Problem MatchSpec agnostic

2ac2f0f

feat: renaming and made the cache to the candidates

780a793

feat: removed some more trait bounds

5dc784d

feat: made more generics, started converting tests

f4a245b

feat: porting tests

3a6c792

feat: all tests are made generic and run succesfully

ad5dbf4

feat: libsolvrs is free from conda types

084b990

tdejager requested review from wolfv and baszalmstra September 4, 2023 12:24

Merge branch 'main' into feat/generic-pool

9c41786

tdejager added 5 commits September 4, 2023 15:53

feat: removed SortCache trait replaced it with a HashMap for now

2186baf

feat: some small improvements

48016a6

feat: cache now lives in dependency provider

d987f44

feat: comment regarding pool

f7d0b99

feat: add repr transparent to the types

5725290

baszalmstra and others added 5 commits September 4, 2023 21:02

feat: pool stores references to package records

595eaf8

fix: solvable exposes its name

42c0341

expose name

97c9ffa

cache parsing of matchspec

1b99938

Merge remote-tracking branch 'upstream/main' into feat/generic-pool

be5b4c3

feat: removed rattler dependency and added lto for bench

e9cbe41

baszalmstra approved these changes Sep 7, 2023

tdejager merged commit 6af1c5e into main Sep 7, 2023

tdejager deleted the feat/generic-pool branch September 7, 2023 09:21
