conda-build traverses the whole index greedily, undoing the lazy-loading optimizations #4961

jaimergp · 2023-08-09T08:01:38Z

In #4431 I was investigating some differences in timings between CONDA_SOLVER=libmamba conda-build and conda mambabuild, which should be similar. However, conda-build spends a few minutes before the solver kicks in, while mambabuild does not.

I found that those extra minutes are spent creating the build index, which is the aggregation of all the source channels (e.g. defaults, conda-forge) and their platforms (noarch is collapsed into e.g. linux-64 🤷), plus the local channels (the CONDA_BLD_PATH cache and/or the chosen output folder).

The slow part is not the repodata fetching, but creating the million+ PackageRecord objects greedily, instead of letting conda do it lazily as needed (introduced in conda/conda#12050). This actually happens in conda, in two places, but conda-build is the sole consumer of those endpoints AFAIK.

In index.py, in the exports module, where an identity dict[PackageRecord, PackageRecord] is built by aggregating all the SubdirData.iter_records() instances.
In exports.py, where that map is processed again just to convert the keys into Dist objects.
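
To illustrate the difference between greedy and lazy record creation, here is a minimal sketch. It does not use conda's real classes: `Record`, `make_record`, `eager_index`, `LazyIndex` and the `RAW` data are all hypothetical stand-ins, and the counter only shows how many objects each approach ends up constructing.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Iterator, List


@dataclass(frozen=True)
class Record:
    """Hypothetical stand-in for a package record (not conda's PackageRecord)."""
    name: str
    version: str


CONSTRUCTED = 0  # how many Record objects have been built so far


def make_record(raw: dict) -> Record:
    """Build one Record from a raw repodata-like entry and count it."""
    global CONSTRUCTED
    CONSTRUCTED += 1
    return Record(raw["name"], raw["version"])


def eager_index(raw_entries: Iterable[dict]) -> Dict[Record, Record]:
    """Greedy: build every record up front and store it in an identity mapping."""
    records = (make_record(raw) for raw in raw_entries)
    return {rec: rec for rec in records}


class LazyIndex:
    """Lazy: keep raw entries grouped by name and build records only when queried."""

    def __init__(self, raw_entries: Iterable[dict]) -> None:
        self._raw_by_name: Dict[str, List[dict]] = {}
        for raw in raw_entries:
            self._raw_by_name.setdefault(raw["name"], []).append(raw)

    def query(self, name: str) -> Iterator[Record]:
        for raw in self._raw_by_name.get(name, ()):
            yield make_record(raw)


# Made-up data: 10,000 entries spread over 1,000 package names.
RAW = [{"name": f"pkg{i % 1000}", "version": f"1.{i}"} for i in range(10_000)]


def count_constructed(fn) -> int:
    """Reset the counter, run fn, and report how many records it built."""
    global CONSTRUCTED
    CONSTRUCTED = 0
    fn()
    return CONSTRUCTED


eager_count = count_constructed(lambda: eager_index(RAW))
lazy_count = count_constructed(lambda: list(LazyIndex(RAW).query("pkg7")))
print(eager_count, lazy_count)
```

In this toy setup the greedy path constructs one object per entry regardless of what is needed, while the lazy path only constructs the records for the name that was asked for. The real cost in conda-build comes from doing the former over a million or more entries.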

It feels like we could do this better, as @dholth was saying in #4431 (comment). The interface with the Solver could also be better; it currently overwrites Solver._index with this greedily built object, and conda-libmamba-solver won't even use that 😬


jaimergp · 2024-04-11T13:33:08Z

Superseded by #5154

conda-bot added this to 🧭 Planning Aug 9, 2023

github-project-automation bot moved this to 🆕 New in 🧭 Planning Aug 9, 2023

jaimergp mentioned this issue Apr 11, 2024

Remove old full-index-in-memory loading and conda.plan-related solver handling #5154

Closed

jaimergp closed this as completed Apr 11, 2024

github-project-automation bot moved this from 🆕 New to 🏁 Done in 🧭 Planning Apr 11, 2024

github-actions bot added the locked [bot] locked due to inactivity label Oct 9, 2024

github-actions bot locked as resolved and limited conversation to collaborators Oct 9, 2024
