Deactivate caching in fulltest pipeline #294

AdrianSosic · 2024-07-02T07:54:02Z

This PR removes the last remaining caching steps from our pipeline due to the mysterious interference with serialization. Can be reactivated once the root cause is identified.

Scienfitz · 2024-07-02T07:57:53Z

.github/workflows/regular.yml

@@ -1,7 +1,7 @@
 # NOTES:
 # - The map syntax used for matrix is flagged red but actually works
 # - This runs everything in Python 3.10, 3.11 and 3.12
-# - No environments are cached due to space limit


the old reason is in principle still true so why delete?

But that's not the primary reason why we don't want caching here, right? I thought we explicitly want to trigger to full installation pipeline etc (including newest package versions etc) in order to have a true e2e test? While the old reason might still be valid, it doesn't matter here. Or how would you even want me to express that? I guess not like "... to perform a full end-to-end test. Even if we didn't care about e2e, environments would still not be cached due to space limits." 😄

no it was certainly one of the reasons and still is, if this action creates lots of caches it might delete caches form CI because theres a limit, just keep it and add your reason, no reason to delete

Scienfitz · 2024-07-03T08:29:27Z

closing due to #298

Due to continuing serialization problems that were thought to be related with caching, #277 deactivated core test caching and #294 was prepared to do the same for the full test environment. This PR reactivates caching and instead refactors the class layout of `SKLearnClusteringRecommender` in an attempt to fix the root cause. Mysteriously, the top-level import of `sklearn.mixture.GaussianMixture` seems to cause trouble. While the reason is still unclear, turning it into a lazy import (which will also become handy later when making `scikit-learn` an optional dependency) seems to resolve the problem. On a side note: deactivating slots for the recommenders solves the problem as well, which suggests that the root cause could be related to classes not being properly garbage collected (since `attrs` needs to create new classes when slots are activated), which could also explain that `GaussianMixtureClusteringRecommender` seemed to have improperly overridden methods after deserialization (for example, the `__repr__` of a created Gaussian mixture recommender correctly pointed to its own class before serialization but to the `__repr__` of `SKLearnClusteringRecommender` after serialization – but weirdly only when executed in `tox`).

Deactivate caching in fulltest pipeline

2ffe300

AdrianSosic added the repo Requires changes to the project configuration label Jul 2, 2024

AdrianSosic self-assigned this Jul 2, 2024

AdrianSosic requested review from Scienfitz and AVHopp as code owners July 2, 2024 07:54

Scienfitz approved these changes Jul 2, 2024

View reviewed changes

AVHopp approved these changes Jul 2, 2024

View reviewed changes

AdrianSosic mentioned this pull request Jul 3, 2024

Fix serialization and caching #298

Merged

Scienfitz closed this Jul 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deactivate caching in fulltest pipeline #294

Deactivate caching in fulltest pipeline #294

AdrianSosic commented Jul 2, 2024

Scienfitz Jul 2, 2024

AdrianSosic Jul 2, 2024

Scienfitz Jul 2, 2024

Scienfitz commented Jul 3, 2024 •

edited

Loading

Deactivate caching in fulltest pipeline #294

Deactivate caching in fulltest pipeline #294

Conversation

AdrianSosic commented Jul 2, 2024

Scienfitz Jul 2, 2024

Choose a reason for hiding this comment

AdrianSosic Jul 2, 2024

Choose a reason for hiding this comment

Scienfitz Jul 2, 2024

Choose a reason for hiding this comment

Scienfitz commented Jul 3, 2024 • edited Loading

Scienfitz commented Jul 3, 2024 •

edited

Loading