Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: resume pipelines on restart #7229

Merged
merged 3 commits into from
Apr 1, 2024
Merged

Conversation

klesh
Copy link
Contributor

@klesh klesh commented Mar 27, 2024

Summary

Use cases: devlake container gets restarted accidentally
Why: currently, all running pipelines would be marked as FAILED on restart and all progress would be dropped, one would have to rerun all of them which is quite costly.
What: this PR would resume the running pipelines on the Subtask level.
How: resuming pipeline on subtask level could pick up most of the work which could reduce the cost drastically
Todo: gitextractor requires the repo to be cloned into the local storage for data collection, unfortunately, the repo would be gone after the container restarted, either we cache it by using a volume or we have to optimize the clone speed for the whole thing to make sense.

Screenshots

Pipeline is running
Snipaste_2024-04-01_15-26-15

Pipeline resumed successfully after restarted
Snipaste_2024-04-01_15-27-04

@klesh klesh force-pushed the kw-resume-pipeline-after-restart branch from c99bb36 to e0490d1 Compare March 28, 2024 08:17
@klesh klesh force-pushed the kw-resume-pipeline-after-restart branch from e0490d1 to f162af8 Compare April 1, 2024 07:30
@abeizn abeizn merged commit e5ee65e into main Apr 1, 2024
10 checks passed
@abeizn abeizn deleted the kw-resume-pipeline-after-restart branch April 1, 2024 07:34
abeizn pushed a commit that referenced this pull request Apr 1, 2024
* feat: resume pipelines on restart

* feat: gitextractor clone phase must be executed on resume

* fix: subtasks always get skipped
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants