-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question: Do you plan to create a pipeline of Jobs? #240
Comments
We have this issue/feature. #104 |
I thought this feature was only about supporting which one to start first, the worker or the driver. |
That's correct, the idea described in #104 would only address things like the leader/work pattern, where we wait to start the worker until the leader (Job A) is running before starting the worker (Job B) - that is, not waiting until Job A is finished but waiting until Job A is ready before launching Job B. However, I think the proposed design could easily be extended to include an extra knob to configure if we want to wait until the previous Job is ready or until it is finished before starting the next job. Building off of @kannon92's comment, maybe something like this:
|
I see. Sounds useful. Thank you for your explanation! |
Only issue with |
Based on a discussion with @ahg-g and @danielvegamyhre in the startup policy KEP, I don't know if we will cover a pipeline of jobs (ie sequential completed execution). |
As @kannon92 mentioned above, after some discussion we concluded we want to keep the startup policy API simple and specific to sequential startup, to address specific requirements of distributed training frameworks. For now, we do not want to expand to include other use cases, like sequential completed execution, where the API can quickly explode into a full blown workload DAG / workflow execution engine. |
Thank you for the development of this interesting OSS.
I have a question.
I think it would be very reasonable to develop
JobSet
to meet the needs of AI/ML workloads. On the other hand, in ML pipelines, it is often required to run multiple Jobs sequentially, such as running Job A, then Job B, and then Job C after that.Can
JobSet
cover this functionality of executing Jobs in sequence, or should features like these be handled by another OSS likeArgo
?The text was updated successfully, but these errors were encountered: