You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
What would you like to be added:
Comprehensive example tasks running training workloads on TPUs with JobSet. Also demonstrating JobSet + Kueue integration would be nice.
Why is this needed:
We need more comprehensive examples that will reduce friction for users trying out JobSet for real training workloads. Right now we mostly just have toy examples with "sleep" containers that demonstrate different features.
The text was updated successfully, but these errors were encountered:
danielvegamyhre
changed the title
Comprehensive example tasks for running multislice TPU workloads with JobSet (and JobSet + Kueue)
Comprehensive example task for running multislice TPU workloads with JobSet (and JobSet + Kueue)
Mar 13, 2024
@uroy-personal since you have multiple tasks already, I'm going to assign this one to someone with some experience with TPUs to distribute the workload
What would you like to be added:
Comprehensive example tasks running training workloads on TPUs with JobSet. Also demonstrating JobSet + Kueue integration would be nice.
Why is this needed:
We need more comprehensive examples that will reduce friction for users trying out JobSet for real training workloads. Right now we mostly just have toy examples with "sleep" containers that demonstrate different features.
The text was updated successfully, but these errors were encountered: