You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
What would you like to be added:
Support for defining a global coordinator pod in the JobSet spec.
Why is this needed:
We need to be able to build automation on top of JobSet which knows the stable network endpoint of the pod assigned to be the global coordinator distributed ML training / HPC workloads.
The text was updated successfully, but these errors were encountered:
What would you like to be added:
Support for defining a global coordinator pod in the JobSet spec.
Why is this needed:
We need to be able to build automation on top of JobSet which knows the stable network endpoint of the pod assigned to be the global coordinator distributed ML training / HPC workloads.
The text was updated successfully, but these errors were encountered: