You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In TensorFlow distributed training docs, it uses CLI arguments instead. We can not support those two ways since we want to hide the service discovery layer from the users, so I think we could discuss which one will be supported. Maybe AI engineers could give us more info.
TF_CONFIG is a TensorFlow convention that TF APIs like the EstimatorAPI use to get information about the runtime environment and configure the job appropriately.
Now we set the
TF_CONFIG
to get the cluster spec in the training code, and it follows the idea in Google Cloud Machine Learning Engine (Cloud ML Engine).In TensorFlow distributed training docs, it uses CLI arguments instead. We can not support those two ways since we want to hide the service discovery layer from the users, so I think we could discuss which one will be supported. Maybe AI engineers could give us more info.
\cc @DjangoPeng @ScorpioCPH
The text was updated successfully, but these errors were encountered: