diff --git a/cookbook/deployment/configure_use_gpus.py b/cookbook/deployment/configure_use_gpus.py index 6c26552f7e..4e320ca52b 100644 --- a/cookbook/deployment/configure_use_gpus.py +++ b/cookbook/deployment/configure_use_gpus.py @@ -11,8 +11,9 @@ treat machines with GPUs and machines with CPUs equally. You may want to reserve machines with GPUs for tasks that explicitly request GPUs. To achieve this, Flyte uses the Kubernetes concept of `taints and tolerations `__. -You can configure Flyte backend to automatically schedule your task onto a node with GPUs by tolerating specific taints. -This configuration is controlled under generic k8s plugin configuration as can be found `here `__. +Kubernetes can automatically apply tolerations for extended resources like GPUs using the `ExtendedResourceToleration plugin `__, enabled by default in some cloud environments. Make sure the GPU nodes are tainted with a key matching the resource name, i.e., ``key: nvidia.com/gpu``. + +You can also configure Flyte backend to apply specific tolerations. This configuration is controlled under generic k8s plugin configuration as can be found `here `__. The idea of this configuration is that whenever a task that can execute on Kubernetes requests for GPUs, it automatically adds the matching toleration for that resource (in this case, ``gpu``) to the generated PodSpec.