-
Notifications
You must be signed in to change notification settings - Fork 16.3k
Closed
Labels
Description
When running airflow DatabricksRunNowOperator, if using managed identity, and if there is multiple tasks running in parallel, the code below can quickly exhaust the allowed metadata server as explained here https://learn.microsoft.com/en-us/azure/virtual-machines/instance-metadata-service?tabs=windows#rate-limiting
In general, requests to IMDS are limited to 5 requests per second (on a per VM basis). Requests exceeding this threshold will be rejected with 429 responses.
airflow/providers/databricks/src/airflow/providers/databricks/hooks/databricks_base.py
Line 568 in 4b83391
| self._check_azure_metadata_service() |
I don't understand why this check was put here 3 years ago, but I feel this is unnecessary.
I propose we remove this check, or make it optional. Can I open a PR for this?
