Skip to content

Upgrade the Airflow Databricks provider to use the Databricks jobs API 2.2 #59189

@JonasDoubleU

Description

@JonasDoubleU

Description

Upgrade the Airflow Databricks provider to use the Databricks jobs API 2.2 from 2.1. Specifically the GET_RUN_ENDPOINT, to allow fetching of the status of all tasks within a workflow with more than 100 tasks. Consequently, the fetching probably needs to be modifed to handle pagination.

Use case/motivation

We are using Airflow to trigger Databricks workflows, but some of our workflows contain more than 100 tasks. The support to create workflows with more than 100 tasks with the Databricks jobs API was added in version 2.2. Therefore, when fetching the status of the tasks Airflow fails because it uses version 2.1. Ultimately, we cannot use Airflow to trigger these workflows with the current Airflow Databricks provider.

Related issues

No response

Are you willing to submit a PR?

  • Yes I am willing to submit a PR!

Code of Conduct

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions