Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Filter datasets by dag_id in rest API #37423

Closed
1 of 2 tasks
bbovenzi opened this issue Feb 14, 2024 · 3 comments · Fixed by #37512
Closed
1 of 2 tasks

Filter datasets by dag_id in rest API #37423

bbovenzi opened this issue Feb 14, 2024 · 3 comments · Fixed by #37512
Assignees
Labels
area:API Airflow's REST/HTTP API area:webserver Webserver related Issues kind:feature Feature Requests

Comments

@bbovenzi
Copy link
Contributor

bbovenzi commented Feb 14, 2024

Description

We have a list datasets endpoint in the API. Right now we can only filter by searching uri string. It would be useful to be able to pass one or multiple dag_ids and only return datasets that include that dag_id in consuming_dags or producing_tasks

Use case/motivation

Make it easy to see what datasets are connected to a dag.

Right now our webserver endpoint to get dataset dependencies returns all datasets+dags and then the UI needs to filter it down.

Related issues

No response

Are you willing to submit a PR?

  • Yes I am willing to submit a PR!

Code of Conduct

@bbovenzi bbovenzi added kind:feature Feature Requests needs-triage label for new issues that we didn't triage yet area:API Airflow's REST/HTTP API and removed needs-triage label for new issues that we didn't triage yet labels Feb 14, 2024
@bbovenzi
Copy link
Contributor Author

Some questions I have I'd like to get people's thoughts on:

  1. Should it be a single dag_id or an array of multiple dag_ids?
  2. Should we specify only consuming or only producing or both?
  3. Maybe we need a separate, perhaps not in the public REST API, endpoint to return the entire dataset-dag graph based on either a dataset_uri or a dag_id?

@bbovenzi bbovenzi added the area:webserver Webserver related Issues label Feb 15, 2024
@bbovenzi
Copy link
Contributor Author

Actually this could just be in datasets_summary endpoint in the webserver.

@Satoshi-Sh
Copy link
Contributor

I can take on this task.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area:API Airflow's REST/HTTP API area:webserver Webserver related Issues kind:feature Feature Requests
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants