[DataCatalog]: Iterate through datasets objects in the catalog #3916

Closed
ElenaKhaustova opened this issue Jun 3, 2024 · 2 comments
Labels
Issue: Feature Request New feature or improvement to existing feature

Comments

@ElenaKhaustova
Contributor

ElenaKhaustova commented Jun 3, 2024

Description

We propose implementing iterable support for catalog.datasets, so that users can iterate over dataset objects directly.
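
To make the request concrete, here is a rough sketch of what an iterable datasets container could look like. ToyDataset and ToyDatasets are hypothetical stand-ins invented for illustration; they are not Kedro classes, and the real implementation may well look different.

```python
from typing import Any, Dict, Iterator


class ToyDataset:
    """Hypothetical stand-in for a dataset object; not a Kedro class."""

    def __init__(self, data: Any) -> None:
        self._data = data

    def load(self) -> Any:
        return self._data


class ToyDatasets:
    """Hypothetical container sketching an iterable catalog.datasets."""

    def __init__(self, datasets: Dict[str, ToyDataset]) -> None:
        self._datasets = dict(datasets)

    def __iter__(self) -> Iterator[ToyDataset]:
        # Yield the dataset objects themselves, not their names.
        return iter(self._datasets.values())

    def __getitem__(self, name: str) -> ToyDataset:
        # Fetch a single dataset by name through a public method.
        return self._datasets[name]

    def __len__(self) -> int:
        return len(self._datasets)


datasets = ToyDatasets({"cars": ToyDataset([1, 2]), "boats": ToyDataset([3])})

# Iterate over dataset objects directly, with no name lookup step.
loaded = [ds.load() for ds in datasets]
```

With something along these lines, both iterating over dataset objects and fetching one by name would go through public methods only.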

Context

In the current implementation, one can only iterate over dataset names obtained from catalog.list(), which forces users to call the private _get_dataset() method to retrieve each dataset by name.
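
The pattern described above looks roughly like the following. ToyCatalog is a hypothetical stand-in that only mirrors the two method names mentioned here (list() and _get_dataset()), so the snippet runs without a Kedro project; the real signatures may differ.

```python
from typing import Any, Dict, List


class ToyCatalog:
    """Hypothetical stand-in mirroring the method names from this issue."""

    def __init__(self, datasets: Dict[str, Any]) -> None:
        self._datasets = dict(datasets)

    def list(self) -> List[str]:
        # Only the dataset names are exposed publicly.
        return list(self._datasets)

    def _get_dataset(self, name: str) -> Any:
        # Private by convention, yet the only way to reach the object.
        return self._datasets[name]


catalog = ToyCatalog({"cars": "cars-dataset", "boats": "boats-dataset"})

# Current workaround: iterate over names, then reach into a private method.
pairs = [(name, catalog._get_dataset(name)) for name in catalog.list()]
```

The call to a leading-underscore method from user code is the part this issue aims to make unnecessary.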


"I want to get a catalog from a Kedro project and then I want to iterate through the datasets or even just fetch one dataset by its name."

See Miro for user context: https://miro.com/app/board/uXjVN2JuRF0=/?moveToWidget=3458764593622844127&cot=14
See dovetail for user interview: https://mckinsey.dovetail.com/data/2UOzkqe9cGAVh7kpHQrNqX#:v:h=uwQEu12hu5NpE3G1KyA9R&s=1

@astrojuanlu
Member

There should be one-- and preferably only one --obvious way to do it.

@ElenaKhaustova
Contributor Author

Solved in #4218

Projects
Status: Done

6 participants
@astrojuanlu @noklam @merelcht @ankatiyar @ElenaKhaustova and others