Include cardData in list_models and list_datasets #639

muellerzr · 2022-02-01T19:56:14Z

To support listing carbon emissions, as well as any other metadata returned when cardData is True, this PR adds cardData as a parameter to both list_models and list_datasets.

Note: By default this is True because, timing-wise, I did not see a difference with vs. without cardData, and returning the metadata for each model and dataset only makes each one easier to understand, IMO.

cc @julien-c @LysandreJik

LysandreJik

Thanks for your PR!

Would it be possible to add a bit of docs regarding how this attribute can be leveraged, for example with the co2_eq_emissions?

https://huggingface.co/docs/hub/searching-the-hub

I observed the same, putting cardData to True or False does not change the time required to list all models, so it's fine for me to put it enabled by default.
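As a rough illustration of how the returned metadata might be used with co2_eq_emissions, here is a minimal sketch that ranks records by reported emissions. It works on plain dicts standing in for API results; the helper names and the sample records are hypothetical, and the assumption that the field is either a bare number or a mapping with an "emissions" key is mine, not something this PR specifies.

```python
from typing import Any, Dict, List, Optional


def emissions_of(card_data: Optional[Dict[str, Any]]) -> Optional[float]:
    """Return the reported CO2-equivalent emissions, or None if absent."""
    if not card_data:
        return None
    value = card_data.get("co2_eq_emissions")
    # Assumption: the field is either a bare number or a mapping with an
    # "emissions" key; anything else is treated as "not reported".
    if isinstance(value, dict):
        value = value.get("emissions")
    if isinstance(value, bool) or not isinstance(value, (int, float)):
        return None
    return float(value)


def models_with_emissions(models: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Keep only records that report emissions, lowest first."""
    reported = [(emissions_of(m.get("cardData")), m) for m in models]
    kept = [(e, m) for e, m in reported if e is not None]
    return [m for e, m in sorted(kept, key=lambda pair: pair[0])]


# Hand-written stand-ins for API results; these are not real Hub entries.
sample = [
    {"modelId": "example/a", "cardData": {"co2_eq_emissions": 12.5}},
    {"modelId": "example/b", "cardData": None},
    {"modelId": "example/c", "cardData": {"co2_eq_emissions": {"emissions": 3}}},
]
ranked = [m["modelId"] for m in models_with_emissions(sample)]
```

Records without card data, or without a usable emissions value, are dropped rather than treated as zero, so missing metadata cannot look like a low-emission model.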

tests/test_hf_api.py

julien-c · 2022-02-02T16:38:51Z

> I observed the same, putting cardData to True or False does not change the time required to list all models, so it's fine for me to put it enabled by default.

Can you compare the underlying API calls? I think you're not seeing much of a difference because we have good bandwidth, but the responses from /api/models would have very different content sizes, IMO.
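One way to make that comparison is to measure response body sizes directly instead of wall-clock time. The sketch below is only an assumption about how it could be done: the host, the `limit` and `cardData` query parameter spellings, and the helper names are not confirmed by this thread, and the fetch function is injectable so the logic can be exercised without network access.

```python
from typing import Callable, Dict
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: the /api/models route mentioned above is served from this host.
API_URL = "https://huggingface.co/api/models"


def fetch_bytes(url: str) -> bytes:
    """Download the raw response body for a URL."""
    with urlopen(url, timeout=30) as response:
        return response.read()


def payload_sizes(
    limit: int = 100, fetch: Callable[[str], bytes] = fetch_bytes
) -> Dict[str, int]:
    """Return response sizes in bytes without and with cardData."""
    # Assumption: "limit" and "cardData" are accepted as query parameters.
    plain_url = f"{API_URL}?{urlencode({'limit': limit})}"
    card_url = f"{API_URL}?{urlencode({'limit': limit, 'cardData': 'true'})}"
    return {
        "without_cardData": len(fetch(plain_url)),
        "with_cardData": len(fetch(card_url)),
    }
```

Calling `payload_sizes()` with the default fetcher performs two real HTTP requests; passing a stub for `fetch` keeps the comparison logic testable offline.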

src/huggingface_hub/hf_api.py

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

muellerzr · 2022-02-04T17:33:22Z

@julien-c @LysandreJik should be good now.

I made cardData default to None, and the docstring now states explicitly what extra data you can get back with it.
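The opt-in behavior can be sketched roughly as follows. `build_list_params` is a hypothetical helper written for illustration, not the code in hf_api.py; it only shows the idea that the cardData key is sent when the caller explicitly asks for it.

```python
from typing import Any, Dict, Optional


def build_list_params(
    full: Optional[bool] = None, cardData: Optional[bool] = None
) -> Dict[str, Any]:
    """Hypothetical helper: build query params for a listing request."""
    params: Dict[str, Any] = {}
    if full:
        params["full"] = True
    # Opt-in: the key is only sent when the caller explicitly asks for it,
    # so None and False both leave the request unchanged.
    if cardData:
        params["cardData"] = True
    return params
```

With a None default, existing callers keep the smaller response, and only callers that pass `cardData=True` pay for the extra metadata.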

julien-c · 2022-02-04T18:18:39Z

lgtm!

LysandreJik

Other than the tests, LGTM

tests/test_hf_api.py

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

muellerzr · 2022-02-04T22:50:19Z

Merging PR. For transparency:

        self.assertTrue(all([not hasattr(dataset, "cardData") for dataset in datasets]))

This will always fail because DatasetResult is hard-coded to set cardData to None, even if it's not present.

@LysandreJik let me know if you'd like me to clean that up a bit with some setattr calls instead of hard-coded inputs.
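To illustrate why the hasattr-based assertion cannot pass, here is a small sketch with two hypothetical stand-in classes (neither is the library's actual result class): one hard-codes cardData with a None default, the other sets only the attributes it receives.

```python
class HardCodedResult:
    """Hypothetical stand-in: cardData is always set, defaulting to None."""

    def __init__(self, id: str, cardData=None, **kwargs):
        self.id = id
        self.cardData = cardData
        for key, value in kwargs.items():
            setattr(self, key, value)


class DynamicResult:
    """Hypothetical alternative: only attributes that were passed are set."""

    def __init__(self, **kwargs):
        for key, value in kwargs.items():
            setattr(self, key, value)


hard_coded = HardCodedResult(id="example/dataset")
dynamic = DynamicResult(id="example/dataset")

# hasattr is True even though no card data came back, so
# `not hasattr(...)` can never hold for the hard-coded class.
hard_coded_has_attr = hasattr(hard_coded, "cardData")
dynamic_has_attr = hasattr(dynamic, "cardData")

# A check that works with the hard-coded default compares against None.
card_data_missing = getattr(hard_coded, "cardData", None) is None
```

Under the hard-coded design, a test would need to assert that cardData is None rather than that the attribute is absent.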

Include cardData

bf81df9

muellerzr requested a review from LysandreJik February 1, 2022 19:56

Fix tests

219e36b

LysandreJik approved these changes Feb 2, 2022

Black version

58857ca

LysandreJik reviewed Feb 3, 2022

muellerzr and others added 3 commits February 4, 2022 12:24

Make it opt-in

5ffdbc9

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

Keep as optional, improve datasets

b5e6c52

Merge branch 'main' into co2

0f91915

Add headers

8160410

LysandreJik approved these changes Feb 4, 2022

muellerzr and others added 2 commits February 4, 2022 16:35

Adjust tests

22e0b08

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

Fix tests and functionality

1c6a2f0

muellerzr merged commit c3e07e3 into main Feb 4, 2022

muellerzr deleted the co2 branch February 4, 2022 23:03

muellerzr restored the co2 branch February 9, 2022 16:51
