Improve Mixin #34
Conversation
LGTM
setup.py (Outdated)
extras["testing"] = [ | ||
"pytest", | ||
] | ||
] + extras["torch"] |
is there value in having a second CI setup with no torch? (to make sure we don't introduce a dependency on torch inadvertently down the road)
Yes, I can add that
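As a rough sketch of what the packaging side of that could look like (the extra names other than `torch` and `testing` are hypothetical, not taken from this PR), the test dependencies could be kept in a torch-free list so that a second CI job can install only the base test requirements:

```python
# Hypothetical layout for the extras in setup.py; illustrative only.
extras = {}
extras["torch"] = ["torch"]

# Base test dependencies that must not pull in torch.
base_testing = ["pytest"]

extras["testing"] = base_testing + extras["torch"]  # default CI job, with torch
extras["testing-no-torch"] = base_testing           # hypothetical torch-free CI job
```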
```python
if isinstance(use_auth_token, str):
    huggingface_token = use_auth_token
elif use_auth_token is None and repo_url is not None:
    # If the repo url exists, then no need for a token
    huggingface_token = None
else:
    huggingface_token = HfFolder.get_token()
```
Suggested change:

```diff
 if isinstance(use_auth_token, str):
     huggingface_token = use_auth_token
-elif use_auth_token is None and repo_url is not None:
-    # If the repo url exists, then no need for a token
-    huggingface_token = None
+elif use_auth_token:
+    huggingface_token = HfFolder.get_token()
 else:
-    huggingface_token = HfFolder.get_token()
+    huggingface_token = None
```
Let's use the same logic as elsewhere, no?
The issue with using the same logic as elsewhere is that the API becomes awkward to use in order to cover a rare use case. Namely, the following will never work:

`model.push_to_hub("xxx")`

It needs to be the following:

`model.push_to_hub("xxx", use_auth_token=True)`

I fail to see when a user would want to push to the hub without having an auth token, as a token is necessary to create a repo. If the repo already exists and one wants to push to it, then the user already has to specify the `repo_url` to push to.

I think the API is cleaner by having `model.push_to_hub("xxx")` work if you already have an authentication token in your HF folder.
Ok
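For reference, here is a small standalone sketch of how the token resolution discussed above behaves for the typical calls; the helper name `resolve_token` is made up for illustration, only `HfFolder.get_token()` is the real API:

```python
from huggingface_hub import HfFolder  # reads the token cached by `huggingface-cli login`


def resolve_token(use_auth_token=None, repo_url=None):
    # Illustrative helper, not the library's actual function.
    if isinstance(use_auth_token, str):
        return use_auth_token          # an explicit token always wins
    if use_auth_token is None and repo_url is not None:
        return None                    # repo already exists, no token needed
    return HfFolder.get_token()        # fall back to the cached token


# model.push_to_hub("xxx")                                 -> cached token
# model.push_to_hub("xxx", repo_url="https://...")         -> None
# model.push_to_hub("xxx", use_auth_token="<token>")       -> "<token>"
```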
```python
files = os.listdir(f"{WORKING_REPO_DIR}/{REPO_NAME}")
self.assertTrue("config.json" in files)
self.assertTrue("pytorch_model.bin" in files)
self.assertEqual(len(files), 2)
```
nice
```python
)
self.assertTrue(model.config == {"num": 10, "act": "gelu_fast"})
```
`self.assertEqual` should work, no?
Indeed!
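Presumably the assertion then becomes something along these lines, based on the values shown in the diff above:

```python
self.assertEqual(model.config, {"num": 10, "act": "gelu_fast"})
```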
Improves the Mixin in the hub by adding support for `api_endpoint` and `use_auth_token`, as well as `git_user` and `git_email`.

It also updates the tests so that they run in the staging environment. In doing so, I found a bug related to absolute paths in `from_pretrained`; this fixes that at the same time.
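To illustrate the new options, a call could then look roughly like the following; the parameter names come from the description above, but the exact signature and values are assumptions, not confirmed by the PR:

```python
# Hypothetical usage of the extended mixin; the exact signature may differ.
model.push_to_hub(
    "my-model",
    api_endpoint="https://huggingface.co",  # assumed default; a staging endpoint could be passed here
    use_auth_token=True,                    # or an explicit token string
    git_user="my-username",
    git_email="me@example.com",
)
```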