[DOC] Improve pipeline() docstrings for config and tokenizer #8123

BramVanroy · 2020-10-28T16:31:31Z

As currently written, the docstrings did not make clear to me which arguments are needed when using a non-default model in pipeline(). They seemed to imply that, when you provide a non-default model, you still need to set the config and tokenizer manually because otherwise the "task's default will be used". In practice, though, the pipeline is smart enough to automatically choose the right config/tokenizer for the given model. This PR clarifies that in the docstrings/documentation by explaining exactly which priorities are used when loading the tokenizer. A small change was made for config, too.

Admittedly, the wording for the tokenizer part is a bit off (programmatical, even), but I think it should make clear how the right tokenizer is loaded.
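To make the priority idea concrete, here is a minimal sketch of one plausible resolution order. It is not the library's code. It assumes that an explicitly passed tokenizer wins, then the model name (if given as a string), then the config name (if given as a string), and only then the task default. The function name `resolve_tokenizer_name`, the `TASK_DEFAULTS` table, and the checkpoint name in it are hypothetical and exist only for illustration.

```python
from typing import Any, Optional

# Hypothetical per-task defaults, for illustration only (not real checkpoint names).
TASK_DEFAULTS = {"sentiment-analysis": "default-sentiment-checkpoint"}


def resolve_tokenizer_name(
    task: str,
    model: Optional[Any] = None,
    config: Optional[Any] = None,
    tokenizer: Optional[Any] = None,
) -> Any:
    """Return the tokenizer (or the name to load it from), by assumed priority."""
    # 1. An explicitly provided tokenizer always wins.
    if tokenizer is not None:
        return tokenizer
    # 2. Otherwise fall back to the model, but only if it was given by name.
    if isinstance(model, str):
        return model
    # 3. Otherwise fall back to the config, again only if given by name.
    if isinstance(config, str):
        return config
    # 4. Only when nothing else identifies a checkpoint, use the task default.
    return TASK_DEFAULTS[task]
```

Under these assumptions, passing only `model="my-model"` resolves the tokenizer to `"my-model"` rather than to the task default, which is the behavior the clarified docstring is meant to convey.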

cc @sgugger

sgugger

Thanks for fixing!

src/transformers/pipelines.py

BramVanroy · 2020-10-28T17:01:57Z

@sgugger I made the change as you requested. Not sure why CI is failing on build_doc. Seems to have to do with some env installation.

sgugger · 2020-10-28T17:27:14Z

The failure is spurious (basically, the new version of PyTorch is not cached on the CI, and the download sometimes fails). Thanks for the fix!


Improve pipeline() docstrings

3438757

BramVanroy requested a review from sgugger October 28, 2020 16:31

sgugger approved these changes Oct 28, 2020


Bram Vanroy added 2 commits October 28, 2020 17:52

make style

533df33

Update wording for config

37d4004

sgugger merged commit 5193172 into huggingface:master Oct 28, 2020

BramVanroy deleted the patch-2 branch October 28, 2020 19:47

fabiocapsouza pushed a commit to fabiocapsouza/transformers that referenced this pull request Nov 15, 2020

[DOC] Improve pipeline() docstrings for config and tokenizer (hugging…

51f282b

…face#8123) * Improve pipeline() docstrings * make style * Update wording for config

fabiocapsouza added a commit to fabiocapsouza/transformers that referenced this pull request Nov 15, 2020

Revert "[DOC] Improve pipeline() docstrings for config and tokenizer (h…

484cf72

…uggingface#8123)" This reverts commit 51f282b.
