Prompt tuning: Allow to pass additional args to AutoTokenizer.from_pretrained #1053
Fixes #1032
Description
Currently, when using prompt tuning with `TEXT`, we call `AutoTokenizer.from_pretrained` with only the model id. However, it may be necessary to pass additional arguments, e.g. `trust_remote_code=True`. This fix allows passing more arguments by setting the argument `tokenizer_kwargs` in the `PromptTuningConfig`.

I also added a check that when `tokenizer_kwargs` is set, the `TEXT` option is actually being used.

Moreover, I noticed that we have no tests for prompt tuning with `TEXT`, so I added those tests for decoder models.

Additional changes

There was a bug in `PromptEmbedding` where the device of the `init_token_ids` was not set, which resulted in errors when using CUDA.

Finally, I removed an unused constant `CONFIG_CLASSES` from a test.
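A minimal, self-contained sketch of how the new argument and its validation check could fit together (`PromptTuningConfigSketch` and its fields are illustrative stand-ins for this example, not the actual PEFT implementation):

```python
from dataclasses import dataclass, field
from enum import Enum


class PromptTuningInit(str, Enum):
    TEXT = "TEXT"
    RANDOM = "RANDOM"


@dataclass
class PromptTuningConfigSketch:
    prompt_tuning_init: PromptTuningInit = PromptTuningInit.RANDOM
    tokenizer_name_or_path: str = None
    tokenizer_kwargs: dict = field(default_factory=dict)

    def __post_init__(self):
        # The check described above: tokenizer_kwargs is only meaningful when
        # a tokenizer is actually loaded, i.e. with TEXT initialization.
        if self.tokenizer_kwargs and self.prompt_tuning_init != PromptTuningInit.TEXT:
            raise ValueError(
                "tokenizer_kwargs is only valid when prompt_tuning_init is 'TEXT'"
            )
```

With a config like this, the extra kwargs would then be forwarded when loading the tokenizer, along the lines of `AutoTokenizer.from_pretrained(tokenizer_name_or_path, **tokenizer_kwargs)`, instead of passing only the model id.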