About a small bug in prompt_tuning.py #1032

Closed
duxiaoyu555 opened this issue Oct 17, 2023 · 0 comments · Fixed by #1053
Comments


duxiaoyu555 commented Oct 17, 2023

System Info

peft==0.5.0
python==3.9
transformers==4.33.1

Who can help?

No response

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder
  • My own task or dataset (give details below)

Reproduction

from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import PromptTuningConfig, get_peft_model, TaskType, PromptTuningInit
import torch

tokenizer = AutoTokenizer.from_pretrained("/upp/xgen/xgen-7b-8k-base", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("/upp/xgen/xgen-7b-8k-base", torch_dtype=torch.bfloat16, trust_remote_code=True)

# Prompt init text: "下面是一段人与机器人的对话。"
# ("Below is a conversation between a human and a robot.")
init_text = "下面是一段人与机器人的对话。"
config = PromptTuningConfig(
    task_type=TaskType.CAUSAL_LM,
    prompt_tuning_init=PromptTuningInit.TEXT,
    prompt_tuning_init_text=init_text,
    num_virtual_tokens=len(tokenizer(init_text)["input_ids"]),
    tokenizer_name_or_path="xxxxx",  # (local file)
)

model = get_peft_model(model, config)

Expected behavior

I have a suggestion about the get_peft_model method: inside this function, the PromptEmbedding class in prompt_tuning.py calls tokenizer = AutoTokenizer.from_pretrained(config.tokenizer_name_or_path) at line 112, and that call should also accept an argument such as trust_remote_code=True.
Because it does not, I ran into the error "Tokenizer class xxxx does not exist or is not currently imported."
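For concreteness, a minimal sketch of the change being suggested (my hand-written illustration, not the actual patch; hardcoding trust_remote_code=True is just the simplest possible fix):

# Sketch of prompt_tuning.py, inside PromptEmbedding.__init__, where `config`
# is the PromptTuningConfig passed to get_peft_model:
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained(
    config.tokenizer_name_or_path,
    trust_remote_code=True,  # hypothetical hardcoded workaround for tokenizers shipping custom code
)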

BenjaminBossan added a commit to BenjaminBossan/peft that referenced this issue Oct 25, 2023
Fixes huggingface#1032

Description

Currently, when using prompt tuning with TEXT, we call
AutoTokenizer.from_pretrained with only the model id. However, it may be
necessary to pass additional arguments, e.g. trust_remote_code=True.
This fix makes it possible to pass extra arguments by setting
tokenizer_kwargs in the PromptTuningConfig.
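For illustration, a minimal sketch of how the new option is used (assuming a peft version that includes this fix; the model path, init text, and token count are placeholders):

from peft import PromptTuningConfig, PromptTuningInit, TaskType

config = PromptTuningConfig(
    task_type=TaskType.CAUSAL_LM,
    prompt_tuning_init=PromptTuningInit.TEXT,
    prompt_tuning_init_text="Below is a conversation between a human and a robot.",
    num_virtual_tokens=8,
    tokenizer_name_or_path="/upp/xgen/xgen-7b-8k-base",
    # Forwarded to AutoTokenizer.from_pretrained when initializing from TEXT:
    tokenizer_kwargs={"trust_remote_code": True},
)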

I also added a check that when tokenizer_kwargs is set, the TEXT option
is actually being used.

Moreover, I noticed that we have no tests for prompt tuning with TEXT,
so I added those tests for decoder models.

Additional changes

There was a bug in PromptEmbedding where the device of the
init_token_ids was not set, which resulted in errors when using CUDA.
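Roughly, that fix amounts to something like the following sketch (my paraphrase of the change, assuming word_embeddings is the model's input embedding module; the exact code in the PR may differ):

import torch

# Sketch, inside PromptEmbedding.__init__: make sure the init token ids live on
# the same device as the embedding weights before the embedding lookup.
init_token_ids = torch.LongTensor(init_token_ids).to(word_embeddings.weight.device)
word_embedding_weights = word_embeddings(init_token_ids).detach().clone()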

Finally, I removed an unused constant CONFIG_CLASSES from a test.
pacman100 pushed a commit that referenced this issue Nov 14, 2023
Fixes #1032