You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
Traceback (most recent call last):
File "/workspace/mamba-chat/xdan-chat.py", line 12, in
model = MambaLMHeadModel.from_pretrained(model_path, device="cuda", dtype=torch.float16)
File "/root/miniconda3/envs/axo/lib/python3.10/site-packages/mamba_ssm/models/mixer_seq_simple.py", line 231, in from_pretrained
model = cls(**config, device=device, dtype=dtype, **kwargs)
File "/root/miniconda3/envs/axo/lib/python3.10/site-packages/mamba_ssm/models/mixer_seq_simple.py", line 190, in init
self.backbone = MixerModel(
TypeError: MixerModel.init() got an unexpected keyword argument 'bos_token_id'
The text was updated successfully, but these errors were encountered:
It looks like some huggingface-specific arguments (bos_token_id) might be passed to the model automatically, but there's very little context here to understand what happens exactly. Could you maybe share some more code, specifically how you initialize the model?
I train the model via axolot .
Heres the chat.py error:
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
Traceback (most recent call last):
File "/workspace/mamba-chat/xdan-chat.py", line 12, in
model = MambaLMHeadModel.from_pretrained(model_path, device="cuda", dtype=torch.float16)
File "/root/miniconda3/envs/axo/lib/python3.10/site-packages/mamba_ssm/models/mixer_seq_simple.py", line 231, in from_pretrained
model = cls(**config, device=device, dtype=dtype, **kwargs)
File "/root/miniconda3/envs/axo/lib/python3.10/site-packages/mamba_ssm/models/mixer_seq_simple.py", line 190, in init
self.backbone = MixerModel(
TypeError: MixerModel.init() got an unexpected keyword argument 'bos_token_id'
The text was updated successfully, but these errors were encountered: