
Question about whether a pretrained LLaMA model is applicable to the LLaMA-Adapter model #134

Open · jzssz opened this issue on Nov 24, 2023 · 1 comment

jzssz commented Nov 24, 2023

  1. I noticed that the pretrained LLaMA model used in the code is in .pth format. If I have a pretrained LLaMA model in .bin format, can I use it as the LLaMA base model to train LLaMA-Adapter V2 (i.e. pass it to "--llama_path")?

  2. If I have a LLaMA (7B) model fine-tuned with LoRA (which produces a lightweight file such as "adapter_model.bin"), can I use it as the LLaMA base model to train LLaMA-Adapter V2 (i.e. pass it to "--llama_path")?

Thanks a lot.

csuhan (Collaborator) commented Nov 30, 2023

I guess your checkpoint is in the transformers format, while our code only supports the original LLaMA format (https://github.com/facebookresearch/llama). So you need to download the original LLaMA weights instead of the transformers-format ones.
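For readers unsure which format their checkpoint is in, here is a minimal sketch (not from the original thread; the file names are placeholders) that loads a checkpoint and inspects its key names. Original-format LLaMA weights use keys like "layers.0.attention.wq.weight", while transformers-format weights use keys like "model.layers.0.self_attn.q_proj.weight":

```python
import torch

# Placeholder path: e.g. "consolidated.00.pth" (original format)
# or "pytorch_model.bin" / "adapter_model.bin" (transformers / LoRA adapter).
ckpt_path = "consolidated.00.pth"
state_dict = torch.load(ckpt_path, map_location="cpu")

# Some checkpoints nest the weights under a "state_dict" key.
if isinstance(state_dict, dict) and "state_dict" in state_dict:
    state_dict = state_dict["state_dict"]

keys = list(state_dict.keys())
print("first few keys:", keys[:5])

if any(k.startswith("model.layers.") or ".self_attn." in k for k in keys):
    print("Looks like a transformers-format checkpoint; "
          "LLaMA-Adapter expects the original LLaMA format.")
elif any(k.startswith("layers.") and ".attention." in k for k in keys):
    print("Looks like an original-format LLaMA checkpoint (consolidated.*.pth); "
          "this should work with --llama_path.")
else:
    print("Unrecognized key layout; compare against the original LLaMA release.")
```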
