
Bugs when fine-tuning llama-2-7b with instructions using llama-2's conversation template #2871

Closed
YJiangcm opened this issue Dec 29, 2023 · 0 comments · Fixed by #2996 or #3006

Thanks for your great work! I ran into some problems when using `fastchat/train/train.py` to fine-tune llama-2-7b with llama-2's conversation template.

I changed `get_conversation_template("vicuna")` to `get_conversation_template("llama-2")` and deleted the `assert conv.sep_style == SeparatorStyle.ADD_COLON_TWO` line (see the sketch below). However, a tokenization mismatch warning was reported, and the training loss was always 0.
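
For reference, a minimal sketch of the edit described above, assuming FastChat's `get_conversation_template` helper and the `SeparatorStyle` enum from `fastchat.conversation` (exact details may differ between versions):

```python
# Sketch of the edit to preprocess() in fastchat/train/train.py;
# assumes FastChat's public helpers, which may differ across versions.
from fastchat.model import get_conversation_template
from fastchat.conversation import SeparatorStyle

# was: conv = get_conversation_template("vicuna")
conv = get_conversation_template("llama-2")

# The old check no longer holds for llama-2, so it was deleted:
# assert conv.sep_style == SeparatorStyle.ADD_COLON_TWO
assert conv.sep_style == SeparatorStyle.LLAMA2  # llama-2's separator style
```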

```
WARNING: tokenization mismatch: 78 vs. 80.
#turn = 1. (ignored)
```
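
If I read the "(ignored)" correctly, `preprocess()` masks the whole example with `IGNORE_TOKEN_ID` whenever the per-turn token counts disagree with the full-prompt tokenization, which would explain the constant zero loss. Below is a hedged sketch of why the counts can stop adding up with llama-2; the model id and example strings are illustrative, not taken from `train.py`:

```python
# Illustrative repro: Llama's tokenizer prepends a BOS token to every call
# (and merges leading spaces), so tokenizing a conversation turn by turn
# does not sum to tokenizing the whole prompt in one call.
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")

prompt = "[INST] Hello! [/INST]"
answer = " Hi, how can I help? </s>"

whole = tok(prompt + answer).input_ids
parts = tok(prompt).input_ids + tok(answer).input_ids

# `parts` carries an extra BOS from the second tokenizer call, so the two
# lengths differ -- the same kind of mismatch (78 vs. 80) that makes the
# training code mask the entire example and drive the loss to 0.
print(len(whole), len(parts))
```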

Could you please tell me how to adapt the code to llama-2's conversation template? Thanks a lot!
