
LLaVA image_token_index is not 64000 but 64002 #29836

Closed
XuweiyiChen opened this issue Mar 24, 2024 · 1 comment · Fixed by #29797
Labels
bug · Multimodal · Should Fix (this has been identified as a bug and should be fixed)

Comments

@XuweiyiChen

System Info

LLaVA image_token_index is not 64000 but 64002 in the latest version of the code (main branch).

Who can help?

No response

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

  1. pip install the latest transformers from the main branch
  2. Run the LLaVA 34B model.

Expected behavior

Inference should run normally, but instead an error is raised saying there are 0 image tokens.
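The "0 image tokens" error arises when the token id the tokenizer emits for the `<image>` placeholder does not match the `image_token_index` in the model config, so the model finds no placeholder to replace with vision features. A minimal sketch of that mismatch, using illustrative token ids (64000/64002 as in this report; `count_image_tokens` is a hypothetical helper, not the actual transformers internals):

```python
def count_image_tokens(input_ids, image_token_index):
    """Count placeholder tokens the model would replace with vision features."""
    return sum(1 for t in input_ids if t == image_token_index)

# Suppose the tokenizer emits 64000 for "<image>" (ids here are illustrative).
input_ids = [1, 64000, 523, 98, 2]

# With the config mistakenly set to 64002, zero placeholders are found,
# which triggers the "0 image tokens" error described above.
print(count_image_tokens(input_ids, 64002))  # 0
print(count_image_tokens(input_ids, 64000))  # 1
```

Aligning the config's `image_token_index` with the tokenizer's actual `<image>` id (what the linked fix does on the checkpoint side) makes the count nonzero and inference proceed.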

@NielsRogge (Contributor)

Hi,

Thanks for reporting; this is also reported here: https://huggingface.co/llava-hf/llava-v1.6-34b-hf/discussions/2 and will be resolved by #29797.
