
Inf Loss Problem When Training #30

Closed
nreHieW opened this issue Jul 19, 2024 · 1 comment

Comments


nreHieW commented Jul 19, 2024

In lines 1628-1629 of transformers/src/transformers/models/chameleon/modeling_chameleon.py:

# Pin every image-token logit to the dtype minimum so they are never sampled
image_tokens = self.model.vocabulary_mapping.image_tokens
logits[:, :, image_tokens] = torch.finfo(logits.dtype).min

My understanding is that this masking exists because the original Chameleon release intentionally suppressed image-token generation. But keeping it in during training leads to an inf loss: any label position that is an image token gets a log-probability near the dtype minimum, so its cross-entropy term is on the order of finfo.max and the loss reduction overflows.
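A minimal sketch of the failure mode (toy vocabulary size and token ids, made up for illustration, not Chameleon's real ones):

```python
import torch
import torch.nn.functional as F

# Toy stand-in for the masking above: vocab of 8, with ids 5-7
# playing the role of image tokens.
vocab_size = 8
image_tokens = torch.tensor([5, 6, 7])
logits = torch.randn(2, vocab_size)                      # two label positions
logits[:, image_tokens] = torch.finfo(logits.dtype).min  # same masking as above

# If the training labels contain image tokens, each masked position
# contributes a loss term near finfo.max, and summing them in the
# reduction overflows float32.
labels = torch.tensor([5, 6])
print(F.cross_entropy(logits, labels))  # tensor(inf)
```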

Is there an updated version of the code?

nreHieW closed this as completed Jul 19, 2024
leloykun commented

Hi! You can either use the version in the transformers folder or this PR of mine to the main Transformers library: huggingface/transformers#32013
