🌋 Add support for LLaVA-Next in `DPOTrainer` #2413

chenweize1998 · 2024-11-29T03:44:10Z

What does this PR do?

This PR provides a quick fix for issue #2403. Previously, DPOTrainer did not support LLaVA-Next because the required image_sizes parameter from the LLaVA-Next forward function was being removed during data processing within DPOTrainer. This update modifies DPOTrainer to retain the image_sizes parameter if it is returned by the image processor and passes it to the model when present.

While this fix resolves the issue on my end - I successfully ran DPOTrainer with LLaVA-Next - but it has not been extensively tested. I would appreciate assistance or guidance on the next steps to ensure broader compatibility and robustness.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a GitHub issue? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines.
Did you write any new necessary tests?

Who can review?

Maybe @qgallouedec

qgallouedec · 2024-11-29T11:26:21Z

Great! Thanks @chenweize1998! Can you try to uncomment

trl/tests/test_dpo_trainer.py

Line 1142 in ac26778

# ("trl-internal-testing/tiny-LlavaNextForConditionalGeneration",),

trl/trainer/dpo_trainer.py

Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>

HuggingFaceDocBuilderDev · 2024-11-29T12:51:19Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

qgallouedec

Thanks @chenweize1998!

qgallouedec · 2024-11-29T14:45:01Z

As expected, "Tests / Tests with dev dependencies" will fail until huggingface/transformers#34953 is merged. We can safely ignore this failing test.

add support for llava-next in dpotrainer

e729ef9

chenweize1998 changed the title ~~add support for llava-next in dpotrainer~~ add support for LLaVA-Next in DPOTrainer Nov 29, 2024

qgallouedec reviewed Nov 29, 2024

View reviewed changes

trl/trainer/dpo_trainer.py Outdated Show resolved Hide resolved

1rubbishyuan and others added 2 commits November 29, 2024 20:46

enable unit test

e6f18b0

code style

27b6e7a

Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>

Merge branch 'main' into llava-dpo-trainer

690270a

qgallouedec mentioned this pull request Nov 29, 2024

fix variable undefined bug when return_tensors is not specified in llava processing huggingface/transformers#34953

Merged

5 tasks

Ignore last layer in test

5122347

qgallouedec changed the title ~~add support for LLaVA-Next in DPOTrainer~~ 🌋 Add support for LLaVA-Next in DPOTrainer Nov 29, 2024

qgallouedec approved these changes Nov 29, 2024

View reviewed changes

qgallouedec merged commit 8d9cfaa into huggingface:main Nov 29, 2024
12 of 13 checks passed

qgallouedec mentioned this pull request Nov 30, 2024

🏎 Fix deepspeed preparation of ref_model in OnlineDPOTrainer #2417

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🌋 Add support for LLaVA-Next in `DPOTrainer` #2413

🌋 Add support for LLaVA-Next in `DPOTrainer` #2413

chenweize1998 commented Nov 29, 2024

qgallouedec commented Nov 29, 2024

HuggingFaceDocBuilderDev commented Nov 29, 2024

qgallouedec left a comment

qgallouedec commented Nov 29, 2024

🌋 Add support for LLaVA-Next in DPOTrainer #2413

🌋 Add support for LLaVA-Next in DPOTrainer #2413

Conversation

chenweize1998 commented Nov 29, 2024

What does this PR do?

Before submitting

Who can review?

qgallouedec commented Nov 29, 2024

HuggingFaceDocBuilderDev commented Nov 29, 2024

qgallouedec left a comment

Choose a reason for hiding this comment

qgallouedec commented Nov 29, 2024

🌋 Add support for LLaVA-Next in `DPOTrainer` #2413

🌋 Add support for LLaVA-Next in `DPOTrainer` #2413