
Let DPOTrainer Support padding_free #2422

Open
fzyzcjy opened this issue Dec 1, 2024 · 10 comments · May be fixed by #2437
Labels
🏋 DPO Related to DPO ✨ enhancement New feature or request 🧒 good second issue Good for contributors with basic project familiarity 🙋 help from community wanted Open invitation for community members to contribute

Comments

@fzyzcjy
Contributor

fzyzcjy commented Dec 1, 2024

Feature request

Hi, thanks for the library! It seems that https://huggingface.co/blog/packing-with-FA2 introduces a way to avoid many pad tokens in SFT, which makes training faster. It would therefore be great if the same technique could be used for DPO.

Motivation

(see above)

Your contribution

n/a

@qgallouedec
Member

Thanks @fzyzcjy! Can you elaborate a bit? What is this padding-free method?

@fzyzcjy
Contributor Author

fzyzcjy commented Dec 1, 2024

Oh sorry, I provided the wrong link; it is now updated to point to the correct "padding_free" article.

@qgallouedec
Member

Thanks for the pointer. This would be a nice addition! Any contribution is welcome. I'll mark this one as a good second issue.

@qgallouedec qgallouedec added ✨ enhancement New feature or request 🙋 help from community wanted Open invitation for community members to contribute 🧒 good second issue Good for contributors with basic project familiarity 🏋 DPO Related to DPO labels Dec 1, 2024
@qgallouedec
Member

The guideline is basically to:

  1. Update PreferenceCollator to add a padding_free option, as in add arg padding_free to DataCollatorForCompletionOnlyLM #1887
  2. Update concatenated_inputs to (a) make xxx_attention_mask optional and (b) add support for xxx_position_ids
  3. Add a test
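A minimal sketch of what steps 1 and 2 could amount to, assuming a hypothetical collate function (the actual PreferenceCollator fields and signatures in TRL may differ): instead of padding each sequence to the batch max length, all tokens are concatenated into a single packed row, and position_ids that restart at 0 for every sequence let FlashAttention-2 infer the sequence boundaries.

```python
# Hypothetical padding_free collation sketch (not the actual TRL API).
# Each example holds already-tokenized prompt and completion ids.

def collate_padding_free(examples):
    """Pack a batch into one row with per-sequence position_ids, no padding."""
    input_ids, position_ids = [], []
    for example in examples:
        seq = example["prompt_input_ids"] + example["completion_input_ids"]
        input_ids.extend(seq)
        # Positions restart at 0 for each sequence, marking its boundary.
        position_ids.extend(range(len(seq)))
    # Shape (1, total_tokens): a single packed row; no attention_mask needed.
    return {"input_ids": [input_ids], "position_ids": [position_ids]}
```

In a real implementation these lists would be torch tensors and the collator would also pack the chosen/rejected labels, but the boundary-via-position_ids idea is the same.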

@fzyzcjy
Contributor Author

fzyzcjy commented Dec 1, 2024

Thank you!

@dame-cell

  • Should padding_free in PreferenceCollator be an optional argument, or should it be enabled by default?

But why make `xxx_attention_mask` optional? Is it because padding-free sequences might not use attention masks at all?
For example:

  • In regular training with padding, attention_mask is needed to tell the model which tokens are real and which are padding (1s for real tokens, 0s for padding)
  • In padding-free training, since all padding tokens are removed, every token is a real token, so no explicit mask is needed to distinguish real tokens from padding

Does this make sense?
Thank you for your patience. I wanted to verify that I understand these concepts correctly.
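The contrast in the two bullets above can be sketched with toy helpers (hypothetical names, not TRL APIs): the padded path must emit an attention_mask with 0s over pad positions, while the packed path drops the mask entirely and encodes sequence boundaries via restarting position_ids.

```python
# Toy contrast between padded and padding-free batching (hypothetical helpers).

PAD = 0

def pad_batch(seqs):
    """Pad to max length; attention_mask marks real (1) vs pad (0) tokens."""
    max_len = max(len(s) for s in seqs)
    input_ids = [s + [PAD] * (max_len - len(s)) for s in seqs]
    attention_mask = [[1] * len(s) + [0] * (max_len - len(s)) for s in seqs]
    return {"input_ids": input_ids, "attention_mask": attention_mask}

def pack_batch(seqs):
    """Concatenate all tokens; boundaries are implied by restarting positions."""
    flat = [tok for s in seqs for tok in s]
    pos = [p for s in seqs for p in range(len(s))]
    return {"input_ids": [flat], "position_ids": [pos]}  # no attention_mask
```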

@qgallouedec
Member

I think it makes sense yes.

@dame-cell

@fzyzcjy @qgallouedec If no one is working on this, I would like to help.

@fzyzcjy
Contributor Author

fzyzcjy commented Dec 1, 2024

@dame-cell I don't have time for this at the moment, so a PR from you would be great. Many thanks!

@zwhe99

zwhe99 commented Dec 2, 2024

Is it possible for PPO to support padding_free?

@dame-cell dame-cell linked a pull request Dec 4, 2024 that will close this issue
@qgallouedec qgallouedec linked a pull request Dec 13, 2024 that will close this issue