preference loss sign is inverted and leads to negative loss #481
Conversation
Would you mind updating the corresponding tests? They failed in my CI as expected.
I think the corresponding snippet is here: Line 514 in 0bb6c72
Thank you!
```
@@ -408,7 +408,7 @@ def _compute_loss(
        else:
            preference_loss, aux_outputs = preference_loss_outputs, []

        loss = alpha * chosen_nll_loss - preference_loss
```
Seems like the original logic is correct for all losses except dpo_loss. I think the sign change should be here instead:
```python
loss = -F.logsigmoid(logits_diff).sum() / (full_target.shape[0] // 2)
```
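For intuition, here is a minimal, self-contained sketch of that sign convention (not the repo's actual `dpo_loss` implementation; the tensor values and `beta` default are made up). It shows that the negated form is non-negative on its own, so it behaves as a loss even when no NLL term is added:

```python
import torch
import torch.nn.functional as F

def dpo_preference_term(chosen_logps, rejected_logps, full_target, beta=0.1):
    # Same shape of computation as the suggested line above:
    # -logsigmoid(x) >= 0 for every x, so this term can be minimized directly.
    logits_diff = beta * (chosen_logps - rejected_logps)
    return -F.logsigmoid(logits_diff).sum() / (full_target.shape[0] // 2)

chosen_logps = torch.tensor([-12.0, -15.0])
rejected_logps = torch.tensor([-14.0, -13.0])
full_target = torch.zeros(4, 8)  # concatenated chosen + rejected rows (assumed layout)
print(dpo_preference_term(chosen_logps, rejected_logps, full_target))  # > 0
```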
Hi @shivam15s, thanks for noticing this. What are your thoughts on negating each preference loss term so it aligns with the formulas in the docstrings? That would let us keep the base preference structure as `nll_loss + preference_loss` while making both terms represent losses to be minimized.
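A rough sketch of the convention being proposed (plain numbers, names illustrative only, not the library's API): each preference term is already a loss, so the base class simply adds it to the weighted NLL term.

```python
alpha = 1.0
chosen_nll_loss = 2.3      # assumed NLL value on the chosen responses
preference_loss = 0.45     # e.g. a -logsigmoid(...) style term, always >= 0

loss = alpha * chosen_nll_loss + preference_loss
print(loss)  # 2.75; with alpha = 0 the result is still >= 0
```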
Sure @austin362667! I think that might help with readability.
If you have some time, could you add the fix to this PR?
Ya sure! I might need to open a new PR (based on this one). cc @winglian
## Summary

Thanks to @winglian and @shivam15s for noticing and fixing this in #481. This PR suggests negating the preference loss terms to align with the formulas in the docstrings, while maintaining the base preference structure as `nll_loss + preference_loss`. This makes the loss computations more consistent, since both terms then represent losses to be minimized.

[UPDATE: It now appears to be addressed [here](3205342#diff-3048cb37b97e27515852c200994f3257b8ae33a465421d05184713377c0895b1R150)]

This PR also tightens the test tolerance in case a similar issue comes up again.

## Testing Done

- Hardware Type: <BLANK>
- [X] run `make test` to ensure correctness
- [X] run `make checkstyle` to ensure code style
- [ ] run `make test-convergence` to ensure convergence

---------

Signed-off-by: Austin Liu <austin362667@gmail.com>
Co-authored-by: Wing Lian <wing@axolotl.ai>
Co-authored-by: Shivam Sahni <shivam15800@gmail.com>
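On the tolerance note in the summary above, here is a hedged illustration (the values and tensors are assumptions, not the repo's actual test settings) of what tightening a comparison tolerance looks like with PyTorch's testing utilities:

```python
import torch

expected = torch.tensor([0.4500, 1.8500])
actual = torch.tensor([0.4503, 1.8497])

# A loose bound lets larger numerical drift pass silently; a tighter
# rtol/atol makes the comparison fail as soon as results diverge meaningfully.
torch.testing.assert_close(actual, expected, rtol=1e-3, atol=1e-3)
```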
## Summary

In test cases where no alpha/NLL loss is used, the loss becomes negative, which is probably not the intended behavior. See https://github.com/huggingface/trl/blob/main/trl/trainer/dpo_trainer.py#L1234
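A small illustration of the symptom (names and values are assumed for the example, not the library's API): if `preference_loss` is already a non-negative quantity to minimize, subtracting it drives the total below zero as soon as the NLL term is disabled.

```python
def combined_loss(alpha, chosen_nll_loss, preference_loss):
    # Sign convention reported here: the preference term is subtracted.
    return alpha * chosen_nll_loss - preference_loss

print(combined_loss(alpha=1.0, chosen_nll_loss=2.3, preference_loss=0.45))  #  1.85
print(combined_loss(alpha=0.0, chosen_nll_loss=0.0, preference_loss=0.45))  # -0.45  <- negative loss
```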
- run `make test` to ensure correctness
- run `make checkstyle` to ensure code style
- run `make test-convergence` to ensure convergence