Best practice recommendation update for dpo_trainer.mdx #1325
In the document as it stands, the best-practice recommendations on merging adapters into the base model seem neither consistent nor correct.
For example, the documentation links to a tweet recommending that adapters be merged into a quantized model, and to a script that supposedly illustrates how to apply that recommendation. But the script actually does the opposite of what the tweet recommends: it first dequantizes the model and only then merges the adapter.
There are similar inconsistencies and ambiguities later in that paragraph, for example the claim that using an unquantized model would lead to lower performance (I changed it to "higher memory demand").
Overall, I updated the paragraph to improve consistency and provided links to slightly more evidence-based merging recommendations.
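For context, a minimal sketch of the dequantize-then-merge approach discussed above, using the standard `transformers`/`peft` APIs. This is not the script referenced in the docs; the model name and adapter path are hypothetical placeholders, and half precision is just one reasonable choice for the unquantized base weights.

```python
import torch
from transformers import AutoModelForCausalLM
from peft import PeftModel

# Load the base model in (un)quantized half precision rather than 4-bit/8-bit,
# so the adapter weights are merged into full-precision parameters.
# "base-model-name" is a hypothetical placeholder.
base = AutoModelForCausalLM.from_pretrained(
    "base-model-name",
    torch_dtype=torch.float16,
)

# Attach the trained adapter and fold its weights into the base model.
# "path/to/adapter" is a hypothetical placeholder.
model = PeftModel.from_pretrained(base, "path/to/adapter")
merged = model.merge_and_unload()

# Save the merged model; it can be re-quantized afterwards for inference if needed.
merged.save_pretrained("path/to/merged-model")
```

The trade-off noted in the updated paragraph applies here: merging into unquantized weights avoids the accuracy loss of merging into a quantized model, at the cost of higher memory demand during the merge.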