
Implement NEFTune Augmentation for Improved Language Model Fine-tuning #718

Closed
5 tasks done
kostum123 opened this issue Oct 12, 2023 · 4 comments
Labels
enhancement New feature or request

Comments

@kostum123

kostum123 commented Oct 12, 2023

⚠️ Please check that this feature request hasn't been suggested before.

  • I searched previous Ideas in Discussions and didn't find any similar feature requests.
  • I searched previous Issues and didn't find any similar feature requests.

🔖 Feature description

I would like to propose adding a new method, NEFTune, to the Axolotl repository. NEFTune is a recent and highly effective augmentation technique for language model fine-tuning that has shown notable performance improvements across a variety of tasks without significantly increasing training time.

Motivation:

Axolotl currently supports QLoRA, LoRA, and full fine-tuning. While these methods are effective, integrating NEFTune can further enhance the resulting models. NEFTune works by adding random noise to the embedding vectors during training, which leads to substantial performance gains.
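The core operation is simple enough to sketch directly. The function below is a minimal illustration of the noise rule described in the paper (uniform noise scaled by alpha / sqrt(L * d)); `add_neftune_noise` and its default `noise_alpha` are illustrative names here, not existing Axolotl API:

```python
import math
import torch

def add_neftune_noise(embeddings: torch.Tensor, noise_alpha: float = 5.0) -> torch.Tensor:
    """Add NEFTune-style uniform noise to token embeddings.

    embeddings: (batch, seq_len, hidden_dim) tensor of input embeddings.
    noise_alpha: the paper's scaling hyperparameter (commonly around 5-15).
    """
    seq_len, hidden_dim = embeddings.shape[1], embeddings.shape[2]
    # Per the paper, noise is sampled uniformly and scaled by
    # alpha / sqrt(L * d), so longer sequences get smaller per-element noise.
    mag_norm = noise_alpha / math.sqrt(seq_len * hidden_dim)
    noise = torch.zeros_like(embeddings).uniform_(-mag_norm, mag_norm)
    return embeddings + noise
```

Note that this noise is applied only during training; at inference the embeddings are used unmodified.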

Benefits:

  1. Performance Boost: Incorporating NEFTune can significantly improve the performance of the models in various evaluations, such as AlpacaEval.

  2. Compatibility: NEFTune is designed to be compatible with existing fine-tuning methods, including qlora, lora, and full fine-tuning, ensuring ease of implementation.

  3. Generalization: NEFTune has demonstrated the ability to improve model performance on a range of modern instruction datasets, including Evol-Instruct, ShareGPT, and OpenPlatypus.

  4. Versatility: Even powerful models that undergo further refinement, such as those using RLHF like LLaMA-2-Chat, can benefit from the addition of NEFTune during training.

Paper Reference:

NEFTune: Noisy Embeddings Improve Instruction Finetuning (Jain et al., 2023)

✔️ Solution

Official Repo of the Method:
https://github.com/neelsjain/NEFTune

Request Implementation Steps:

  1. Evaluate the feasibility of integrating NEFTune into the Axolotl repository.

  2. Implement NEFTune as an optional augmentation method in the training pipeline.

  3. Provide clear documentation and guidelines for users on how to utilize NEFTune within the Axolotl framework.

  4. Conduct thorough testing and validation to ensure that the integration does not negatively impact existing functionalities.

  5. Monitor and maintain the NEFTune feature to keep it up-to-date with any changes in the Axolotl repository.

Additional Information:

I believe that incorporating NEFTune into the Axolotl repository can elevate the capabilities of the models and make them more competitive in various natural language understanding tasks. It aligns with the goal of continuous improvement and innovation in language model training.

I appreciate your consideration of this feature request and look forward to the potential enhancement of the Axolotl repository's capabilities.

❓ Alternatives

No response

📝 Additional Context

No response

Acknowledgements

  • My issue title is concise, descriptive, and in title casing.
  • I have searched the existing issues to make sure this feature has not been requested yet.
  • I have provided enough information for the maintainers to understand and evaluate this request.
@kostum123 added the enhancement label on Oct 12, 2023
@winglian
Collaborator

Probably just needs to monkeypatch it onto this: https://github.com/huggingface/transformers/blob/main/src/transformers/models/llama/modeling_llama.py#L787
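A minimal sketch of that monkeypatch approach, using a PyTorch forward hook on the model's input embedding layer rather than editing `modeling_llama.py` directly. `attach_neftune` is a hypothetical helper, not an existing transformers or Axolotl function, and it assumes the model exposes `get_input_embeddings()` as Hugging Face models do:

```python
import math
import torch

def attach_neftune(model, noise_alpha: float = 5.0):
    """Register a forward hook that adds NEFTune noise to the input
    embeddings during training only, with no changes to the model code.

    Returns the hook handle; call .remove() on it to undo the patch.
    """
    embeddings = model.get_input_embeddings()

    def neftune_hook(module, inputs, output):
        if module.training:
            seq_len, hidden_dim = output.shape[1], output.shape[2]
            mag_norm = noise_alpha / math.sqrt(seq_len * hidden_dim)
            # Returning a tensor from a forward hook replaces the module output.
            output = output + torch.zeros_like(output).uniform_(-mag_norm, mag_norm)
        return output

    return embeddings.register_forward_hook(neftune_hook)
```

At inference time the hook is a no-op because `module.training` is False, matching the paper's train-only noise.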

@philpax
Contributor

philpax commented Nov 13, 2023

Was this closed by #721?

@enn-nafnlaus

And if it's implemented, how do we use it?
