llm-training-playground

Here are some of my notes on fine-tuning LLMs with Hugging Face's libraries (PEFT, SFT, PPO, etc.).

Finetuning Notebook Table

Notebook/file Title	Description	Model	GPU
Finetuning a Mistral 7B Model with LoRA	Training with custom dataset	Mistral-7B	RTX 4090
Fine-tune Llama-3 with SFT	Training with custom dataset	Llama 3	A6000
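
As a rough illustration of why LoRA lets models of this size train on a single GPU like the ones listed above, the sketch below compares trainable parameter counts for one linear layer. It is not taken from the scripts in this repo. The function name `lora_param_counts`, the 4096 x 4096 layer shape, and rank 8 are assumptions chosen for illustration. LoRA freezes the original weight matrix and trains two low-rank factors, so the trainable count per layer is `rank * (d_in + d_out)` instead of `d_in * d_out`.

```python
from typing import Tuple


def lora_param_counts(d_out: int, d_in: int, rank: int) -> Tuple[int, int]:
    """Return (full, lora) trainable-parameter counts for one linear layer.

    full: parameters updated when the whole d_out x d_in weight is trained.
    lora: parameters in the two low-rank factors, B (d_out x rank) and
          A (rank x d_in), which are the only trained weights under LoRA.
    """
    full = d_out * d_in
    lora = rank * (d_in + d_out)
    return full, lora


# Assumed example values: a 4096 x 4096 projection and rank 8.
full, lora = lora_param_counts(4096, 4096, 8)
print(f"full: {full:,}  LoRA: {lora:,}  ratio: {lora / full:.4%}")
# prints "full: 16,777,216  LoRA: 65,536  ratio: 0.3906%"
```

The actual savings depend on which layers get adapters and on the rank used, so treat these numbers as an order-of-magnitude guide only.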

1_finetune_mistral_LorA.py
2_finetune_llama3_LoRA.py
README.md
training_config.py