
Code Llama Fine-tuning Support #194

Merged 15 commits into smallcloudai:v1.2.0 on Nov 1, 2023

Conversation

adam-weinberger (Contributor) commented Oct 19, 2023

@JegernOUTT
This pull request adds support for fine-tuning Code Llama 7b. The biggest change is in supported_models.py, which adds the configurations for Code Llama. The second biggest adds support for additional model configurations (in finetune_train_default.py and finetune_train.py). The remaining changes are minor and mostly improve logging.
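
For context on what such a configuration involves, here is an illustrative sketch of the kind of entry supported_models.py might gain. The field names below ("model_path", "context_size", "filter_caps") are assumptions chosen for the example, not the exact schema used in the repo:

```python
# Illustrative sketch only: field names are assumed, not the repo's exact schema.
SUPPORTED_MODELS = {
    "codellama/7b": {
        "model_path": "codellama/CodeLlama-7b-hf",   # assumed Hugging Face weights id
        "context_size": 2048,                        # tokens per training sample
        "filter_caps": ["completion", "finetune"],   # capabilities exposed to the server
    },
}
```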

To test this update from the command line:

1. Update the finetune.cfg file:

```json
{
    "model_name": "codellama/7b",
    "use_heuristics": false,
    "train_steps": 75,
    "batch_size": 16,
    "save_every": 1
}
```

2. Put your code files (easiest is a .zip of the repo) in the folder ~/.refact/perm-storage/uploaded-files.
3. Run:

```bash
DEBUG=1 python -m refact.refact_data_pipeline.finetune.process_uploaded_files
DEBUG=1 python -m refact.refact_data_pipeline.finetune.finetune_filter
DEBUG=1 python -m refact.refact_data_pipeline.finetune.finetune_train
```

4. Verify the model has trained correctly by checking that "status" is "finished" in ~/.refact/perm-storage/loras/lora-YYYYmmDD-MMSSss/status.json (a verification sketch follows this list).
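
A minimal sketch of that last check, assuming only the lora-YYYYmmDD-MMSSss directory naming and the top-level "status" key described above:

```python
import glob
import json
import os

# Locate the newest fine-tune run; the lora-YYYYmmDD-MMSSss naming sorts
# chronologically, so the last entry after sorting is the most recent run.
runs = sorted(glob.glob(os.path.expanduser("~/.refact/perm-storage/loras/lora-*")))
assert runs, "no fine-tune runs found"

with open(os.path.join(runs[-1], "status.json")) as f:
    status = json.load(f)

print(runs[-1], "->", status.get("status"))
assert status.get("status") == "finished", "fine-tune did not finish"
```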

To test this update from the UI:

1. Run:

```bash
DEBUG=1 python -m self_hosting_machinery.webgui.webgui
DEBUG=1 python -m self_hosting_machinery.inference.inference_worker --model codellama/7b
```

2. On the Model Hosting page, select "codellama/7b". Click through the "Sources" and "Finetune" tabs as normal, being sure to select the "codellama/7b" model under "Start New Fine Tune". NOTE: sometimes portions of the UI are grayed out when I try to go through this workflow and I'm not always able to complete it. Is this expected? I have not touched any of the UI code.
3. Verify the model has trained correctly by looking under "Completed Runs".

@JegernOUTT JegernOUTT self-requested a review October 20, 2023 04:52
@JegernOUTT JegernOUTT self-requested a review November 1, 2023 06:56
JegernOUTT (Member) commented:

@adam-weinberger please rebase your branch onto https://github.com/smallcloudai/refact/tree/v1.2.0 and resolve all conflicts.
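
For anyone following along, the requested rebase might look like this; the remote names and branch name here are assumptions, not part of the PR:

```bash
# Assumes your fork is "origin" and smallcloudai/refact is added as "upstream".
git remote add upstream https://github.com/smallcloudai/refact.git
git fetch upstream
git rebase upstream/v1.2.0        # resolve any conflicts as they appear
git push --force-with-lease origin <your-feature-branch>
```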

@JegernOUTT JegernOUTT changed the base branch from main to v1.2.0 November 1, 2023 16:50
@JegernOUTT JegernOUTT merged commit dbb9fd9 into smallcloudai:v1.2.0 Nov 1, 2023
@klink klink linked an issue Nov 8, 2023 that may be closed by this pull request
mitya52 pushed a commit that referenced this pull request Nov 17, 2023
* Print statements for debugging and initial support for Code Llama

* Added multiple print statements for debugging fine tuning
* Added support for Code Llama 7b
* Depending on the training parameters I set I either get an out of memory GPU error or ValueError(“optimizer got an empty parameter list”)

* Code Llama fine-tuning but fails on checkpoint

* commenting print statements

* updating default config behavior

* Begin adding encoding for Code Llama

* adding BOS and EOS tokens for Code Llama, model running properly

* getting rid of #?
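
An aside on the ValueError mentioned in these commit notes: PyTorch's optimizers raise "optimizer got an empty parameter list" when the parameter iterable they receive is empty, for example when every parameter has been frozen before the optimizer is constructed. A minimal reproduction in plain PyTorch (not the actual refact training code):

```python
import torch

model = torch.nn.Linear(8, 8)
for p in model.parameters():
    p.requires_grad = False  # freeze everything (an over-aggressive config can do this)

# Only trainable parameters are handed to the optimizer, so the list is empty:
trainable = [p for p in model.parameters() if p.requires_grad]
optimizer = torch.optim.AdamW(trainable)
# -> ValueError: optimizer got an empty parameter list
```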

Successfully merging this pull request may close these issues: Add Code LLaMA.