Add training callback to send predictions to WandB table #521
Conversation
Today's progress update: https://www.loom.com/share/1d1cb34d846440e2a5258fd3593a9e80
```python
prompt_encoding = tokenizer(
    prompt_texts, padding=True, return_tensors="pt"
).to(self.cfg.device)
predictions = trainer.model.generate(
```
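For context, the truncated `generate` call presumably continues along these lines. This is a hedged sketch, not the PR's exact code; the argument values and the prompt-stripping step are illustrative assumptions:

```python
# Sketch only: assumes prompt_encoding, tokenizer, and trainer from the snippet above.
predictions = trainer.model.generate(
    input_ids=prompt_encoding.input_ids,
    attention_mask=prompt_encoding.attention_mask,
    max_new_tokens=128,  # exposed by this PR as eval_table_max_new_tokens
)
# generate() returns prompt + continuation, so drop the prompt tokens
# before decoding, leaving only the newly generated text for the table.
new_tokens = predictions[:, prompt_encoding.input_ids.shape[1]:]
prediction_texts = tokenizer.batch_decode(new_tokens, skip_special_tokens=True)
```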
`trainer.prediction_step(...)` might be easier to use
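For reference, the suggested alternative would look roughly like this. A sketch under the assumption that `batch` is an already-collated eval batch; `prediction_step` itself is a real `transformers.Trainer` method:

```python
# Sketch: reuse the Trainer's own prediction path instead of model.generate().
loss, logits, labels = trainer.prediction_step(
    trainer.model, batch, prediction_loss_only=False
)
# Note: these logits come from a single teacher-forced forward pass, so
# decoding their argmax yields per-position next-token guesses rather than a
# free-running continuation; that may explain the "strange predictions" below.
predicted_ids = logits.argmax(dim=-1)
texts = tokenizer.batch_decode(predicted_ids, skip_special_tokens=True)
```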
It was. However, I had that previously and it appeared to output strange predictions. At one point I actually had both to compare, and only `model.generate` was useful.
I'm struggling to find the WandB report. I'll try again and see how it goes.
Added both now
also, the pre-commit checks are failing, just run …
thanks!
PS. Whenever you merge, please squash the commits. Normally I'd have nice atomic/semantically sensible commits, but this PR had a ton of quick trial and error and saving WIP on remote GPU servers 😁
So how do I enable this, and where do I set the prompts I want it to run inference on?
The new options (copied from the README):

```yaml
eval_table_size: # Approximate number of predictions sent to wandb depending on batch size. Enabled above 0. Default is 0
eval_table_max_new_tokens: # Total number of tokens generated for predictions sent to wandb. Default is 128
```

For example,

```yaml
eval_table_size: 5
eval_table_max_new_tokens: 64
```

will send roughly 5 example predictions, each generated with up to 64 new tokens, to a WandB table at every evaluation.

The examples are currently extracted from the eval dataset, which is automatically set aside via:

```yaml
# How much of the dataset to set aside as evaluation. 1 = 100%, 0.50 = 50%, etc. 0 for no eval.
val_set_size: 0.04
```

In the future, a separate dataset could be used entirely, providing more control. Hope this helps! Let us know how testing goes, thanks!
I see, ok. I am hoping to have a separate dataset of prompts, mainly because AFAIK axolotl does not allow setting a separate eval dataset; the eval set just comes randomly from a % of the training dataset, and I have a very specific set of prompts, not in the training data, that I need to eval with. I also don't use normal evals at all, because I have a very specific dataset that needs every entry trained on, and I don't trust eval loss anyway, so I'd rather not section off some % of my dataset for evals.
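For that use case, a workaround along these lines should be possible with a plain `transformers.TrainerCallback`, without reserving any eval split. A minimal sketch; the callback, prompt list, and generation settings here are hypothetical, not axolotl options:

```python
import wandb
from transformers import TrainerCallback

MY_PROMPTS = ["<your prompt 1>", "<your prompt 2>"]  # hypothetical prompt set

class PromptTableCallback(TrainerCallback):
    """Generate from a fixed prompt list at each checkpoint and log a WandB table."""

    def __init__(self, tokenizer, max_new_tokens=64):
        self.tokenizer = tokenizer
        self.max_new_tokens = max_new_tokens

    def on_save(self, args, state, control, model=None, **kwargs):
        # Assumes wandb.init() has already been called and the tokenizer
        # has a pad token set (required for padding=True on batched prompts).
        enc = self.tokenizer(
            MY_PROMPTS, padding=True, return_tensors="pt"
        ).to(model.device)
        out = model.generate(**enc, max_new_tokens=self.max_new_tokens)
        texts = self.tokenizer.batch_decode(out, skip_special_tokens=True)
        table = wandb.Table(columns=["step", "prompt", "prediction"])
        for prompt, text in zip(MY_PROMPTS, texts):
            table.add_data(state.global_step, prompt, text)
        wandb.log({"custom_eval_predictions": table})
```

Hooking `on_save` ties generation to checkpointing (`save_steps`) rather than evaluation, so no `val_set_size` split is needed at all.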
…cloud#521)

* WIP Add training callback to send predictions to WandB table
* WIP improve wandb table reporting callback
* WIP improve wandb table reporting callback (cont)
* Add VSCode launching for debugging
* Add tiny llama example
* WIP attempt to improve post-eval prediction generation for table
* WIP attempt to improve post-eval prediction generation for table - part 2
* WIP batch generation
* WIP attempt to handle sample_packing using position_ids for wandb prediction table
* WIP add code for debugging
* Fix sample_packing support for wandb prediction table
* Clean up code for PR review
* Add eval_table_size, eval_table_max_new_tokens configs & clean up code
* Clean up PR, delete VSCode config, add tiny-llama example
* Add eval_table_size, eval_table_max_new_tokens documentation. Fix linting/formatting
How to use: #521 (comment)
Closes #490
What's New?
See examples:
https://www.loom.com/share/acaa23516b524aa29328b87b90f82599?sid=e2796761-d398-46a8-96a9-7f965b77437c
How to configure
1️⃣ Enable WandB
2️⃣ Every Eval will push updates to WandB (set `eval_steps` in config as desired)

Tasks