Make mpt7b finetuning more obvious #101
Conversation
Note: #90 is merged
A couple of minor suggestions to correct some issues with the model configs, but otherwise this looks great!
BTW, t5-small_dolly_sft.yaml also has the `device`-instead-of-`init_device` issue, but I couldn't suggest a change on it because the file was just renamed. Please apply the same fix there.
* make mpt7b finetuning more obvious
* change yaml structure and references to paths
* needed to install pre-commit
* fix merge issue
* Local dataset rework
* Apply suggestions from code review
* YAML touch ups

Co-authored-by: Vitaliy Chiley <6439018+vchiley@users.noreply.github.com>
Co-authored-by: Alex Trott <alex@mosaicml.com>
Resolves RESEARCH-710
Also changes the folder structure of the YAMLs to make it clearer which MPTs are pretrained models from the HF Hub and which are architectures for pretraining.
WARNING: This will not work until #90 is merged!