Training recipe?? #14
Comments
Hi, thanks for your interest! We will release the training code once the Hugging Face interface to LLaMA becomes stable (merged into main). Our fine-tuning procedure is standard and was performed with Hugging Face's Trainer. You can see our hyperparameters here.
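For readers who want a concrete picture of "standard fine-tuning with Hugging Face's Trainer", below is a minimal sketch. It is not the repo's exact script: the checkpoint name, data file, and hyperparameter values are illustrative placeholders only loosely modeled on the linked settings.

```python
# Minimal sketch of supervised fine-tuning a causal LM with Hugging Face's Trainer.
# Checkpoint name, data path, and hyperparameters below are placeholders, not the
# exact Alpaca recipe.
from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

model_name = "decapoda-research/llama-7b-hf"  # placeholder base model
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token  # LLaMA tokenizers ship without a pad token
model = AutoModelForCausalLM.from_pretrained(model_name)

# Assumed: a JSON file of instruction/output pairs (e.g. alpaca_data.json).
dataset = load_dataset("json", data_files="alpaca_data.json")["train"]

def tokenize(example):
    # Concatenate prompt and response into one causal-LM training example.
    text = example["instruction"] + "\n" + example["output"]
    return tokenizer(text, truncation=True, max_length=512)

tokenized = dataset.map(tokenize, remove_columns=dataset.column_names)

args = TrainingArguments(
    output_dir="alpaca-ft",
    num_train_epochs=3,
    per_device_train_batch_size=4,
    gradient_accumulation_steps=8,
    learning_rate=2e-5,
    warmup_ratio=0.03,
    lr_scheduler_type="cosine",
    bf16=True,
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=tokenized,
    # mlm=False gives the plain next-token-prediction (causal LM) objective.
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```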
Hi, do you fine-tune the model with next-token prediction, as in pre-training? And how much CUDA memory does training require? Is it possible to train on a single 80GB A100 GPU?
Hi all, we have released the training code, see https://github.com/tatsu-lab/stanford_alpaca#fine-tuning. Please open a new issue for any further questions/concerns.
@rtaori as far as I can tell from your code, it looks like standard teacher forcing (aka next-token prediction). Is this accurate?
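For context, "teacher forcing / next-token prediction" here is just the standard causal-LM loss that Hugging Face models compute when you pass `labels`: the labels are shifted internally by one position and cross-entropy is taken over each next token. The sketch below uses GPT-2 as a small stand-in model and masks the prompt tokens with -100 so only the response is scored; whether the released Alpaca code masks the prompt this way should be checked against its train script.

```python
# Sketch of the causal-LM (next-token-prediction) objective in Hugging Face.
# GPT-2 is used as a small stand-in; the prompt/response strings are illustrative.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

prompt = "Instruction: say hello.\nResponse:"
response = " Hello!"
enc = tokenizer(prompt + response, return_tensors="pt")

labels = enc["input_ids"].clone()
prompt_len = len(tokenizer(prompt)["input_ids"])
labels[:, :prompt_len] = -100  # positions set to -100 are ignored by the loss

# The model shifts labels internally: token t is predicted from tokens < t
# (teacher forcing), and the loss is cross-entropy over the unmasked positions.
out = model(**enc, labels=labels)
print(out.loss)
```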
The blog says the training recipe is also released in the code, but I cannot find it. Can you update the repo with the code used for training the model, along with the required dependencies/guide, etc., to help us do the same, maybe with bigger models?
Thanks for this awesome repo.