OLMo Cookbook

OLMost every recipe you need to perform data interventions with the OLMo family of models.

How To Train an OLMo Model

Build your training configuration

See src/cookbook/recipes/train-1b-1xC-dclm.yaml for an example to clone

Launch your training job

olmo-cookbook launch -c src/cookbook/recipes/train-1b-1xC-dclm.yaml
Follow the interactive prompts. A link to the Beaker job will be provided upon successful submission.
Monitor your training job in wandb or Beaker

How To Evaluate an OLMo Model

Convert Checkpoint

For models trained with old trainer

olmo-cookbook-eval convert \
  -i /oe-training-default/kevinf/checkpoints/OLMo-medium/peteish7-medlr/step477000 \
  -t olmo2 \
  --use-beaker \
  --huggingface-tokenizer allenai/dolma2-tokenizer

For models trained with OLMo core

olmo-cookbook-eval convert \
  "/oe-training-default/ai2-llm/checkpoints/peteish32-anneal/OLMo2-32Bparams-5Ttokens-100Banneal/step11921" \
  -t olmo-core \
  --use-beaker \
  --huggingface-tokenizer allenai/OLMo-2-1124-7B

Run Evaluation

olmo-cookbook-eval evaluate \
  "/oe-training-default/ai2-llm/checkpoints/OLMoE/a0125/olmoe-8x1b-newhp-newds-dolmino-seed-42/step23842-hf" \
  --task core:mc --task mmlu:mc --task mmlu:rc --task gen \
  --priority high \
  --cluster aus80g \
  --num-gpus 1 \
  --model-backend vllm \
  --dashboard olmoe-0125

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
scripts		scripts
src/cookbook		src/cookbook
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OLMo Cookbook

How To Train an OLMo Model

Build your training configuration

Launch your training job

How To Evaluate an OLMo Model

Convert Checkpoint

For models trained with old trainer

For models trained with OLMo core

Run Evaluation

About

Releases

Packages

Contributors 4

Languages

License

allenai/olmo-cookbook

Folders and files

Latest commit

History

Repository files navigation

OLMo Cookbook

How To Train an OLMo Model

Build your training configuration

Launch your training job

How To Evaluate an OLMo Model

Convert Checkpoint

For models trained with old trainer

For models trained with OLMo core

Run Evaluation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages