Commit

fast training
Alessandro Sordoni committed Nov 10, 2024
1 parent cabf565 commit d993409
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions projects/modular_llm/configs/models/gptneo_125m_fast.json
@@ -8,7 +8,7 @@
 "modify_layers": "q_proj|v_proj|k_proj",
 "trainable_param_names": ".*lora_[ab].*",
 "num_train_epochs": 10,
-"learning_rate": 1e-3,
+"learning_rate": 1e-2,
 "micro_batch_size": 16,
 "train_batch_size": 16,
 "predict_batch_size": 8,
@@ -17,7 +17,7 @@
 "model_family": "gpt",
 "optimizer": "adamw",
 "dataset": "sordonia/flan-debug-flat",
-"warmup_proportion": 0.06,
+"warmup_proportion": 0.0,
 "max_input_length": 1024,
 "max_output_length": 128,
 "truncation_side": "left"
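
The two changed values interact: with `warmup_proportion` set to 0.0, training would start at the full `learning_rate` of 1e-2 from the first step. The sketch below illustrates this under assumptions that the commit does not confirm: that `warmup_proportion` is a fraction of the total optimizer steps, and that warmup is linear. The helper names `warmup_steps` and `lr_at_step` and the step count of 1000 are hypothetical and are not taken from the repository.

```python
def warmup_steps(total_steps: int, warmup_proportion: float) -> int:
    """Number of warmup steps, assuming the proportion applies to total steps."""
    return round(total_steps * warmup_proportion)


def lr_at_step(step: int, peak_lr: float, n_warmup: int) -> float:
    """Learning rate at a 0-indexed step under an assumed linear warmup.

    Decay after warmup is deliberately left out; only the ramp is modelled.
    """
    if n_warmup == 0:
        # No warmup: the peak learning rate is used immediately.
        return peak_lr
    return peak_lr * min(1.0, (step + 1) / n_warmup)


if __name__ == "__main__":
    total = 1000  # hypothetical number of optimizer steps
    configs = {
        "before": {"learning_rate": 1e-3, "warmup_proportion": 0.06},
        "after": {"learning_rate": 1e-2, "warmup_proportion": 0.0},
    }
    for name, cfg in configs.items():
        n = warmup_steps(total, cfg["warmup_proportion"])
        first = lr_at_step(0, cfg["learning_rate"], n)
        print(f"{name}: warmup steps = {n}, lr at first step = {first:.2e}")
```

Under these assumptions, the old config ramps up over the first 6% of steps, while the new config applies the tenfold higher rate at once, which fits the commit title "fast training" for a debug-sized dataset.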