Training: Step 2 #16
Comments
Same question here. According to In any case, when I train
Have you solved the problem?
Yes, it's the learning rate. It should be decreased to 1e-5, and then Step 2 works. Note that it will indeed use the ground-truth programs in Step 2.
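A minimal sketch of what the adjusted Step 2 invocation might look like, assembled in Python so each argument is explicit. The `--learning_rate` flag name is an assumption and should be checked against `scripts/train_model.py`; the other arguments are copied from the Step 2 command quoted in this issue.

```python
import shlex

# Arguments copied from the Step 2 command quoted in this issue.
base_args = [
    "python", "scripts/train_model.py",
    "--model_type", "EE",
    "--num_iterations", "100000",
    "--checkpoint_path", "data/execution_engine.pt",
]

# Assumption: the training script exposes the learning rate under this flag name.
lr_args = ["--learning_rate", "1e-5"]

# Join into a single shell-safe command line.
command = shlex.join(base_args + lr_args)
print(command)
```

The printed line can be pasted into a shell once the flag name is confirmed.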
But what about the validation accuracy?
I get something like 95-96%, which is what is reported in the paper.
@rizar, can you please explain how you got 95-96% accuracy by directly training the execution engine on the ground-truth programs (as in Step 2)? My accuracy oscillates around 0.47 even after 5000 iterations with lr = 1e-5.
Can you try training longer?
What are the minimum PC specs you are all using to train this model? Can anyone suggest some?
Hello,
Regarding the training procedure on step 2:
python scripts/train_model.py --model_type EE --program_generator_start_from data/program_generator.py --num_iterations 100000 --checkpoint_path data/execution_engine.pt
I do not know if I have missed something, but program_generator_start_from is only invoked inside get_program_generator, for the 'PG+EE' and 'PG' model types.
Thank you.
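To illustrate the observation above, here is a hypothetical guard that reports when `--program_generator_start_from` is passed with a model type that never reads it. It is only a sketch: `parse_args`, `ignored_flags`, and `USES_PROGRAM_GENERATOR` are made-up names, not part of the repository, and the set of model types that load a program generator is taken from the description in this post.

```python
import argparse
import warnings

# Model types that, per the observation in this post, load a program generator.
USES_PROGRAM_GENERATOR = {"PG", "PG+EE"}


def parse_args(argv):
    # Hypothetical, reduced parser: only the two flags discussed here.
    parser = argparse.ArgumentParser()
    parser.add_argument("--model_type", default="PG")
    parser.add_argument("--program_generator_start_from", default=None)
    return parser.parse_args(argv)


def ignored_flags(args):
    """Return names of flags that were set but would never be read."""
    ignored = []
    if (args.program_generator_start_from is not None
            and args.model_type not in USES_PROGRAM_GENERATOR):
        ignored.append("program_generator_start_from")
    return ignored


args = parse_args([
    "--model_type", "EE",
    "--program_generator_start_from", "data/program_generator.py",
])
for name in ignored_flags(args):
    warnings.warn(f"--{name} has no effect for model_type={args.model_type}")
```

With a check like this, the Step 2 command would warn that the program generator argument is unused rather than silently dropping it.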