Training: Step 2 #16

affromero · 2017-08-15T21:08:32Z

Hello,

Regarding the training procedure on step 2:
python scripts/train_model.py --model_type EE --program_generator_start_from data/program_generator.py --num_iterations 100000 --checkpoint_path data/execution_engine.pt

I do not know if I have missed something, but program_generator_start_from is only invoked inside get_program_generator, for 'PG+EE' and 'PG' model types.

Thank you.

The text was updated successfully, but these errors were encountered:

rizar · 2018-03-09T15:05:15Z

Same question here. According to TRAINING.md, in Step 2 "we train the execution engine, using programs predicted from the program generator in the previous step". In train_model.py line 238 says "train execution engine with ground-truth programs". Can you please explain this discrepancy?

In any case, when I train --model_type=EE without Step 1 pretraining, the learning doesn't really progress (still at ~50% accuracy after 100000) iterations.

liuweide01 · 2018-04-10T07:15:24Z

have you solve the problem?

rizar · 2018-04-10T12:34:43Z

Yes, it's the learning rate. It should be decreased to 1e-5, and then step 2 works. Note, that it will indeed use the groundtruth programs in Step 2.

liuweide01 · 2018-04-14T00:50:47Z

But how about the val accuracy?

rizar · 2018-04-14T15:52:13Z

I get smth like 95-96%, which is what is reported in the paper.

ankursikarwar · 2020-08-17T01:51:10Z

@rizar can you please tell how did you get 95-96% accuracy by directly training the execution engine using the ground truth programs (as in step 2). My accuracy is oscillating around 0.47 even after 5000 iterations when using lr = 1e-5

rizar · 2020-08-18T18:37:10Z

Can you try training longer?

ankursikarwar · 2020-08-20T04:59:12Z

Can you try training longer?

Thanks to you, I trained the execution engine for 100000 iterations with lr=1e-5 and got around 89% accuracy. Actually, accuracy increased quite slowly initially, and then between 40k and 60k iterations, it increased steeply.

smdp2000 · 2020-10-22T08:47:10Z

minimum pc specs you all are using to train this model, can anyone suggest

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Training: Step 2 #16

Training: Step 2 #16

affromero commented Aug 15, 2017

rizar commented Mar 9, 2018

liuweide01 commented Apr 10, 2018

rizar commented Apr 10, 2018

liuweide01 commented Apr 14, 2018

rizar commented Apr 14, 2018

ankursikarwar commented Aug 17, 2020

rizar commented Aug 18, 2020

ankursikarwar commented Aug 20, 2020

smdp2000 commented Oct 22, 2020

Training: Step 2 #16

Training: Step 2 #16

Comments

affromero commented Aug 15, 2017

rizar commented Mar 9, 2018

liuweide01 commented Apr 10, 2018

rizar commented Apr 10, 2018

liuweide01 commented Apr 14, 2018

rizar commented Apr 14, 2018

ankursikarwar commented Aug 17, 2020

rizar commented Aug 18, 2020

ankursikarwar commented Aug 20, 2020

smdp2000 commented Oct 22, 2020