Add "final_extra_valid_opt_filepath" to train_model.py #3883
Conversation
Also adds a way to configure saving outputs to JSON. Tested with a new test. (Not super attached to the names or to the default values I've got set. I tried using 'Optional[str]' for the cmdline arg, but the test was not happy with that...)
Just want to finish discussion
parlai/scripts/train_model.py
Outdated
help="A '.opt' file that is used for final eval. Useful for setting skip-generation to false. 'datatype' must be included as part of the opt.", | ||
) | ||
train.add_argument( | ||
'--write-log-as-json', |
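The diff above is cut off mid-declaration, so here is a hedged reconstruction of roughly what the two new flags look like, written as a self-contained sketch with plain argparse rather than ParlAI's ParlaiParser. The first flag's name (--final-extra-valid-opt-filepath) is inferred from the PR title, and the defaults are assumptions, not the merged code.

```python
# Hedged sketch only: plain argparse stand-in for ParlAI's ParlaiParser.
# Flag names and defaults are assumptions inferred from the PR title and
# the diff above, not the merged implementation.
import argparse

parser = argparse.ArgumentParser()
train = parser.add_argument_group('Training Loop Arguments')
train.add_argument(
    '--final-extra-valid-opt-filepath',
    type=str,
    default='',
    help=(
        "A '.opt' file that is used for final eval. Useful for setting "
        "skip-generation to false. 'datatype' must be included as part of the opt."
    ),
)
train.add_argument(
    '--write-log-as-json',
    action='store_true',  # ParlAI itself would register this via its own 'bool' type
    help='Also dump the final reports to .json files.',
)

opt = vars(parser.parse_args([]))
print(opt['final_extra_valid_opt_filepath'], opt['write_log_as_json'])
```

If the default really is an empty string, that would sidestep the Optional[str] issue mentioned in the description: the option is always a plain str, and falsiness signals "not set".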
Aren't the final evals stored in the trainstats file already? If not, my suggestion would be we include them in the trainstats.
And not even make it an option. Just do it.
agree with all of @stephenroller's suggestions
parlai/scripts/train_model.py
Outdated
    with PathManager.open(
        opt['model_file'] + log_suffix + '.' + datatype + ".json", 'a'
    ) as f:
        json.dump(dict_report(report), f)
I think we'd prefer keeping it with the rest of the .trainstats
?
The other valid/reports aren't saved to .trainstats right now.
Every validation run, except the final one, is stored in trainstats.
ParlAI/parlai/scripts/train_model.py
Lines 474 to 481 in 9b5f8c6
    json.dump(
        {
            'parleys': self.parleys,
            'train_time': self.train_time.time(),
            'train_steps': self._train_steps,
            'total_epochs': self._total_epochs,
            'train_reports': self.train_reports,
            'valid_reports': self.valid_reports,
I'm humbly requesting you expand it to also include the final valid/test reports. (They can just be None during the intermediate stages)
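For illustration, a minimal, self-contained sketch of that request: carry the final valid/test reports inside the same .trainstats payload, with None placeholders until the final evals have actually run. The field names final_valid_report and final_test_report are assumptions, not necessarily what the PR ends up using.

```python
# Hedged sketch: the 'final_valid_report'/'final_test_report' field names
# are assumptions, not necessarily what the merged PR uses.
import json

def write_trainstats(
    path,
    parleys,
    train_time,
    train_steps,
    total_epochs,
    train_reports,
    valid_reports,
    final_valid_report=None,  # stays None during intermediate saves
    final_test_report=None,   # filled in once the final evals have run
):
    with open(path, 'w') as f:
        json.dump(
            {
                'parleys': parleys,
                'train_time': train_time,
                'train_steps': train_steps,
                'total_epochs': total_epochs,
                'train_reports': train_reports,
                'valid_reports': valid_reports,
                'final_valid_report': final_valid_report,
                'final_test_report': final_test_report,
            },
            f,
        )

# intermediate checkpoint: final reports are still None
write_trainstats('/tmp/model.trainstats', 1000, 123.4, 500, 1.0, [], [])
```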
Alright, the wording wasn't quite clear there before. Anyway, done.
Seems like you have a bug when model file isn't set (in that case we should just be throwing trainstats into the void). Otherwise lgtm, thanks for making the changes.
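For what it's worth, a hedged sketch of the guard being asked for: skip the JSON dump entirely when no model_file is configured, so the report is dropped instead of crashing on a missing path. The import paths are my best guess at ParlAI's layout, and log_suffix, datatype, and report are assumed to mean the same thing as in the diff earlier in this thread.

```python
# Hedged sketch of the missing guard. Imports are a best guess at ParlAI's
# layout; the arguments are assumed to match the diff above.
import json

from parlai.utils.io import PathManager
from parlai.core.metrics import dict_report

def maybe_write_final_report(opt, log_suffix, datatype, report):
    model_file = opt.get('model_file')
    if not model_file:
        # no model_file set: silently discard the report instead of crashing
        return
    with PathManager.open(model_file + log_suffix + '.' + datatype + '.json', 'w') as f:
        json.dump(dict_report(report), f)
```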
I've been annoyed at having to kick off separate eval loops at the ends of trains, so here we are.
Also added a way to configure saving outputs to JSON (because it's something I've wanted for a while, and I didn't want to add another return value in case scripts elsewhere were expecting only v_report and t_report).
Not super attached to the names or to the default values I've got set. I tried using 'Optional[str]' for the cmdline arg, but the test was not happy with that...
Testing steps
Wrote a new test that exercises all of the new functionality.
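For context, a hedged usage sketch of what the feature might look like from ParlAI's Python API, assuming the option is exposed under the name in the PR title (final_extra_valid_opt_filepath); the exact merged name, and the task/model/paths below, are illustrative assumptions only.

```python
# Hedged usage sketch; the flag name is taken from the PR title, and the
# task, model, and file paths are purely illustrative.
from parlai.scripts.train_model import TrainModel

TrainModel.main(
    task='convai2',
    model='transformer/generator',
    model_file='/tmp/demo_model',
    num_epochs=0.01,
    # run one extra final eval driven by an .opt file, e.g. one that sets
    # skip_generation=False and includes a 'datatype' entry
    final_extra_valid_opt_filepath='/tmp/final_eval.opt',
)
```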