
Added GPFA parser function to gp_from_aimd #173

Merged: 1 commit into master, May 19, 2020

Conversation

stevetorr (Contributor)

Adds:

  • parse_trajectory_trainer_output to gp_from_aimd.py, which allows for easy parsing both on a frame-by-frame level and on the GP level.

  • A unit test for the parser, and a test file for it to run on.

Changes:

  • Certain output.py functions now have their arguments changed to allow optional input. This should change nothing about the way they are used in other scripts, but makes for cleaner GPFA output parsing.

  • A minor cosmetic change to GaussianProcess' training_statistics method.

@stevetorr stevetorr requested a review from nw13slx May 18, 2020 22:47
@codecov-io

codecov-io commented May 18, 2020

Codecov Report

Merging #173 into master will increase coverage by 0.36%.
The diff coverage is 97.87%.


@@            Coverage Diff             @@
##           master     #173      +/-   ##
==========================================
+ Coverage   58.10%   58.47%   +0.36%     
==========================================
  Files          35       35              
  Lines        7467     7533      +66     
==========================================
+ Hits         4339     4405      +66     
  Misses       3128     3128              
Impacted Files          Coverage Δ
flare/gp_from_aimd.py   91.95% <97.40%> (+2.45%) ⬆️
flare/gp.py             78.27% <100.00%> (ø)
flare/output.py         84.44% <100.00%> (+0.35%) ⬆️

Continue to review full report at Codecov.

Legend
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 0743870...54406c6.

@@ -182,7 +182,7 @@ def __init__(self, frames: List[Structure],
assert (isinstance(skip, int) and skip >= 1), "Skip needs to be a " \
"positive integer."
self.validate_ratio = validate_ratio
-        assert (validate_ratio >= 0 and validate_ratio <= 1), \
+        assert (0 <= validate_ratio <= 1), \
Collaborator:

Didn't know that Python can do this. Cool
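For reference, the chained comparison in the new assert is standard Python: `0 <= x <= 1` is equivalent to `(0 <= x) and (x <= 1)`, except that `x` is evaluated only once. A minimal self-contained illustration (the helper name here is made up for the example):

```python
def check_validate_ratio(validate_ratio: float) -> float:
    # Chained comparison: equivalent to
    # (0 <= validate_ratio) and (validate_ratio <= 1),
    # but validate_ratio is evaluated only once.
    assert 0 <= validate_ratio <= 1, \
        "validate_ratio must lie in [0, 1]."
    return validate_ratio

print(check_validate_ratio(0.1))   # 0.1
try:
    check_validate_ratio(1.5)
except AssertionError as exc:
    print("rejected:", exc)
```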

@@ -243,11 +243,16 @@ def pre_run(self):
self.gp.opt_algorithm,
dt=0,
Nsteps=len(self.frames),
-            structure=self.frames[0],
Collaborator:

It's good to remove this. It gave me some trouble as well...

:return:
"""
frames, gp_data = parse_trajectory_trainer_output(
'./test_files/gpfa_parse_test.out', True)
Collaborator:

Is it possible to also use the GPFA unit test output as input here, so the test can check whether the code is self-consistent?

stevetorr (Contributor, Author) commented May 19, 2020:

That's a nice idea and it occurred to me; gpfa_parse_test.out is based on a unit test output, but I also wanted to make sure the tests could run independently. Is there a better way to do this you see?
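The self-consistency idea above amounts to a round-trip test: emit records, parse them back, and compare. A hypothetical sketch of the pattern, fully self-contained (a real version would run the GPFA trainer and feed its .out file to parse_trajectory_trainer_output; the record fields here are made up):

```python
import json
import tempfile

# Illustrative per-frame records; the real GPFA output has its own format.
records = [{"frame": 0, "mae": 0.012}, {"frame": 1, "mae": 0.009}]

def write_output(path: str, records: list) -> None:
    # Write one JSON record per line.
    with open(path, "w") as f:
        for rec in records:
            f.write(json.dumps(rec) + "\n")

def parse_output(path: str) -> list:
    # Parse the file back into a list of records.
    with open(path) as f:
        return [json.loads(line) for line in f]

with tempfile.NamedTemporaryFile(mode="w", suffix=".out", delete=False) as tmp:
    path = tmp.name
write_output(path, records)
assert parse_output(path) == records  # the parse reproduces the input
print("round trip OK")
```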


initial_gp_statistics = json.loads(gp_stats_line)

# Get pre_run statistics (if pre-run was done):
Collaborator:

Do we have a mechanism to check whether the GP model began with some training data before the whole GPFA run? Ideally, we can throw a warning to let the user know that the parser will not fully recover the GP model unless they use the same initial model.

stevetorr (Contributor, Author):

I never know where to set the bar with these things, but I think it might be okay to trust the user here. The way I see it, the initial GP statistics being empty or not is the signal to the user. However, I think this is a great idea to keep in our back pocket if/when we write a "rewind GPFA" function; the warning that the GPFA output is not a complete record of the GP's training should be raised then.
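If such a warning were ever added, a minimal sketch might look like the following. This is an assumption-laden illustration, not FLARE's API: the helper name and the "N" key for the training-set size are invented for the example; only the `json.loads(gp_stats_line)` call mirrors the parser code above.

```python
import json
import warnings

def check_initial_stats(gp_stats_line: str) -> dict:
    """Hypothetical helper: warn when the GP entered the GPFA run with
    pre-existing training data, in which case the parsed output alone
    cannot fully reconstruct the model."""
    stats = json.loads(gp_stats_line)
    # "N" (number of training environments) is an assumed field name.
    if stats.get("N", 0) > 0:
        warnings.warn("GP began with prior training data; the GPFA output "
                      "is not a complete record of the GP's training.")
    return stats

check_initial_stats('{"N": 0}')    # silent: GP started empty
check_initial_stats('{"N": 120}')  # emits a UserWarning
```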

@stevetorr stevetorr merged commit a819259 into master May 19, 2020
@stevetorr stevetorr deleted the feature/steven/gpfa_parser_update branch August 6, 2020 15:52