Update develop-ref after #1750 #1751

JohnHalleyGotway · 2021-04-12T15:33:43Z

Pull Request Testing

Update develop-ref after PR #1750 for issue #1714. This PR modifies the configuration for unit_tc_gen.xml to exercise the new configuration options.

Describe testing already performed for these changes:

The nightly build (NB) for develop failed on 20210412, so I ran the following command to confirm that the changes are limited to the tc_gen output:

egrep -i "ERROR:|file1:" test_regression_20210412.log | egrep -B 1 ERROR: | grep file1
file1: MET-develop-ref/test_output/tc_gen/tc_gen_2016_ctc.txt
file1: MET-develop-ref/test_output/tc_gen/tc_gen_2016_cts.txt
file1: MET-develop-ref/test_output/tc_gen/tc_gen_2016.stat

I inspected the modified files and confirmed that the only diffs are due to the new config options.
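
For reviewers who prefer to script this check, the following is a minimal Python sketch that approximates the egrep pipeline above. It assumes the regression log lists each compared file on a `file1:` line before any `ERROR:` line for that comparison; the function name `files_with_errors` and the sample log lines are hypothetical, not taken from the MET test scripts.

```python
import re


def files_with_errors(log_lines):
    """Return the 'file1:' lines that directly precede an 'ERROR:' line."""
    # Step 1: keep only lines mentioning ERROR: or file1: (case-insensitive),
    # mirroring the first egrep -i in the pipeline.
    kept = [ln.rstrip("\n") for ln in log_lines
            if re.search(r"ERROR:|file1:", ln, re.IGNORECASE)]
    # Steps 2 and 3: for each case-sensitive ERROR: line, report the line
    # before it when that line names a file1 path.
    hits = []
    for prev, cur in zip(kept, kept[1:]):
        if "ERROR:" in cur and "file1" in prev:
            hits.append(prev)
    return hits


if __name__ == "__main__":
    # Hypothetical log excerpt, for illustration only.
    sample = [
        "file1: a.txt",
        "file2: b.txt",
        "ERROR: files differ",
        "file1: c.txt",
        "file2: d.txt",
    ]
    for line in files_with_errors(sample):
        print(line)
```

Unlike grep, this sketch never reports an `ERROR:` line that itself happens to contain `file1`, which should not matter for logs of the shape assumed here.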

Recommend testing for the reviewer(s) to perform, including the location of input datasets, and any additional instructions:

No additional testing needed.
Do these changes include sufficient documentation updates, ensuring that no errors or warnings exist in the build of the documentation? [Yes]
This PR serves as documentation of the change.
Do these changes include sufficient testing updates? [Yes]
Will this PR result in changes to the test suite? [Yes]

If yes, describe the new output and/or changes to the existing output:

As described above.

Pull Request Checklist

See the METplus Workflow for details.

Complete the PR definition above.
Ensure the PR title matches the feature or bugfix branch name.
Define the PR metadata, as permissions allow.
Select: Reviewer(s), Project(s), and Milestone
After submitting the PR, select Linked Issues with the original issue number.
After the PR is approved, merge your changes. If permissions do not allow this, request that the reviewer do the merge.
Close the linked issue and delete your feature or bugfix branch from GitHub.

* Per #1716, committing changes from Randy Bullock to support floating point percentile thresholds.
* Per #1716, no code changes, just consistent formatting.
* Per #1716, change SFP50 example to SFP33.3 to show an example of using floating point percentile values.

* Per #1733, add column_exc_name, column_exc_val, init_exc_name, and init_exc_val options to the TCStat config files.
* Per #1733, enhance tc_stat to support the column_exc and init_exc config file and job command filtering options.
* Per #1733, update stat_analysis to support the -column_exc job filtering option. Still need to update documentation and add unit tests.
* Per #1773, update the user's guide with the new config and job command options.
* Per #1733, add call to stat_analysis to exercise -column_str and -column_exc options.
* Per #1733, I ran into a namespace conflict in tc_stat where -init_exc was used to filter by time AND by string value. So I switched to using -init_str_exc instead, and made the corresponding change to -column_str_exc in stat_analysis and tc_stat. Also changed internal variable names to use IncMap and ExcMap to keep the logic clear.
* Per #1733, tc_stat config file updates to switch from column_exc and init_exc to column_str_exc and init_str_exc.
* Per #1733, add tc_stat and stat_analysis jobs to exercise the string filtering options.
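
The include/exclude string filtering described in these commits can be pictured with a small Python sketch. It is only an illustration: the names `inc_map` and `exc_map` echo the IncMap and ExcMap variables mentioned above, but the function `keep_line`, the dictionary layout, and the sample rows are assumptions rather than the actual tc_stat or stat_analysis code.

```python
def keep_line(row, inc_map, exc_map):
    """Return True if `row` passes the include and exclude string filters.

    row     -- dict mapping column name to its string value
    inc_map -- dict mapping column name to the set of values to keep
    exc_map -- dict mapping column name to the set of values to reject
    """
    # Include filters: every listed column must hold one of the allowed values.
    for col, allowed in inc_map.items():
        if row.get(col) not in allowed:
            return False
    # Exclude filters: no listed column may hold one of the rejected values.
    for col, banned in exc_map.items():
        if row.get(col) in banned:
            return False
    return True


if __name__ == "__main__":
    # Hypothetical rows, for illustration only.
    rows = [
        {"AMODEL": "AAAA", "LEVEL": "TS"},
        {"AMODEL": "BBBB", "LEVEL": "TS"},
        {"AMODEL": "AAAA", "LEVEL": "HU"},
    ]
    kept = [r for r in rows
            if keep_line(r, {"AMODEL": {"AAAA"}}, {"LEVEL": {"HU"}})]
    print(kept)
```

Keeping the two maps separate, as the commit message suggests, lets one column appear in both an include and an exclude filter without ambiguity.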

* Per #1575, add mpr_column and mpr_thresh entries to all of the Grid-Stat and Point-Stat config files.
* Per #1575, define config strings to be parsed from the config files.
* Per #1575, store col_name_ptr and col_thresh_ptr in PairBase. They are being used for PairDataPoint to do MPR filtering in Grid-Stat and Point-Stat. But they could eventually be extended to filter ORANK columns for Ensemble-Stat.
* Per #1575, add MPR filtering logic to pair_data_point.cc. Include filtering logic in PairDataPoint instead of VxPairDataPoint since Grid-Stat uses PairDataPoint.
* Per #1575, update point_stat to parse the mpr_column and mpr_thresh config file options. Include the MPR rejection reason code counts in the log output.
* Per #1575, updated Grid-Stat to parse mpr_column and mpr_thresh options.
* Per #1575, update Point-Stat to store mpr_sa and mpr_ta locally and then call set_mpr_filt() after the VxPairDataPoint object has been sized and allocated.
* Per #1575, renamed PairDataEnsemble::subset_pairs() to subset_pairs_obs_thresh() to be a little more explicit about things. I'll do the same for PairDataPoint using names subset_pairs_cnt_thresh() and subset_pairs_mpr_thresh().
* Per #1575, some cleanup, moving check_fo_thresh() utility function from vx_config to vx_statistics library.
* Per #1575, when implementing this for Grid-Stat, I realized that there isn't much benefit in storing col_name_ptr and col_name_thresh in PairBase. These changes remove that.
* Per #1575, updating pair_data_point.h/.cc to handle the subsetting of data based on the MPR thresh.
* Per #1575, rename subset_pairs() to subset_pairs_cnt_thresh() to be a bit more explicit with the naming conventions.
* Per #1575, no real changes here. Just reorganizing the location of the mpr_sa and mpr_ta members.
* Per #1575, make the subset_pairs() utility function a member function of the PairDataPoint class named subset_pairs_cnt_thresh() and update the application code to call it.
* Per #1575, need to actually set the mpr_thresh!
* Per #1575, update subset_pairs_mpr_thresh() to make sure the StringArray and ThreshArray lengths are the same.
* Per #1575, replace PairDataPoint::subset_pairs_mpr_thresh() with a utility function named apply_mpr_thresh_mask(). This is for Grid-Stat to apply the mpr_thresh settings after the DataPlane pairs have been created but prior to applying any smoothing operations.
* Per #1575, add documentation about mpr_column and mpr_thresh.
* Per #1575, mpr_columns can also include CLIMO_CDF.
* Per #1575, add tests for Grid-Stat and Point-Stat to exercise the mpr_column and mpr_thresh config file options.
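
As a rough illustration of the mpr_column and mpr_thresh filtering these commits describe, the Python sketch below pairs each column name with a threshold string and keeps a matched pair only when every column passes. The requirement that both lists have the same length comes from the commit messages; the all-must-pass rule, the simple threshold syntax, and the names `parse_thresh` and `apply_mpr_filter` are assumptions for this sketch, not the MET implementation.

```python
import operator

# Comparison symbols checked longest-first so ">=" is not read as ">".
_OPS = {">=": operator.ge, "<=": operator.le, "==": operator.eq,
        "!=": operator.ne, ">": operator.gt, "<": operator.lt}


def parse_thresh(text):
    """Split a threshold string such as '>=5.0' into (comparison, value)."""
    for sym in (">=", "<=", "==", "!=", ">", "<"):
        if text.startswith(sym):
            return _OPS[sym], float(text[len(sym):])
    raise ValueError(f"unrecognized threshold: {text!r}")


def apply_mpr_filter(pairs, mpr_column, mpr_thresh):
    """Keep the matched pairs whose columns satisfy every threshold."""
    if len(mpr_column) != len(mpr_thresh):
        raise ValueError("mpr_column and mpr_thresh must have the same length")
    checks = [(col, *parse_thresh(t)) for col, t in zip(mpr_column, mpr_thresh)]
    return [p for p in pairs
            if all(op(p[col], val) for col, op, val in checks)]


if __name__ == "__main__":
    # Hypothetical matched pairs, for illustration only.
    pairs = [{"FCST": 1.0, "OBS": 2.0}, {"FCST": 6.0, "OBS": 5.5}]
    print(apply_mpr_filter(pairs, ["FCST", "OBS"], [">5.0", "<6.0"]))
```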

* Try path insert.
* sys.path insert.
* Per #1319, adding David's changes back into the feature_1319_no_pickle branch. It compiles but TEST: python_numpy_plot_data_plane_pickle fails when testing on my Mac. Committing now to test on kiowa.
* Per #1319, small update to write_tmp_dataplane.py script. Had a couple of if statements that should really be elif.
Co-authored-by: John Halley Gotway <johnhg@kiowa.rap.ucar.edu>
Co-authored-by: John Halley Gotway <johnhg@ucar.edu>

* Per #1736, if -out_stat was used for aggregate or aggregate_stat jobs, do not write output to the -out or log output.
* Per #1736, clarify stat_analysis documentation for -out_stat option.
* Per #1736, for jobs which can write .stat output, don't waste time populating the output AsciiTable unless it's actually going to be written.
Co-authored-by: John Halley Gotway <johnhg@kiowa.rap.ucar.edu>

…les set prior to the test are unset afterwards! So we'd run all the rest of the tests after unit_python.xml with an empty path. That would likely cause any subsequent call to Rscript to fail. Recommend tightening up this logic when we move these tests to GHA.

* Per #1747, update MET to interpret longlong values as integers. NetCDF file attributes that have an LL suffix are read into python as numpy.int64 objects. Right now MET fails when trying to read those as integers. Update the parsing logic to interpret those as ints.
* Per #1747, since MET can now interpret both long and longlong's as ints, there's no need to cast nx and ny to ints in the read_tmp_dataplane.py script anymore.
* Per #1747, this is slightly unrelated. But after installing the netCDF4 module on kiowa for /usr/local/met-python3/bin/python3, we should no longer need a custom PATH setting to get unit_python.xml to work. Reverting the change I made to it a couple of days ago to get it working.

…kept failing through GHA with a divide by zero error. It occurs in compute_track_err() but only for a very specific set of data. The bdeck valid increment evaluates to 0 which causes the divide by 0 error. It also can evaluate to bad data (e.g. -9999). The fix is to check for 0 and bad data. If found, use the constant best_track_time_step value instead.
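
The guard described in this commit message can be sketched in Python as follows. This is not the code from compute_track_err(); the function name `safe_time_step` is hypothetical, and the 6-hour value used for the fallback constant is an assumption, since the actual value of best_track_time_step is not stated here. The bad data value of -9999 is the example given in the message.

```python
BAD_DATA = -9999                    # bad data value cited in the commit message
BEST_TRACK_TIME_STEP = 6 * 3600     # assumed fallback in seconds (hypothetical)


def safe_time_step(valid_inc, default=BEST_TRACK_TIME_STEP):
    """Return a time increment that is safe to use as a divisor."""
    # A zero increment would trigger a divide-by-zero, and a bad data value
    # would yield a meaningless result, so fall back to the constant.
    if valid_inc == 0 or valid_inc == BAD_DATA:
        return default
    return valid_inc


if __name__ == "__main__":
    for inc in (0, BAD_DATA, 10800):
        print(inc, "->", safe_time_step(inc))
```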

* Per #1714, add tc_gen genesis_match_window configuration option to define a search window relative to the forecast genesis time.
* Per #1714, clarify docs to state the genesis_match_window.end = 12 allows for matches for early forecasts. Also add an example of this option to the tc_gen unit test.
* Per #1714, switch ops_hit_tdiff to ops_hit_window.
* Per #1714, skip genesis events for tracks where the cyclone number is > 50.
* Per #1714, only discard cyclone numbers > 50 from the Best track, not the forecast tracks.
* Per #1716, add note to the tc_gen chapter about skipping Best tracks with cyclone number > 50.
* Per #1714, adding genesis_match_point_to_track config file option for TC-Gen. Note that this version of the code is close but doesn't actually compile yet. I still need to figure out exactly how to process the operational tracks. Should this logic also apply to the matching for those tracks?
* Per #1714, the logic for checking the operational tracks is pretty simple. We only store/check operational track points for lead time = 0. So applying the genesis_match_point_to_track boolean config option does not make sense.
* Per #1714, update the tc-gen user's guide chapter to describe the updated logic and new config file option.
* Per #1714, fix the logic of the is_match() function.
* Per #1714, reconfigure the call to tc_gen to exercise the new genesis_match_track_to_point option.
* Per #1714, just fixing spacing in source code.
* Committing 2 small changes not specifically related to #1714, but related to the processing of genesis tracks. When getting items from ATCFGenLines, the columns to be shifted are off by one. We had been shifting offset 2 up to 3, but it should have remained at 2. Also when initializing a TrackInfo object, set the StormID by calling ATCFLineBase::storm_id() instead of constructing it from BASIN:CYCLONE:YYYY. For ATCFGenLines we want to set the Storm ID equal to the 3rd column rather than constructing it!
* Per #1714, fix an error in the logic of GenesisInfo::is_match(const GenesisInfo &,...). I was using the index of the current GenesisInfo object instead of the one from the input argument. Fix this by adding GenesisInfo::genesis() member function to return a reference to the TrackPoint for Genesis.
* Per #1714, correcting logic for parsing the storm_id and warning_time columns for ATCFGen and regular ATCF line types. For ATCFGen line types, the code was incorrectly using the 3rd column when it should have used the 4th column!
Co-authored-by: John Halley Gotway <johnhg@kiowa.rap.ucar.edu>
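
To make the genesis_match_window idea concrete, here is a minimal Python sketch of a time window defined relative to the forecast genesis time. It assumes the window bounds are offsets in hours and that a Best track time matches when it falls inside the window; the default of 0 for both bounds, the function names, and the dates are assumptions for illustration and do not reproduce the tc_gen matching logic. The cyclone number cutoff of 50 for Best tracks is taken from the commit messages.

```python
from datetime import datetime, timedelta


def in_match_window(fcst_genesis, best_time, beg_hours=0, end_hours=0):
    """Return True if best_time lies in the window around fcst_genesis."""
    lo = fcst_genesis + timedelta(hours=beg_hours)
    hi = fcst_genesis + timedelta(hours=end_hours)
    return lo <= best_time <= hi


def keep_best_track(cyclone_number):
    """Discard Best tracks whose cyclone number is greater than 50."""
    return cyclone_number <= 50


if __name__ == "__main__":
    fcst = datetime(2016, 8, 1, 0)   # hypothetical forecast genesis time
    best = datetime(2016, 8, 1, 6)   # hypothetical Best track time, 6 h later
    print(in_match_window(fcst, best))                 # window of 0 hours
    print(in_match_window(fcst, best, end_hours=12))   # end = 12
```

With a window of 0 the early forecast in this example finds no match, while end = 12 lets it match a Best track time up to 12 hours later, which is the case the clarified documentation describes.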

David Fillmore added 30 commits January 26, 2021 09:32

Start on write netcdf pickle alternative.

ebddb56

Write dataplane array.

0fdbfdd

Start on read of netcdf as pickle alternative.

6d46603

Create attribute variables.

6fe4245

Use global attributes for met_info attrs.

644db21

Add grid structure.

6594062

Read metadata back into met_info.attrs.

c6667e3

Convert grid.nx and grid.ny to int.

1e6eb9e

Rename _name key to name.

e005585

Removed pickle write.

ab986ca

Fixed write_pickle_dataplane to work for both numpy and xarray.

760b690

Use items() to iterate over key, value attrs.

791ebf0

Write temporary text file.

c5f17e8

Renamed scripts.

d6142e8

Changed script names in Makefile.am.

b39ca28

Replaced pickle with tmp_nc.

7cc2d77

Fixed wrapper script names.

df0db18

Test for attrs in met_in.met_data.

044c704

Initial version of read_tmp_point module.

d798e9d

Added read_tmp_point.py to install list.

8116e75

Start on Python3_Script::read_tmp_point.

7b57715

Write MPR tmp ascii file.

5502da9

Renamed to read_tmp_ascii to use for point and MPR.

961b4fc

Renamed to read_tmp_ascii to use for point and MPR.

4c0963d

Define Python3_Script::import_read_tmp_ascii_py.

91122be

Call Python3_Script::import_read_tmp_ascii_py.

fef8484

Append MET_BASE/wrappers to sys.path.

93e9762

Finished implementation of Python3_Script::import_read_tmp_ascii_py.

44d8328

Call Python3_Script::read_tmp_ascii in python_handler.

3953aba

Revised python3_script::read_tmp_ascii with call to run, PyRun_String.

25961d6

hsoh-u and others added 23 commits March 18, 2021 14:17

Merge pull request #1719 from dtcenter/bugfix_1715_pb2nc_seg_fault_wi…

7998d89

…th_pbl Bugfix 1715 pb2nc seg fault with pbl

Per #1725, return good status from TrackInfoArray::add() when using a…

8dbef78

…n ATCF line to create a new track. (#1726)

Per #1705, update the threshold node hierarchy by adding a climo_prob…

5866b2a

…() function to determine the climatological probability of a CDP-type threshold. Also update derive_climo_prob() in pair_base.cc to call the new climo_prob() function. (#1724)

Update pull_request_template.md

8dfd7c0

Bugfix 1737 develop little_r (#1739)

6055600

* Per #1737, migrate the same fix from main_v9.1 over to the develop branch.
* Per #1737, add another unit test for running ascii2nc with corrupt little_r records.

Feature GitHub actions (#1742)

804b1ac

* Adding files to build documentation via GitHub Actions
* Removing html_theme_options
* Removed warnings.log from help section

Per #1319, this is a hotfix to the develop branch. While running unit…

338c6e2

…_python.xml works via the command line, it fails when run through cron. The problem is the PATH setting. Need to have the anaconda bin directory in the path for it to succeed. Adding that for the single test.

Merge branch 'develop' of https://github.com/dtcenter/MET into develop

4b7b768

Just lining up a log message in the output of gen_vx_mask.

69f5b5c

Trying to get the PATH setting correct for unit_python.xml.

e18f0b5

Changed weblink for METplus documentation

1bafac0

per #1319 added netCDF4 python package to MET docker image so it is a…

9a9bbc7

…vailable for python embedding cases that use MET_PYTHON_EXE

Turned specific section numbers into linked sections because section …

c06e6da

…numbers can change

Removed hard-coded references to section numbers

b92a6f6

JohnHalleyGotway added this to the MET 10.0.0 milestone Apr 12, 2021

JohnHalleyGotway linked an issue Apr 12, 2021 that may be closed by this pull request

Allow tc_gen to match fcst/obs when forecast genesis times are prior to observed genesis times #1714

Closed

21 tasks

Merge branch 'develop-ref' into develop

5ba033c

JohnHalleyGotway merged commit 18957c5 into develop-ref Apr 12, 2021

JohnHalleyGotway changed the title ~~Updated develop-ref after #1750~~ Update develop-ref after #1750 Apr 17, 2021
