Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature 1943 gridstat seeps #2344

Merged
merged 47 commits into from
Nov 16, 2022
Merged

Feature 1943 gridstat seeps #2344

merged 47 commits into from
Nov 16, 2022

Conversation

hsoh-u
Copy link
Collaborator

@hsoh-u hsoh-u commented Nov 14, 2022

  • point_stat and grid_stat
    • The SEEPS QC (by p1, Probability of being dry) is added
      • Configured by using "seeps_p1_thresh" (default is disabled, "seeps_p1_thresh = NA;")
      • p1 threshold for SYNOP: "seeps_p1_thresh = >=0.1&&<=0.85;"
    • The weighted SEEPS score is computed (it was averaged score)
    • The columns for SEEPS are computed (S12, S13, S21, S23, S31, S32, PV1, PV2, PV3, PF1, PF2, PF3)
  • grid_stat
    • SEEPS line type is generated
    • The three columns of SEEPS MPR (SCORE, OBS_CAT, FCST_CAT) are added to NetCDF
    • "seeps" is added to output_flag of grid_stat configuration files
    • unit test, grid_stat_UK_SEEPS, is added
      • The climo file for grid is added to MET_TEST_INPUT/climatology_data/seeps/PPT24_seepsweights_grid.nc
    • The climo file for grid is configurable by using the environment variable MET_SEEPS_GRID_CLIMO_NAME
  • point_stat
    • The density vector for point_stat is computed with density_radius 0.75 degree

Expected Differences

  • Do these changes introduce new tools, command line arguments, or configuration file options? [No]

    If yes, please describe:

It produces the SEESP text output. It adds SEEPS MPR score, obs_category and forecast category to NetCDF.

  • Do these changes modify the structure of existing or add new output data types (e.g. statistic line types or NetCDF variables)? [No]

    If yes, please describe:

SEEPS line type is added like point_stat
Some SEEPS_MPR data is saved to NetCDF output

Pull Request Testing

  • Describe testing already performed for these changes:

Added it to unit test

  • Recommend testing for the reviewer(s) to perform, including the location of input datasets, and any additional instructions:

  • Do these changes include sufficient documentation updates, ensuring that no errors or warnings exist in the build of the documentation? [Yes]

  • Do these changes include sufficient testing updates? [Yes]

  • Will this PR result in changes to the test suite? [Yes or No]

    If yes, describe the new output and/or changes to the existing output:

Yes two more unit test: one for grid_stat and the other for stat_ananysisfrom another PR, Feature #2339 stat_analysis_seeps by @JohnHalleyGotway

  • Please complete this pull request review by [Fill in date].

Pull Request Checklist

See the METplus Workflow for details.

  • Review the source issue metadata (required labels, projects, and milestone).
  • Complete the PR definition above.
  • Ensure the PR title matches the feature or bugfix branch name.
  • Define the PR metadata, as permissions allow.
    Select: Reviewer(s)
    Select: Organization level software support Project or Repository level development cycle Project
    Select: Milestone as the version that will include these changes
  • After submitting the PR, select Development with the original issue number.
  • After the PR is approved, merge your changes. If permissions do not allow this, request that the reviewer do the merge.
  • Close the linked issue and delete your feature or bugfix branch from GitHub.

Howard Soh and others added 30 commits November 9, 2022 13:00
Copy link
Collaborator

@JohnHalleyGotway JohnHalleyGotway left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There are still problems remaining to be fixed. I inspected the diffs flagged in this GHA run.

It adds 5 new output files and modifies 3 Point-Stat output files.

Please look specifically at this file that is modified:

point_stat_NCMET_NAM_NDAS_SEEPS_360000L_20120410_120000V_seeps.txt

The SEEPS column that previously reported reasonable values between 0 and 1 are now all 0. Also the S, PF, and PV columns that were previously NA are now also 0.

@hsoh-u
Copy link
Collaborator Author

hsoh-u commented Nov 16, 2022

There are still problems remaining to be fixed. I inspected the diffs flagged in this GHA run.

It adds 5 new output files and modifies 3 Point-Stat output files.

Please look specifically at this file that is modified:

point_stat_NCMET_NAM_NDAS_SEEPS_360000L_20120410_120000V_seeps.txt

The SEEPS column that previously reported reasonable values between 0 and 1 are now all 0. Also the S, PF, and PV columns that were previously NA are now also 0.

I informed Rachel that our test data and my sample IDL run (lat=(10,20,30,40,50), lon=(20,40,60,80,100)) produce all 0 density_vector with density_radius 0.75. That's why all columns becomes 0 and weighed SEEPS score, too. I will change back to use averaged SEEPS score, not weighted score.

Copy link
Collaborator

@JohnHalleyGotway JohnHalleyGotway left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Inspected the diffs flagged in this GHA run:

  1. Adds 5 new output files:
ERROR: folder /data/output/met_test_truth missing 5 files
    grid_stat/grid_stat_SEEPS_000000L_20211202_000000V.stat
    grid_stat/grid_stat_SEEPS_000000L_20211202_000000V_pairs.nc
    grid_stat/grid_stat_SEEPS_000000L_20211202_000000V_seeps.txt
    stat_analysis_ps/CONFIG_POINT_STAT_filter_seeps.stat
    stat_analysis_ps/POINT_STAT_SEEPS.out

It modifies the data in 7 existing output files.

  1. Diffs in 3 files from Point-Stat:
file1: /data/output/met_test_truth/point_stat/point_stat_NCMET_NAM_NDAS_SEEPS_360000L_20120410_120000V.stat
file1: /data/output/met_test_truth/point_stat/point_stat_NCMET_NAM_NDAS_SEEPS_360000L_20120410_120000V_seeps.txt
file1: /data/output/met_test_truth/point_stat/point_stat_NCMET_NAM_NDAS_SEEPS_360000L_20120410_120000V_seeps_mpr.txt
  • For SEEPS_MPR, the P2 column for the same point locations have changed. For example, for 71940 55.18 -118.88 P2 changes from 0.78 to 0.927. Is this expected?
  • For SEEPS, the TOTAL column generally decreases from 249 to 230. I assume that's because we're doing more filtering and is expected. The MEAN_FCST, MEAN_OBS, and SEEPS columns change, but that makes sense with a different number of points used.
  • For SEEPS, the S, PF, and PV columns remain set as a value of 0. Is this expected? If we can populate those columns in the output of Grid-Stat, why aren't they populated for Point-Stat?
  1. Diffs in 4 files TC-Pairs and TC-Stat that can safely be ignored:
file1: /data/output/met_test_truth/tc_pairs/al092022_20220926_DIAGNOSTICS.tcst
file1: /data/output/met_test_truth/tc_pairs/al132020_CONSENSUS.tcst
file1: /data/output/met_test_truth/tc_stat/DIAGNOSTICS_filter_match_points.tcst
file1: /data/output/met_test_truth/tc_stat/DIAGNOSTICS_stat.out

@JohnHalleyGotway
Copy link
Collaborator

JohnHalleyGotway commented Nov 16, 2022

@hsoh-u I see the details you sent me separately via email. Sorry if I raised issues you already addressed there.

Please just confirm that all diffs I noted in my comment above are expected, and I'll approve the PR.

@hsoh-u
Copy link
Collaborator Author

hsoh-u commented Nov 16, 2022

@hsoh-u I see the details you sent me separately via email. Sorry if I raised issues you already addressed there.

Please just confirm that all diffs I noted in my comment above are expected, and I'll approve the PR.

The differences are expected:

  • point_stat
    • Compute S12, S13, S21, S23, S31, S32, PV1, PV2, PV3, PF1, PF2, PF3 columns
    • All of them are 0 because the density_vector is zero with density_radius 0.75.
    • If density_radius is big enough, non zero density_vector is generated
    • The SEEPS QC is enabled (0.1 <= p1 <= 0.85, so less SEEPS MPR lines are saved
    • It supposed to save the weighted SEEPS score, but averaged SEEPS score is printed out for SEEPS line type (becuase weighted SEEPS score is 0)
  • grid_stat
    • three new output files with new unit test.

@JohnHalleyGotway JohnHalleyGotway self-requested a review November 16, 2022 21:43
Copy link
Collaborator

@JohnHalleyGotway JohnHalleyGotway left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

After @hsoh-u confirmed that all diffs are expected, I approve of these changes.

I do suspect that testing from @RachelNorth may reveal that some tweaks to the logic are needed, but let's proceed with these changes now.

Thanks for all your work on this @hsoh-u!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
No open projects
Development

Successfully merging this pull request may close these issues.

Enhance Grid-Stat to compute SEEPS for gridded observations and write the SEEPS STAT line type.
2 participants