Added gCNV integration test to detect numerical differences in the outputs. #7889

asmirnov239 · 2022-06-09T21:03:40Z

This PR expands the GermlineCNVCaller integration test suite and addresses #6893 and #4375. The tests that were added are:

Numerical accuracy test that checks for changes of gCNV model posterior values as compared to a previously computed model. This test is meant to detect Python library updates that affect gCNV results and unintentional consequences of minor gCNV model changes.
A test that runs gCNV in the COHORT mode with a pre-trained model as a starting point.
A test that runs gCNV with an annotated intervals file that contain GC content column.

As @samuelklee suggested we should consider adding functionality to the GermlineCNVCallerIntegrationTest to regenerate test files when there is a discrepancy in gCNV model outputs and we are okay with that discrepancy. See example of it in the HaplotypeCallerSparkIntegrationTest class -- specifically note UPDATE_EXACT_MATCH_EXPECTED_OUTPUTS flag. @mwalker174 let me know what you think.

asmirnov239 · 2022-06-09T21:04:30Z

...st/java/org/broadinstitute/hellbender/tools/copynumber/GermlineCNVCallerIntegrationTest.java

+        runCommandLine(argsBuilder);
+        MODEL_FILES_TO_COMPARE.forEach(f -> CopyNumberTestUtils.assertFilesEqualUpToAllowedDeltaForDoubleValues(
+                new File(Paths.get(OUTPUT_DIR.getAbsolutePath(), outputPrefix + "-model").toString(), f),
+                new File(MODEL_EXACT_MATCH_EXPECTED_SUB_DIR,  f),


Note that I reused a previosly computed gCNV model for the numerical accuracy test. Good news is that the results haven't deviated since the time we made these test files. However, perhaps we should regenerate and create dedicated model files just for this test - especially to have a more stable test that compares denoised copy ratio values (see comment about the mu_denoised_copy_ratio_t.tsv).

Yes let's do what @samuelklee suggested and have a UPDATE_EXACT_MATCH_EXPECTED_OUTPUTS option.

I've implemented that. Note that I had to move the test files into the new directory 'gcnv-numerical-accuracy'.

asmirnov239 · 2022-06-09T21:04:54Z

...st/java/org/broadinstitute/hellbender/tools/copynumber/GermlineCNVCallerIntegrationTest.java

+
+    final List<String> MODEL_FILES_TO_COMPARE = Arrays.asList("log_q_tau_tk.tsv", "mu_ard_u_log__.tsv", "mu_psi_t_log__.tsv",
+            "std_ard_u_log__.tsv", "std_psi_t_log__.tsv", "mu_W_tu.tsv", "mu_log_mean_bias_t.tsv", "std_W_tu.tsv", "std_log_mean_bias_t.tsv");
+    final List<String> CALLS_FILES_TO_COMPARE = Arrays.asList("baseline_copy_number_t.tsv", "log_c_emission_tc.tsv",


I removed the 'mu_denoised_copy_ratio_t.tsv' and 'std_denoised_copy_ratio_t.tsv' files from the numerical accuracy test because it was not passing with the ALLOWED_DELTA_FOR_DOUBLE_VALUES = 1E-6, however the values are not too different. I can potentially get around it by 1) regenerating the model, 2) taking more samples from the posterior or 3) setting the ALLOWED_DELTA_FOR_DOUBLE_VALUES to something smaller or some combination of the above.

I'd prefer (2)

It looks like the test pass now with the default value for the number of copy ratio samples.

codecov · 2022-06-09T22:15:47Z

Codecov Report

Merging #7889 (211aefb) into master (9ae1fd8) will decrease coverage by 1.679%.
The diff coverage is 17.647%.

❗ Current head 211aefb differs from pull request most recent head e4100fe. Consider uploading reports for the commit e4100fe to get more accurate results

@@               Coverage Diff               @@
##              master     #7889       +/-   ##
===============================================
- Coverage     86.954%   85.275%   -1.679%     
- Complexity     36897     37924     +1027     
===============================================
  Files           2214      2310       +96     
  Lines         173540    180395     +6855     
  Branches       18736     19841     +1105     
===============================================
+ Hits          150900    153831     +2931     
- Misses         16037     19755     +3718     
- Partials        6603      6809      +206

Impacted Files	Coverage Δ
...s/copynumber/GermlineCNVCallerIntegrationTest.java	`16.346% <17.647%> (-72.333%)`	⬇️
...adinstitute/hellbender/utils/R/RScriptLibrary.java	`0.000% <0.000%> (-100.000%)`	⬇️
...e/hellbender/utils/R/RScriptExecutorException.java	`0.000% <0.000%> (-100.000%)`	⬇️
...nder/metrics/analysis/AlleleFrequencyQCMetric.java	`0.000% <0.000%> (-100.000%)`	⬇️
...trics/analysis/BaseDistributionByCycleMetrics.java	`0.000% <0.000%> (-100.000%)`	⬇️
...OverlappingIntegerCopyNumberSegmentCollection.java	`0.000% <0.000%> (-100.000%)`	⬇️
.../CollectInsertSizeMetricsSparkIntegrationTest.java	`4.878% <0.000%> (-95.122%)`	⬇️
...der/utils/python/PythonScriptExecutorUnitTest.java	`4.225% <0.000%> (-94.366%)`	⬇️
...der/tools/walkers/vqsr/CNNVariantPipelineTest.java	`5.882% <0.000%> (-94.118%)`	⬇️
.../QualityScoreDistributionSparkIntegrationTest.java	`5.970% <0.000%> (-94.030%)`	⬇️
... and 348 more

...st/java/org/broadinstitute/hellbender/tools/copynumber/GermlineCNVCallerIntegrationTest.java

src/testUtils/java/org/broadinstitute/hellbender/testutils/CopyNumberTestUtils.java

mwalker174 · 2022-06-14T19:09:46Z

...st/java/org/broadinstitute/hellbender/tools/copynumber/GermlineCNVCallerIntegrationTest.java

+        runCommandLine(argsBuilder);
+        MODEL_FILES_TO_COMPARE.forEach(f -> CopyNumberTestUtils.assertFilesEqualUpToAllowedDeltaForDoubleValues(
+                new File(Paths.get(OUTPUT_DIR.getAbsolutePath(), outputPrefix + "-model").toString(), f),
+                new File(MODEL_EXACT_MATCH_EXPECTED_SUB_DIR,  f),


Yes let's do what @samuelklee suggested and have a UPDATE_EXACT_MATCH_EXPECTED_OUTPUTS option.

gatk-bot · 2022-07-29T17:52:39Z

Github actions tests reported job failures from actions build 2762081645
Failures in the following jobs:

Test Type	JDK	Job ID	Logs
conda	8	2762081645.3	logs

mwalker174

Thank you @asmirnov239 and @samuelklee for the numerical accuracy testing.

Do you need to include the elbo history files? They're a bit long. Otherwise, good to merge!

…tput of gCNV in the COHORT mode. Also added two minor tests covering different use cases.

…rationTest and regenerated test files for GermlineCNVCaller numerical accuracy integration tests.

asmirnov239 commented Jun 9, 2022

View reviewed changes

mwalker174 reviewed Jun 14, 2022

View reviewed changes

samuelklee mentioned this pull request Jun 24, 2022

Added numerical-stability tests and updated test data for all ModelSegments single-sample and multiple-sample modes. #7652

Merged

asmirnov239 force-pushed the as_add_gcnv_integration_tests branch 2 times, most recently from e4100fe to fb4611c Compare August 1, 2022 21:53

asmirnov239 marked this pull request as ready for review August 2, 2022 04:04

mwalker174 approved these changes Aug 5, 2022

View reviewed changes

asmirnov239 added 2 commits September 10, 2022 03:46

Added gCNV integration test to detect numerical differences in the ou…

b53b41b

…tput of gCNV in the COHORT mode. Also added two minor tests covering different use cases.

Added an option to update expected outputs for GermlineCNVCallerInteg…

2a44263

…rationTest and regenerated test files for GermlineCNVCaller numerical accuracy integration tests.

asmirnov239 force-pushed the as_add_gcnv_integration_tests branch from 08625c3 to 2a44263 Compare September 10, 2022 03:49

asmirnov239 merged commit fee7b94 into master Sep 12, 2022

asmirnov239 deleted the as_add_gcnv_integration_tests branch September 12, 2022 15:09

samuelklee mentioned this pull request Nov 22, 2022

Add pytorch to the conda environment #8094

Draft

samuelklee mentioned this pull request Oct 31, 2023

Updated Python and PyMC, removed TensorFlow, and added PyTorch in conda environment. #8561

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added gCNV integration test to detect numerical differences in the outputs. #7889

Added gCNV integration test to detect numerical differences in the outputs. #7889

asmirnov239 commented Jun 9, 2022

asmirnov239 Jun 9, 2022

mwalker174 Jun 14, 2022

asmirnov239 Aug 3, 2022

asmirnov239 Jun 9, 2022

mwalker174 Jun 14, 2022

asmirnov239 Aug 3, 2022

codecov bot commented Jun 9, 2022 •

edited

Loading

mwalker174 Jun 14, 2022

gatk-bot commented Jul 29, 2022

mwalker174 left a comment

Added gCNV integration test to detect numerical differences in the outputs. #7889

Added gCNV integration test to detect numerical differences in the outputs. #7889

Conversation

asmirnov239 commented Jun 9, 2022

asmirnov239 Jun 9, 2022

Choose a reason for hiding this comment

mwalker174 Jun 14, 2022

Choose a reason for hiding this comment

asmirnov239 Aug 3, 2022

Choose a reason for hiding this comment

asmirnov239 Jun 9, 2022

Choose a reason for hiding this comment

mwalker174 Jun 14, 2022

Choose a reason for hiding this comment

asmirnov239 Aug 3, 2022

Choose a reason for hiding this comment

codecov bot commented Jun 9, 2022 • edited Loading

Codecov Report

mwalker174 Jun 14, 2022

Choose a reason for hiding this comment

gatk-bot commented Jul 29, 2022

mwalker174 left a comment

Choose a reason for hiding this comment

codecov bot commented Jun 9, 2022 •

edited

Loading