fix: Check parameter shapes for pdf API calls #1461

kratsg · 2021-05-17T12:05:43Z

Pull Request Description

Resolves #1459. Check that expected_data, expected_auxdata, and expected_actualdata are called with the right shape for pars (the parameters) before evaluating. Lower-level code usually catches a wrong shape, but the resulting error is not as user-friendly.

A quick note: it is ok to add if/raise in these API calls as they're not meant to be "fast" in our code -- but this could potentially be a problem if we need to allow for autodiff capabilities. For now, we're only enforcing that logpdf() calls are the performant ones.
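A minimal sketch of the kind of check described above, under stated assumptions: `ToyModel`, its `npars` attribute, and the local `InvalidPdfParameters` class are hypothetical stand-ins rather than pyhf's actual classes, and a plain list replaces a tensor, so `len(pars)` plays the role of `pars.shape[-1]`.

```python
class InvalidPdfParameters(Exception):
    """Local stand-in for the exception named in this PR (assumption)."""


class ToyModel:
    """Hypothetical model with a fixed number of parameters."""

    def __init__(self, npars):
        self.npars = npars

    def expected_data(self, pars):
        # Fail early with a readable message instead of letting
        # lower-level code raise a less helpful shape error.
        if len(pars) != self.npars:
            raise InvalidPdfParameters(
                f"eval failed as pars has len {len(pars)} "
                f"but {self.npars} was expected"
            )
        # Placeholder computation: scale each parameter by 10.
        return [10.0 * p for p in pars]


model = ToyModel(npars=2)
print(model.expected_data([1.0, 1.0]))  # right length: evaluates

try:
    model.expected_data([1.0])  # wrong length: raises
except InvalidPdfParameters as err:
    print(err)
```

The check costs one length comparison per call, which is why it seems acceptable on API calls that are not meant to be fast.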

Checklist Before Requesting Reviewer

Tests are passing
"WIP" removed from the title of the pull request
Selected an Assignee for the PR to be responsible for the log summary

Before Merging

For the PR Assignees:

Summarize commit messages into a comprehensive review of the PR

* Add checks on input parameter shapes for the pdf API calls to expected_auxdata, expected_actualdata, and expected_data.
* Add tests for parameter shape checks to properly raise exceptions.InvalidPdfParameters

codecov · 2021-05-17T12:15:25Z

Codecov Report

Merging #1461 (390b100) into master (ce70574) will increase coverage by 0.00%.
The diff coverage is 100.00%.

@@           Coverage Diff           @@
##           master    #1461   +/-   ##
=======================================
  Coverage   98.12%   98.12%           
=======================================
  Files          64       64           
  Lines        4270     4278    +8     
  Branches      683      687    +4     
=======================================
+ Hits         4190     4198    +8     
  Misses         46       46           
  Partials       34       34

| Flag | Coverage Δ | |
|---|---|---|
| contrib | `26.20% <0.00%> (-0.05%)` | ⬇️ |
| doctest | `60.47% <0.00%> (-0.12%)` | ⬇️ |
| unittests | `96.18% <100.00%> (+<0.01%)` | ⬆️ |


| Impacted Files | Coverage Δ | |
|---|---|---|
| src/pyhf/pdf.py | `97.85% <100.00%> (+0.05%)` | ⬆️ |

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

lukasheinrich · 2021-05-17T12:15:56Z

I wonder whether for these types of API we should have a pattern

def method(self, ...):
    # check inputs
    ...
    return self._method(...)

such that for performance-critical or internal paths (i.e. where pyhf itself calls method) we're able to call the "unsafe" _method, relying on prior checks, while the user gets a friendly API with safety checks.
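The pattern proposed above could look roughly like the following. This is a sketch only: `SplitModel`, `_check_pars`, and `scan` are hypothetical names chosen for illustration and are not part of pyhf.

```python
class SplitModel:
    """Hypothetical model illustrating a checked/unchecked method pair."""

    def __init__(self, npars):
        self.npars = npars

    def _check_pars(self, pars):
        if len(pars) != self.npars:
            raise ValueError(
                f"pars has len {len(pars)} but {self.npars} was expected"
            )

    def expected_data(self, pars):
        """Public entry point: validate the inputs, then delegate."""
        self._check_pars(pars)
        return self._expected_data(pars)

    def _expected_data(self, pars):
        """Unchecked path; callers must have validated pars already."""
        return [10.0 * p for p in pars]

    def scan(self, pars, scales):
        """Internal-style caller: validate once, then reuse the unchecked path."""
        self._check_pars(pars)
        return [self._expected_data([s * p for p in pars]) for s in scales]


model = SplitModel(npars=2)
print(model.expected_data([1.0, 2.0]))
print(model.scan([1.0, 2.0], scales=[1, 2]))
```

The trade-off is that every validated method gains a second, underscore-prefixed twin, and internal callers take on the responsibility of validating before using it.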

kratsg · 2021-05-17T12:21:09Z

> I wonder whether for these types of API we should have a pattern

I had this idea where I wanted to use decorators. It can introspect the arguments and do the checks based on something consistent, rather than copy/paste code around a lot. In this case, something like @pyhf.checks.pars that checks the number of parameters passed in for pdf calls, and would become a passthrough based on some pyhf.config configuration or similar.
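A rough sketch of the decorator idea, with loudly labeled assumptions: `check_pars` stands in for the suggested `@pyhf.checks.pars`, and the module-level `CONFIG` dict stands in for a `pyhf.config`-style switch. Neither exists in pyhf as written here.

```python
import functools

# Hypothetical switch standing in for a pyhf.config-style setting.
CONFIG = {"validate_pars": True}


def check_pars(func):
    """Check len(pars) against self.npars before calling func."""

    @functools.wraps(func)
    def wrapper(self, pars, *args, **kwargs):
        # With validation switched off, this is a plain passthrough.
        if CONFIG["validate_pars"] and len(pars) != self.npars:
            raise ValueError(
                f"{func.__name__} failed as pars has len {len(pars)} "
                f"but {self.npars} was expected"
            )
        return func(self, pars, *args, **kwargs)

    return wrapper


class DecoratedModel:
    """Hypothetical model whose public method is guarded by the decorator."""

    def __init__(self, npars):
        self.npars = npars

    @check_pars
    def expected_data(self, pars):
        return [10.0 * p for p in pars]


model = DecoratedModel(npars=2)
print(model.expected_data([1.0, 2.0]))
```

Reading the switch at call time keeps the sketch simple but still costs a dictionary lookup per call. Deciding at decoration time and returning `func` unchanged would remove that overhead, at the price of fixing the behavior when the class is defined.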

lukasheinrich · 2021-05-17T12:22:31Z

src/pyhf/pdf.py

+            raise exceptions.InvalidPdfParameters(
+                f'eval failed as pars has len {pars.shape[-1]} but {self.config.npars} was expected'
+            )
+
        return self.make_pdf(pars)[1].expected_data()

    def _modifications(self, pars):


lukasheinrich · 2021-05-17

Given that we do input validation here, it suggests that this has become a publicly consumable API. Maybe we should add a model.modifications, do the input checks there, and call _modifications if the inputs are ok.

kratsg · 2021-05-17

_modifications isn't currently a public API. This certainly could become one, although I might argue that we remove it from Model and keep it on MainModel unless there's a reason to pass it through, e.g. pdf.main_model.modifications is just as clear to me. Unless the suggestion here is to remove the checks from main_model and constraint_model and keep all checks on model, which is also possible but feels like a mess.

alexander-held · 2021-05-17

main_model is needed for return_by_sample, so it feels similarly "public" as Model.

kratsg · 2021-05-17

Sure, but main_model.expected_actualdata(..., return_by_sample) is fine, since that's public. However, main_model._modifications isn't necessarily public. Although it is at the moment only used by main_model.expected_data, so I'm fine either way.

alexander-held · 2021-05-17

> Unless the suggestion here is to remove the checks from main_model and constraint_model

My comment was meant as an example of why that may not be desirable; sorry, I should have made that clear. Promoting _modifications to public in the longer term is a nice idea; it looks very useful for model debugging.

alexander-held · 2021-10-19

I created an issue regarding the proposal of making _modifications public: #1652.

lukasheinrich · 2021-10-26

As it is now, this will be called on each logpdf call, so it's performance-critical. Should we have some kind of split of "pdf.method" and "pdf._method_unsafe"? Or do we think it doesn't make a difference?

matthewfeickert · 2021-12-07

@lukasheinrich @kratsg If we can revisit the performance impact here soon it would be nice to have this get into v0.7.0.
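One way to approach the question of whether the check makes a difference is to time a checked and an unchecked call side by side. The sketch below uses toy stand-in functions (`evaluate_unchecked` and `evaluate_checked` are hypothetical) on plain Python lists, so its numbers say nothing about pyhf's real logpdf on a tensor backend; it only illustrates the measurement.

```python
import timeit


def evaluate_unchecked(pars):
    # Toy stand-in for the actual evaluation work.
    return sum(p * p for p in pars)


def evaluate_checked(pars, npars):
    # Same work, preceded by the length check under discussion.
    if len(pars) != npars:
        raise ValueError(f"pars has len {len(pars)} but {npars} was expected")
    return evaluate_unchecked(pars)


pars = [0.1 * i for i in range(50)]
n_calls = 20_000

t_unchecked = timeit.timeit(lambda: evaluate_unchecked(pars), number=n_calls)
t_checked = timeit.timeit(lambda: evaluate_checked(pars, len(pars)), number=n_calls)

print(f"unchecked: {t_unchecked:.4f} s, checked: {t_checked:.4f} s ({n_calls} calls)")
```

Repeating the measurement with the real model and backend would be needed before drawing any conclusion about the split into checked and unchecked methods.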

matthewfeickert

This all LGTM @kratsg — thanks. I'll let you and @lukasheinrich resolve the current discussion and implement any changes that you want, but I'm happy to have this merged whenever you both are.


kratsg added the API (Changes the public API), feat/enhancement (New feature or request), and fix (A bug fix) labels May 17, 2021

kratsg self-assigned this May 17, 2021

kratsg requested review from lukasheinrich and matthewfeickert May 17, 2021 12:05

lukasheinrich reviewed May 17, 2021


matthewfeickert added the tests (pytest) label May 17, 2021

matthewfeickert approved these changes May 17, 2021


alexander-held mentioned this pull request Oct 19, 2021

Promote model._modifications to public API #1652

Closed

1 task

matthewfeickert force-pushed the feat/assertExpectedDataAPI branch from 62a7a0c to 9ea3230 Compare October 25, 2021 21:57

matthewfeickert requested review from alexander-held and lukasheinrich October 25, 2021 21:58

alexander-held reviewed Oct 26, 2021


kratsg and others added 5 commits December 10, 2021 14:43

* c61b0fa: checks for passed in parameter shapes
* b993093: add test
* f3e9e8e: fix up coverage
* b3ef03d: Rebase
* 390b100: Apply Alex's suggestion

matthewfeickert force-pushed the feat/assertExpectedDataAPI branch from e36109f to 390b100 Compare December 10, 2021 20:43

matthewfeickert changed the base branch from master to main September 21, 2022 20:53


lukasheinrich May 17, 2021

kratsg May 17, 2021

alexander-held May 17, 2021

kratsg May 17, 2021

alexander-held May 17, 2021

alexander-held Oct 19, 2021

lukasheinrich Oct 26, 2021

matthewfeickert Dec 7, 2021
