extend model arg usage in io_pymc3 to fix plot_ppc with prior #1045

OriolAbril · 2020-02-05T14:34:33Z

Description

Try to improve handling of prior groups and observed data in io_pymc3. Related to #1002. This introduces an important change in behaviour: it proposes to exclude the observed_data group whenever predictions are present. @corriebar @rpgoldman
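
As a rough illustration of the grouping idea (not the actual io_pymc3 code), the sketch below splits prior samples into `prior` and `prior_predictive` using the names of the observed variables, which would come from the model, and skips `observed_data` when predictions are present. The function name `split_groups` and the plain-dict inputs are assumptions made for this example.

```python
from typing import Any, Dict, Iterable, Optional


def split_groups(
    prior: Dict[str, Any],
    observed_names: Iterable[str],
    observed_data: Optional[Dict[str, Any]] = None,
    predictions: Optional[Dict[str, Any]] = None,
) -> Dict[str, Dict[str, Any]]:
    """Hypothetical helper: assign samples to InferenceData-like groups."""
    observed = set(observed_names)
    groups: Dict[str, Dict[str, Any]] = {
        # Unobserved variables sampled from the prior
        "prior": {k: v for k, v in prior.items() if k not in observed},
        # Observed variables sampled from the prior
        "prior_predictive": {k: v for k, v in prior.items() if k in observed},
    }
    if predictions is not None:
        # Predictions present: observed_data is deliberately left out
        groups["predictions"] = predictions
    elif observed_data is not None:
        groups["observed_data"] = observed_data
    # Drop groups that ended up empty
    return {name: values for name, values in groups.items() if values}
```

With `predictions` supplied, the returned dictionary has no `observed_data` key even if observed data were passed in, which mirrors the behaviour change proposed above.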

I also extended tests on io_pymc and improved the helper function check_multiple_attrs.
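
For readers unfamiliar with the helper, here is a minimal sketch of what a `check_multiple_attrs` that also asserts negations could look like. The `~` prefix used to mean "must be absent" is an assumption for this sketch, and the `SimpleNamespace` objects only stand in for an InferenceData object.

```python
from types import SimpleNamespace
from typing import Dict, List


def check_multiple_attrs(test_dict: Dict[str, List[str]], parent: object) -> list:
    """Return the failed checks; a leading "~" asserts absence (assumed convention)."""
    failed = []
    for group_name, attributes in test_dict.items():
        if group_name.startswith("~"):
            # Negated group: fails only if the group is present
            if hasattr(parent, group_name[1:]):
                failed.append(group_name)
        elif hasattr(parent, group_name):
            group = getattr(parent, group_name)
            for attribute in attributes:
                if attribute.startswith("~"):
                    # Negated attribute: fails only if the attribute is present
                    if hasattr(group, attribute[1:]):
                        failed.append((group_name, attribute))
                elif not hasattr(group, attribute):
                    failed.append((group_name, attribute))
        else:
            # Required group is missing
            failed.append(group_name)
    return failed


# Stand-in for an InferenceData with two groups
idata = SimpleNamespace(
    prior=SimpleNamespace(mu=1),
    prior_predictive=SimpleNamespace(y=2),
)
fails = check_multiple_attrs(
    {"prior": ["mu", "~y"], "prior_predictive": ["y"], "~observed_data": []},
    idata,
)
```

Returning the list of failures instead of a boolean lets a test write `assert not fails` and still see which group or variable caused the problem.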

And finally, ~~how do you feel about deprecating the use of from_pymc3 without model nor trace (or even without model)? I was thinking of adding a warning.~~ For now, calling from_pymc3 with neither a model nor a trace will raise a pending deprecation warning as a start. @canyon289 @aloctavodia @ColCarroll
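
A minimal sketch of the warning pattern, assuming a hypothetical `resolve_model` function in place of the real converter logic (the message text is also made up for the example):

```python
import warnings
from typing import Any, Optional


def resolve_model(trace: Optional[Any] = None, model: Optional[Any] = None) -> Optional[Any]:
    """Hypothetical stand-in for the model lookup done by a converter."""
    if trace is None and model is None:
        warnings.warn(
            "Using the converter with neither a model nor a trace is pending "
            "deprecation; pass model=... or call it inside a model context.",
            PendingDeprecationWarning,
            stacklevel=2,  # point the warning at the caller, not at this function
        )
    return model
```

Note that Python's default warning filters ignore `PendingDeprecationWarning`, so users will typically see it only under a test runner or with an explicit filter such as `-W default`.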

Related to #939.

Checklist

Does the PR follow official PR format?
Is the new feature properly documented with an example?
Does the PR include new or updated tests to cover the new feature (using pytest fixture pattern)?
Is the code style correct (follows pylint and black guidelines)?
Is the change listed in changelog?

rpgoldman · 2020-02-05T17:21:18Z

arviz/data/io_pymc3.py

@@ -121,8 +121,7 @@ def arbitrary_element(dct: Dict[Any, np.ndarray]) -> np.ndarray:
    def find_observations(self) -> Optional[Dict[str, Var]]:
        """If there are observations available, return them as a dictionary."""
        has_observations = False
-        if self.trace is not None:


I don't really like leaving out the check that tells the caller why they are not getting observations out of their trace when they expect them. Perhaps replace my error with a UserWarning?
It seems like ArviZ tries to get whatever information it can out of a trace, and ignores whatever it can't figure out. This makes it relatively robust, but means that the user doesn't know that they could get more information into the InferenceData by providing a model.

OriolAbril · 2020-02-05

I completely agree. I was just waiting to replace it with an informative warning or a deprecation warning.

rpgoldman · 2020-02-05T17:23:02Z

Per my comment in the code and your question in the PR description -- I agree it would be a good idea to require that the model be supplied. I think this is particularly important because the user could get an inappropriate model out of the trace.
This may seem far-fetched, but with PyMC3 one actually has to move a trace to a new model in some cases in order to do out-of-sample predictions.

arviz/tests/helpers.py

OriolAbril · 2020-02-06T00:00:53Z

Only the deprecation warning for the model argument is left to do; after that we can merge.

OriolAbril · 2020-02-10T13:54:28Z

I decided to start with the deprecation when neither a model nor a trace is passed (it will be triggered in cases such as only prior or only posterior predictive/predictions). We can make it stricter next release by requiring a model even when the trace is present, and eventually remove the _straces trick. In my opinion, PyMC3 users should not mind having to call from_pymc3 within a model context or passing the model as a kwarg; it is already the workflow for sample, sample_prior_predictive...
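
To make the "model context or kwarg" workflow concrete, here is a toy sketch of how a converter can pick up the model either way. The `Model` class and `convert` function are hypothetical and only mimic the `with pm.Model():` pattern; how PyMC3 tracks its context internally is not shown here.

```python
from typing import List, Optional


class Model:
    """Toy model that tracks the active context, loosely mimicking `with pm.Model():`."""

    _stack: List["Model"] = []

    def __enter__(self) -> "Model":
        Model._stack.append(self)
        return self

    def __exit__(self, *exc_info) -> None:
        Model._stack.pop()

    @classmethod
    def current(cls) -> Optional["Model"]:
        """Return the innermost active model, or None outside any context."""
        return cls._stack[-1] if cls._stack else None


def convert(model: Optional[Model] = None) -> Model:
    """Hypothetical converter: prefer the explicit kwarg, else the active context."""
    model = model if model is not None else Model.current()
    if model is None:
        raise ValueError("No model found: pass model=... or call inside a model context.")
    return model
```

Inside `with Model() as m:` a bare `convert()` returns `m`; outside any context, `convert(model=m)` does the same, and `convert()` raises.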

I think it is ready to merge

rpgoldman · 2020-02-10T14:13:01Z

I agree. If the tests pass again, we should merge. Looks very good!

OriolAbril · 2020-02-11T22:39:53Z

This makes it possible to run prior predictive checks right away with PyMC3!

import arviz as az
import pymc3 as pm

with pm.Model():
    # define model
    prior = pm.sample_prior_predictive()
    # from_pymc3 has to be called either from within a model context
    # or with model=model passed explicitly; prior is keyword-only
    idata = az.from_pymc3(prior=prior)

az.plot_ppc(idata, group="prior")

I am not really sure about the best way to advertise this though. Is there a PyMC3 tutorial on prior predictive sampling? @canyon289 @aloctavodia

AlexAndorra · 2020-02-12T13:27:20Z

This is great -- thanks guys!!
To my knowledge, there is no tutorial dedicated to prior pred checks on PyMC's website. I found:

- The putting notebook, where Colin does some prior checks -- but maybe too complicated a use-case for an introduction.
- I updated the radon NB a few weeks ago, and among other things added prior checks. The new version is not up yet (but it's on master if you wanna check it out), but this could be a good avenue to advertise this new use.
- One NB dedicated to posterior predictive checks. We could extend (and update) it -- this would make a useful educational resource to point to.

In general, I noticed on the Discourse that people find it hard to understand the concept of prior/posterior pred checks -- or just don't know about them. So I think an intro tutorial would be really useful.

OriolAbril added 3 commits February 5, 2020 14:49

extend check_multiple attrs to assert negations

5a1020b

do not create observed_data if predictions are present

ad510ee

divide prior groups using model instead of trace

1ef4ba8

OriolAbril changed the title ~~Io pymc model~~ extend model arg usage in io_pymc3 to fix plot_ppc with prior Feb 5, 2020

rpgoldman reviewed Feb 5, 2020

arviz/tests/helpers.py (resolved)

rpgoldman approved these changes Feb 5, 2020

OriolAbril added 2 commits February 5, 2020 20:37

add to changelog

fceccf9

type hinting and docstring extension

c564199

OriolAbril changed the title ~~extend model arg usage in io_pymc3 to fix plot_ppc with prior~~ [WIP] extend model arg usage in io_pymc3 to fix plot_ppc with prior Feb 6, 2020

rpgoldman approved these changes Feb 6, 2020

OriolAbril added 4 commits February 6, 2020 22:20

Add docstring to from_pymc3

46f9ddc

change legend in plot_ppc to match group used

5236351

pending deprecation warning

14994e6

black

8814826

lint

710fc23

OriolAbril changed the title ~~[WIP] extend model arg usage in io_pymc3 to fix plot_ppc with prior~~ extend model arg usage in io_pymc3 to fix plot_ppc with prior Feb 10, 2020

OriolAbril merged commit d33192e into master Feb 11, 2020

OriolAbril deleted the io_pymc_model branch February 11, 2020 22:34
