data io - change `from_cmdstanpy` to work with CmdStanPy version 0.9.68 #1558

mitzimorris · 2021-02-13T00:47:59Z

Description

CmdStanPy version 0.9.68 changed several properties and methods on the CmdStanMCMC object;
critically, the protected property _num_warmup was replaced with property num_draws_warmup
and corresponding num_draws_sampling. Also, the CmdStanMCMC object properties
stan_vars_cols and sampler_vars_cols provide a mapping from variable names to the output columns.

To fix from_cmdstanpy, I added a few helper functions which manipulate lists of variable names.
Going forward, the method _unpack_fit can be used instead of _unpack_frame.
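
A minimal sketch of the kind of version-tolerant lookup this change implies, assuming only the property names mentioned above. `OldFit`, `NewFit`, and `warmup_draws` are hypothetical stand-ins for illustration; they are not CmdStanPy or ArviZ code.

```python
class OldFit:
    """Hypothetical stand-in for a fit object from before 0.9.68."""

    _num_warmup = 500


class NewFit:
    """Hypothetical stand-in for a fit object from 0.9.68."""

    num_draws_warmup = 1000
    num_draws_sampling = 1000


def warmup_draws(fit):
    """Return the warmup draw count, preferring the public property."""
    if hasattr(fit, "num_draws_warmup"):
        return fit.num_draws_warmup
    # Fall back to the protected attribute used by older versions (assumed).
    return getattr(fit, "_num_warmup", 0)
```

Checking for the attribute rather than parsing a version string keeps the lookup working for either object shape.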

Checklist

Follows official PR format

mitzimorris · 2021-02-13T00:48:30Z

I didn't add any new unit tests; existing tests work.

ahartikainen

Looks good.

I think the latest release should be the default code path (and we can drop support for the old code after some time; 1 year?)

ahartikainen · 2021-02-13T17:14:30Z

arviz/data/io_cmdstanpy.py

@@ -393,6 +438,105 @@ def to_inference_data(self):
            },
        )

+    @requires("posterior")
+    def posterior_to_xarray_v68(self):


Could we change the logic so this is "default" and the other function is "old"?
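
One way the suggested default/old split could be structured, as a rough sketch only. The function names `posterior_default`, `posterior_legacy`, and `convert_posterior` are hypothetical, and using the presence of `stan_vars_cols` as the version signal is an assumption based on the PR description.

```python
def posterior_default(fit):
    """Hypothetical converter for the latest CmdStanPy release."""
    return "default"


def posterior_legacy(fit):
    """Hypothetical converter kept only for older releases."""
    return "legacy"


def convert_posterior(fit):
    """Use the default path unless the fit lacks the newer properties."""
    if hasattr(fit, "stan_vars_cols"):
        return posterior_default(fit)
    return posterior_legacy(fit)
```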

codecov · 2021-02-13T17:45:58Z

Codecov Report

Merging #1558 (989fe73) into main (6fa1ce8) will decrease coverage by 0.89%.
The diff coverage is 59.86%.

@@            Coverage Diff             @@
##             main    #1558      +/-   ##
==========================================
- Coverage   91.07%   90.17%   -0.90%     
==========================================
  Files         105      105              
  Lines       11361    11420      +59     
==========================================
- Hits        10347    10298      -49     
- Misses       1014     1122     +108

| Impacted Files | Coverage Δ | |
|---|---|---|
| arviz/data/io_cmdstanpy.py | `60.70% <59.86%> (-37.50%)` | ⬇️ |
| arviz/data/io_cmdstan.py | `91.89% <0.00%> (-0.08%)` | ⬇️ |

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

mitzimorris · 2021-02-13T20:36:09Z

Hi @ahartikainen, I refactored and added a number of helper functions that do the data munging, specifically:

- parsing base variable names from the column labels
- munging sampler stats column names and data types
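
The two helper tasks listed above might look roughly like the following. This is a sketch under stated assumptions, not the code in this PR: it assumes column labels of the form `theta[1]`, assumes sampler stat names carry a trailing `__`, and the names `base_variable_names`, `clean_sampler_stat`, and `INT_STATS` are hypothetical.

```python
# Sampler stats assumed to be integer-valued (an assumption for illustration).
INT_STATS = {"treedepth__", "n_leapfrog__"}


def base_variable_names(columns):
    """Group column labels such as 'theta[1]' under their base name 'theta'."""
    groups = {}
    for col in columns:
        base = col.split("[", 1)[0]
        groups.setdefault(base, []).append(col)
    return groups


def clean_sampler_stat(name, values):
    """Strip the trailing '__' from a stat name and cast its values."""
    dtype = int if name in INT_STATS else float
    clean = name[:-2] if name.endswith("__") else name
    return clean, [dtype(v) for v in values]
```

Grouping by base name keeps the original column order within each variable, which matters when the flat columns are later reshaped into arrays.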

OriolAbril

It looks great, thanks!

Regarding developing and maintaining the converters, do you think it would be easier to have a to_arviz in cmdstanpy instead?

It will probably be a bit more complicated in terms of setting up tests and ci in the inference libraries repos, but I feel like all the converters are becoming increasingly complicated due to having to support multiple versions: from_pystan is basically 2 converters in one to work with pystan2 and 3, from_pymc3 also has multiple checks, which get more complicated every time and it already gets to the point of monkeypatching pymc3.

mitzimorris · 2021-02-13T23:57:05Z

> It will probably be a bit more complicated in terms of setting up tests and ci in the inference libraries repos, but I feel like all the converters are becoming increasingly complicated due to having to support multiple versions: from_pystan is basically 2 converters in one to work with pystan2 and 3, from_pymc3 also has multiple checks, which get more complicated every time and it gets to the point of monkeypatching pymc3.

I can see how this is a problem. are you suggesting that to_arviz would return an InferenceData object?

OriolAbril · 2021-02-14T00:11:47Z

> I can see how this is a problem. are you suggesting that to_arviz would return an InferenceData object?

Yes, we can discuss that on the lab meeting this Friday, I have added a point about it. Currently ArviZ has cmdstanpy as a runtime dependency for from_cmdstanpy to work, same for pystan, pymc3... we could change the approach so that cmdstanpy has arviz as a runtime dependency for to_arviz to work (and we could keep from_cmdstanpy in ArviZ as an alias to to_arviz in cmdstanpy).

It may also require some changes to ArviZ (making sure all functions used by the converters are not private methods to begin with, maybe documenting how to create converters), but in the long run this approach will probably serve ArviZ's goals better and ease ArviZ integration with new libraries. In fact, mcx supports InferenceData natively using this approach: https://github.com/rlouf/mcx/blob/master/mcx/trace.py, and it basically uses dict_to_dataset and InferenceData.

mitzimorris · 2021-02-14T04:05:22Z

PR is now failing because of code coverage - we need tests for CmdStanPy 0.9.65 and 0.9.68.
what is the way to set this up?
other option, merge as is.

mitzimorris · 2021-02-14T04:09:10Z

> Yes, we can discuss that on the lab meeting this Friday, I have added a point about it. Currently ArviZ has cmdstanpy as a runtime dependency for from_cmdstanpy to work, same for pystan, pymc3... we could change the approach so that cmdstanpy has arviz as a runtime dependency for to_arviz to work (and we could keep from_cmdstanpy in ArviZ as an alias to to_arviz in cmdstanpy).

On further consideration, this isn't possible because the dependency would go in the wrong direction: CmdStanPy is upstream. Also, a design goal for CmdStanPy is minimal dependencies on other packages.

OriolAbril · 2021-02-14T04:43:38Z

I think it's no problem that the converters for old cmdstanpy versions are not tested. I am merging as is; tests can always be added in a follow-up PR.

…68 (arviz-devs#1558) * from_cmdstanpy for v0.9.68 * from_cmdstanpy for v0.9.68 * lint fix * lint fix * echanges per code review, also, refactor * lint fix * added docstrings * docstrings lint fix * docstrings lint fix * lintfix, baby one more time

mitzimorris added 4 commits February 1, 2021 10:37

Merge branch 'main' of https://github.com/arviz-devs/arviz into main

3c2568d

Merge branch 'main' of https://github.com/arviz-devs/arviz into main

fb85b55

from_cmdstanpy for v0.9.68

7591966

from_cmdstanpy for v0.9.68

3734ec1

mitzimorris requested review from ahartikainen and OriolAbril February 13, 2021 00:48

lint fix

b9b4680

ahartikainen reviewed Feb 13, 2021

lint fix

d76dad0

OriolAbril mentioned this pull request Feb 13, 2021

Change Theano references to Aesara #1553

Merged

mitzimorris added 2 commits February 13, 2021 15:19

echanges per code review, also, refactor

79cd81a

lint fix

6435880

mitzimorris added 2 commits February 13, 2021 15:57

added docstrings

0530981

docstrings lint fix

b956e77

OriolAbril approved these changes Feb 13, 2021

docstrings lint fix

0c04979

lintfix, baby one more time

989fe73

OriolAbril merged commit 43e7999 into main Feb 14, 2021

OriolAbril deleted the update/from_cmdstanpy branch February 14, 2021 04:43

mitzimorris mentioned this pull request Feb 14, 2021

Bugfix/from cmdstanpy patch #1562

Closed

2 tasks
