
Important Distribution-to-RandomVariable logic changes #4463

Closed · 13 tasks done
brandonwillard opened this issue Feb 5, 2021 · 10 comments

@brandonwillard (Contributor) commented Feb 5, 2021

PyMC's Distribution classes are being converted to RandomVariables in the v4 branch. A lot of core changes have already been made but a handful of important logic changes are still required in order to reinstate multiple PyMC features. This issue lists some points in the codebase where these changes are needed.

First, for anyone who's interested in helping out with the v4 branch, searching for the string XXX: will generally reveal parts of the logic that have been disabled and need to be refactored.

Here is a potentially outdated list of those parts accompanied by short summaries of the problem(s) and the work involved:

  • pymc3.model.Model.register_rv
    • I don't know what the whole dict-as-observed-data thing is about, so I couldn't finish refactoring this.
    • Marking as stale
  • pymc3.distributions.transforms.TransformedDistribution
    • The forward_val methods use draw_values, which has been removed. I think these methods can be removed entirely, because they only appear to be used by pymc3.sampling.sample_prior_predictive and I don't think that logic is relevant any longer.
    • Marking as stale
  • pymc3.gp.gp.[Marginal, MarginalKron]
    • These also use draw_values, and I think they just need to be replaced by the creation and reuse of a compiled theano.function. For example, draw_values([mu, cov], point=point) roughly translates to theano.function([model[n] for n in point.keys()], [mu, cov])(*point.values()), but we wouldn't want to compile that function every time a sample is needed, so we would need to do self.mu_cov_fn = theano.function([model[n] for n in point.keys()], [mu, cov]) somewhere (e.g. the constructor) and reuse self.mu_cov_fn in those methods (see the sketch after this list).
    • Addressed by PR v4 refactor for GP module #5055
  • pymc3.parallel_sampling._Process
    • The whole parallel sampling approach relies on a single set of fixed-shape shared memory arrays for each random variable, and the processes all appear to write to that set of memory. Unfortunately, that approach doesn't work when the random variables change shape during sampling. This doesn't need to be fixed immediately, but it is an unnecessary restriction caused by this specific approach to multi-processing.
    • Reminder issue: Allow step methods to work with variables that have dynamic sizes #5139
  • pymc3.variational.opvi.Group
    • The constructor for this class creates a DictToArrayBijection in a peculiar way and requires explicit shape information to do it. Just like all the other changes of this sort, we need to move the creation of the bijection(s) to the places where actual concrete samples are generated (and said bijection(s) are actually used/needed). I don't know enough about this code to do that, so someone who has worked on this should take a look.
    • Addressed by PR Make VI work on v4 #4582
  • pymc3.sampling.sample_posterior_predictive_w
  • pymc3.step_methods.elliptical_slice.EllipticalSlice.astep
  • pymc3.step_methods.metropolis
    • The first instance uses the old dsize field on the random variables to create a NumPy array that—in turn—is used to initialize the proposal distributions. Instead, we should use the initial sample values for each variable to do this, and perhaps update the proposal distributions when/if new samples are drawn that have new shapes.
    • As always, for all of these cases we can assume that the sizes of the initial values are representative of all future samples (i.e. shapes don't change), but we shouldn't rely on that assumption when it isn't necessary.
    • Besides the size/shape refactoring, there's also a draw_values that looks like it might be straightforward to fix.
    • Fixed in: 28fdda5
  • pymc3.step_methods.hmc.base_hmc.BaseHMC.__init__
  • pymc3.step_methods.gibbs.ElemwiseCategorical
    • Yet another use of the deprecated dshape property; it can always be replaced by the shapes of the initial sample point, but, as above, we shouldn't assume fixed shapes if avoidable.
    • Stepper was removed in 97dc4bd
  • pymc3.step_methods.sgmcmc.BaseStochasticGradient
  • pymc3.data
    • This code attempts to evaluate the shape, but I think the shape in question necessarily belongs to a (shared) constant variable, so it might not need to be changed; it's weird nonetheless.
    • Marking as stale
  • pymc3.model_graph.ModelGraph
    • This is another use of dshape; however, this one might not need to be changed, since the whole class could be replaced by the sample-space graphs provided by a Model object. The only unique feature I can immediately see is the Graphviz plate notation. Theano graphs, like the sample-space graphs, can already be converted to Graphviz Digraphs using functionality that's been in Theano for a while, but the plate notation may be a new thing that requires changes/additions.
    • Refactored in Refactor ModelGraph for v4 #4818

Originally posted by @brandonwillard in #4440 (comment)
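
As a rough illustration of the draw_values-to-theano.function replacement described in the GP bullet above, here is a minimal sketch. The class and attribute names (MarginalHelper, mu_cov_fn) are hypothetical, and it assumes the model's value variables can be looked up as model[name] for each key in point:

import theano

class MarginalHelper:
    def __init__(self, model, mu, cov, point):
        # Compile once (e.g. in the constructor) and reuse for every draw,
        # instead of effectively recompiling via draw_values([mu, cov], point=point)
        # each time a sample is needed.
        self._names = list(point.keys())
        self.mu_cov_fn = theano.function([model[n] for n in self._names], [mu, cov])

    def mu_cov(self, point):
        # Evaluate the cached function at a concrete point, keeping the argument
        # order consistent with the input order used at compile time.
        return self.mu_cov_fn(*[point[n] for n in self._names])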

@ricardoV94 (Member) commented:

> Instead, we should use the initial sample values for each variable to do this, and perhaps update the proposal distributions when/if new samples are drawn that have new shapes.

Can the Metropolis acceptance ratio be calculated between proposals with different shapes?

@brandonwillard (Contributor, Author) commented Feb 6, 2021

> Can the Metropolis acceptance ratio be calculated between proposals with different shapes?

Yes, but we can't delve into that here.

@ricardoV94 (Member) commented Feb 16, 2021

I am checking the V4 branch and this looks like an issue: when generating logpt graphs for model variables, it seems that tag.value_var is only being replaced for the parent/original variables but not in the variables that depend on them downstream. In the example below, __logp_y still depends on the RandomStateSharedVariable of x, even though __logp_x does not.

from aesara.printing import debugprint as dprint
import pymc3 as pm

with pm.Model() as m:
    x = pm.Uniform('x', lower=0, upper=1)
    y = pm.Uniform('y', lower=0, upper=x)
    
dprint(m.logpt)
Sum{acc_dtype=float64} [id A] '__logp'   
 |MakeVector{dtype='float64'} [id B] ''   
   |Sum{acc_dtype=float64} [id C] ''   
   | |Sum{acc_dtype=float64} [id D] ''   
   |   |Elemwise{mul,no_inplace} [id E] '__logp_x'   
   |     |Elemwise{switch,no_inplace} [id F] ''   
   |     | |Elemwise{mul,no_inplace} [id G] ''   
   |     | | |Elemwise{mul,no_inplace} [id H] ''   
   |     | | | |TensorConstant{1} [id I]
   |     | | | |Elemwise{mul,no_inplace} [id J] ''   
   |     | | |   |TensorConstant{1} [id K]
   |     | | |   |Elemwise{ge,no_inplace} [id L] ''   
   |     | | |     |x [id M]
   |     | | |     |TensorConstant{0.0} [id N]
   |     | | |Elemwise{mul,no_inplace} [id O] ''   
   |     | |   |TensorConstant{1} [id P]
   |     | |   |Elemwise{le,no_inplace} [id Q] ''   
   |     | |     |x [id M]
   |     | |     |TensorConstant{1.0} [id R]
   |     | |Elemwise{neg,no_inplace} [id S] ''   
   |     | | |Elemwise{log,no_inplace} [id T] ''   
   |     | |   |Elemwise{sub,no_inplace} [id U] ''   
   |     | |     |TensorConstant{1.0} [id R]
   |     | |     |TensorConstant{0.0} [id N]
   |     | |TensorConstant{-inf} [id V]
   |     |TensorConstant{1.0} [id W]
   |Sum{acc_dtype=float64} [id X] ''   
     |Sum{acc_dtype=float64} [id Y] ''   
       |Elemwise{mul,no_inplace} [id Z] '__logp_y'   
         |Elemwise{switch,no_inplace} [id BA] ''   
         | |Elemwise{mul,no_inplace} [id BB] ''   
         | | |Elemwise{mul,no_inplace} [id BC] ''   
         | | | |TensorConstant{1} [id BD]
         | | | |Elemwise{mul,no_inplace} [id BE] ''   
         | | |   |TensorConstant{1} [id BF]
         | | |   |Elemwise{ge,no_inplace} [id BG] ''   
         | | |     |y [id BH]
         | | |     |TensorConstant{0.0} [id BI]
         | | |Elemwise{mul,no_inplace} [id BJ] ''   
         | |   |TensorConstant{1} [id BK]
         | |   |Elemwise{le,no_inplace} [id BL] ''   
         | |     |y [id BH]
         | |     |uniform_rv.1 [id BM] 'x'   
         | |       |RandomStateSharedVariable(<RandomState(MT19937) at 0x7FF7A7954C40>) [id BN]
         | |       |TensorConstant{[]} [id BO]
         | |       |TensorConstant{11} [id BP]
         | |       |TensorConstant{0.0} [id N]
         | |       |TensorConstant{1.0} [id R]
         | |Elemwise{neg,no_inplace} [id BQ] ''   
         | | |Elemwise{log,no_inplace} [id BR] ''   
         | |   |Elemwise{sub,no_inplace} [id BS] ''   
         | |     |uniform_rv.1 [id BM] 'x'   
         | |     |TensorConstant{0.0} [id BI]
         | |TensorConstant{-inf} [id BT]
         |TensorConstant{1.0} [id BU]

As a consequence m.logp evaluations are stochastic:

m.logp({x: .9, y:.3})
array(0.26530097)

m.logp({x: .9, y:.3})
array(0.59186006)

@brandonwillard (Contributor, Author) commented Feb 16, 2021

The random variables definitely shouldn't be in there, but I think this case is covered by some of my unfinished changes to logpt (e.g. a simple rewrite step that replaces random variables with their value variables). Those changes are tied to other changes needed in order to support missing data, but I can try to decouple them.
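
For illustration, here is a minimal sketch of such a rewrite step (not the actual v4 implementation), assuming every random variable carries its value variable in rv.tag.value_var and that aesara.clone_replace is available:

import aesara

def replace_rvs_with_values(logp_graph, rvs):
    # Map each random variable to its registered value variable.
    replacements = {rv: rv.tag.value_var for rv in rvs}
    # Clone the graph so that every occurrence of an RV, including downstream
    # uses such as the upper=x bound of y above, points at its value variable.
    return aesara.clone_replace(logp_graph, replace=replacements)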

@brandonwillard (Contributor, Author) commented:

I pushed a change recently that should've fixed that issue in the log-likelihoods.

@ricardoV94 (Member) commented:

Another issue: The model logp is being constructed in terms of the untransformed variables.

master branch:

with pm.Model() as m: 
  x = pm.Uniform('x', 0, 1)

m.logp({'x': -1})  # TypeError: Missing required input: x_interval__ ~ TransformedDistribution

m.logp({'x_interval__': -1})                                            
array(-1.62652338)

V4 branch:

with pm.Model() as m: 
  x = pm.Uniform('x', 0, 1)

m.logp({'x': -1})
array(-inf)

m.logp({'x_interval__': -1})  # TypeError: Missing required input: x                                      

Which means NUTS is pretty much hopeless at the moment:

with m:
    trace = pm.sample(compute_convergence_checks=False)
Auto-assigning NUTS sampler...
Initializing NUTS using jitter+adapt_diag...
Multiprocess sampling (4 chains in 4 jobs)
NUTS: [x]
Sampling 4 chains for 1_000 tune and 1_000 draw iterations (4_000 + 4_000 draws total) took 4 seconds.
There were 997 divergences after tuning. Increase `target_accept` or reparameterize.
There were 999 divergences after tuning. Increase `target_accept` or reparameterize.
The chain contains only diverging samples. The model is probably misspecified.
The chain contains only diverging samples. The model is probably misspecified.

@ricardoV94 (Member) commented Feb 24, 2021

By the way, it would be really nice to keep generating the logp (under another method name) in terms of the untransformed variables. That way the graphs can be used directly for things that do not require or benefit from transformed variables (e.g., grid approximation); see the sketch below.
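
As a small illustration of that use case, here is a sketch of a grid approximation built on an untransformed logp; logp_untransformed is an assumed callable over the original (constrained) parameter space, not an existing PyMC method:

import numpy as np

def grid_approximation(logp_untransformed, grid):
    # Evaluate the log-density at each grid point in the original parameter
    # space (no interval transform or Jacobian terms involved) and normalize.
    logps = np.array([logp_untransformed({'x': value}) for value in grid])
    probs = np.exp(logps - logps.max())
    return probs / probs.sum()

For example, grid_approximation(logp_untransformed, np.linspace(0, 1, 101)[1:-1]) would approximate the normalized density of x over the interior of the unit interval.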

@brandonwillard (Contributor, Author) commented:

> The model logp is being constructed in terms of the untransformed variables.

Ah, yes, I was still in the process of determining how to handle transformations. I started by setting things up so that everything would work exactly as it currently does (i.e. always use the transformed variables when producing log-likelihoods), but, since it's just as easy to apply transforms to an existing log-likelihood graph (and doing so provides much more opportunity and flexibility), I stopped short of doing that.

I'll add the transform stuff shortly.
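
For concreteness, here is a minimal sketch (again, not the v4 implementation) of applying an interval transform to an existing logp graph. It assumes logp_x is an Aesara graph in terms of the value variable x_value and that aesara.clone_replace is available; the function name and the sigmoid-based backward transform are illustrative only:

import aesara
import aesara.tensor as at

def interval_transformed_logp(logp_x, x_value, lower, upper):
    # New unconstrained value variable to take the place of x_value.
    x_trans = x_value.type()
    # Backward (inverse) transform: real line -> (lower, upper).
    sig = 1 / (1 + at.exp(-x_trans))
    x_backward = lower + (upper - lower) * sig
    # Log-determinant of the Jacobian of the backward transform.
    log_jac = at.log(upper - lower) + at.log(sig) + at.log(1 - sig)
    # Substitute the backward-transformed value and add the Jacobian correction.
    logp_trans = aesara.clone_replace(logp_x, replace={x_value: x_backward}) + log_jac
    return logp_trans, x_trans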

@ricardoV94 (Member) commented:

I checked the bullet points that I know were already addressed, are stale, or have specific issues referring to them. We should revisit the open bullet points and check whether they still need to be worked on.

@ricardoV94 (Member) commented:

Closing this, as all points have been addressed: they are either stale, solved, or covered by more specific reminder issues.
