How to speed up model fitting of CustomDist? #6661
Hello, I'm trying to implement the skew Student-t distribution using

```python
def logp_skewt(value, nu, mu, sigma, alpha, *args, **kwargs):
    return (
        pm.math.log(2) +
        pm.logp(pm.StudentT.dist(nu, mu=mu, sigma=sigma), value) +
        pm.logcdf(pm.StudentT.dist(nu, mu=mu, sigma=sigma), alpha * value) -
        pm.math.log(sigma)
    )
```

I am able to sample from this distribution:

```python
with pm.Model():
    pm.CustomDist('target', 1, 0, 3, -10, logp=logp_skewt)
    model_trace = pm.sample(
        nuts_sampler="numpyro",
        draws=2_000,
        chains=1,
    )

samples = model_trace.posterior.target.to_numpy()
eps = 0.01
min_val, max_val = np.quantile(samples, [eps, 1 - eps])
valid_samples = samples[(samples >= min_val) & (samples <= max_val)]
```

However, when I try to re-fit the model, sampling becomes very slow:

```python
with pm.Model() as fitted_model:
    nu = pm.HalfCauchy('nu', beta=1)
    mu = pm.Normal('mu', mu=0, sigma=1)
    sigma = pm.HalfCauchy('sigma', beta=1)
    alpha = pm.Normal('alpha', mu=0, sigma=1)
    skewt = pm.CustomDist('likelihood', nu + eps, mu, sigma + eps, alpha,
                          logp=logp_skewt, observed=valid_samples[:1000])
    model_trace = pm.sample(
        nuts_sampler="pymc",
        draws=100,
        tune=100,
        chains=1,
    )
```

Sampling emits warnings, and it took about 16 minutes to finish fitting. Is there a common way to speed up the computation?
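(As a sanity check on the math rather than the speed, the same log-density can be reproduced outside PyMC with plain SciPy. `logp_skewt_np` below is my own name for this hypothetical helper; it mirrors `logp_skewt` term by term, including the extra `-log(sigma)` carried over from the original.)

```python
import numpy as np
from scipy import stats

def logp_skewt_np(value, nu, mu, sigma, alpha):
    """NumPy/SciPy mirror of logp_skewt, for quick sanity checks."""
    return (
        np.log(2)
        + stats.t.logpdf(value, df=nu, loc=mu, scale=sigma)
        + stats.t.logcdf(alpha * value, df=nu, loc=mu, scale=sigma)
        - np.log(sigma)
    )

# Evaluate at the parameter values used for 'target' above.
print(logp_skewt_np(0.5, nu=1, mu=0, sigma=3, alpha=-10))
```

One useful check: with `alpha = 0` the CDF term reduces to `log(0.5)`, so the skew factor cancels the `log(2)` and the density collapses back to a symmetric Student-t (up to the extra `-log(sigma)` term).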
We are working on speeding up these types of gradients in pymc-devs/pytensor#174. Right now they are implemented in NumPy and can't be compiled to JAX. I will try to push that PR over the finish line sometime in the next few weeks. For now, if you want to speed them up, you might need to re-implement the Ops manually in your target backend, which isn't trivial if you are not familiar with PyTensor and/or JAX.