Allow for a named tuple for `initial_params` #2286

DominiqueMakowski · 2024-07-12T08:08:28Z

My understanding is that currently initial_params must be a vector of the length of the parameters, but this can become a bit tricky for models with lots of parameters.

Would it be possible to allow for passing, for instance, a named tuple (initial_params=(μ=2.5, σ=1.0)) to set initial values on desired parameters?

The text was updated successfully, but these errors were encountered:

Red-Portal · 2024-07-14T20:23:56Z

I would also love to see named tuple generally supported across the Turing ecosystem, but that would probably need some work. It's also somewhat dependent on this PR

torfjelde · 2024-07-15T09:00:20Z

Would it be possible to allow for passing, for instance, a named tuple (initial_params=(μ=2.5, σ=1.0)) to set initial values on desired parameters?

Definitively possible! I made a quick PoC concept here: TuringLang/DynamicPPL.jl#632

I'm somewhat limited in time these days, but maybe someone can complete it (shouldn't be much work).

I would also love to see named tuple generally supported across the Turing ecosystem, but that would probably need some work. It's also somewhat dependent on JuliaStats/Distributions.jl#1803

Supporting initial_params::NamedTuple would actually be quite a bit simpler than supporting NamedTuple across the entire TuringLang ecosystem:)

Red-Portal · 2024-07-15T21:15:26Z

@torfjelde We don't yet officially support passing NamedTuples to a LogDensityFunction right?

torfjelde · 2024-07-16T07:43:48Z

We don't yet officially support passing NamedTuples to a LogDensityFunction right?

No, but we don't need this to support initial_params::NamedTuple.

DominiqueMakowski · 2024-07-16T11:19:16Z

Another potential feature for TuringLang/DynamicPPL.jl#632 would be to set all/some of these initial parameters as functions that are then applied the prior. For instance, mean() or mode() (I reckon this would be quite convenient to use, like sample(..., init_params=mean(), to take the mean of all the priors as starting params)

torfjelde · 2024-07-18T10:28:29Z

Though I agree that seems convenient, it's just a bit too much hassle to maintain as it really only saves a single call from the user perspective:

init_parmas=mean(rand(Vector, model))`

(though this is somewhat "new" / not well-documented tbh)

sunxd3 · 2024-07-18T10:54:03Z

I agree with Tor

torfjelde · 2024-07-25T20:15:49Z

This has now been solved by TuringLang/DynamicPPL.jl#632 I believe? @sunxd3 ?:)

DominiqueMakowski · 2024-07-31T19:21:34Z

Cheers for the work!

DominiqueMakowski · 2024-08-01T18:24:39Z

Small thing, but mean(rand(Vector, model)) does not return the mean for each parameter (but just the mean of a random draw from all the parameters). Thus I'm not sure how to initialize using the mean of the prior distribution of each parameter?

sunxd3 · 2024-08-02T06:30:14Z

I am probably understanding wrongly, but for

@model function f()
    x ~ Normal(0, 1)
    y ~ Normal(x, 1)
end

mean of x is 0, is this what you mean? And yes, what would the mean of y is?

torfjelde · 2024-08-02T09:17:24Z

Small thing, but mean(rand(Vector, model)) does not return the mean for each parameter (but just the mean of a random draw from all the parameters). Thus I'm not sure how to initialize using the mean of the prior distribution of each parameter?

Ah sorry, yes that's very true. I wrote that a bit too quickly trying to convey that you can take the mean of smaoples from the model to get it.

You can do:

chain_prior = sample(model, Prior(), 1000)

and then extract from this:)

DominiqueMakowski · 2024-08-02T11:38:28Z

Well yes but my point was about some syntactic sugar to conveniently (and efficiently) set the initial parameters to the prior means, and sampling from the priors first appears as overkill given that the distributions usually have analytically defined means.

Hence the initial_params=:mean proposition or ``initial_params=mean` (i.e., passing a function that would be applied to each prior distribution).

Alternatively, is there a way to extract the prior distributions as a vector, for instance:

priors = get_priors(model)  
priors

(mu=Normal(0, 1), sigma=Gamma(2, 2))

One could then run:

initial_params=mean.(priors)

torfjelde mentioned this issue Jul 15, 2024

Allowing using NamedTuple as initial_params TuringLang/DynamicPPL.jl#632

Merged

1 task

sunxd3 closed this as completed Jul 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow for a named tuple for `initial_params` #2286

Allow for a named tuple for `initial_params` #2286

DominiqueMakowski commented Jul 12, 2024

Red-Portal commented Jul 14, 2024

torfjelde commented Jul 15, 2024

Red-Portal commented Jul 15, 2024

torfjelde commented Jul 16, 2024

DominiqueMakowski commented Jul 16, 2024

torfjelde commented Jul 18, 2024

sunxd3 commented Jul 18, 2024

torfjelde commented Jul 25, 2024

DominiqueMakowski commented Jul 31, 2024

DominiqueMakowski commented Aug 1, 2024

sunxd3 commented Aug 2, 2024

torfjelde commented Aug 2, 2024

DominiqueMakowski commented Aug 2, 2024

Allow for a named tuple for initial_params #2286

Allow for a named tuple for initial_params #2286

Comments

DominiqueMakowski commented Jul 12, 2024

Red-Portal commented Jul 14, 2024

torfjelde commented Jul 15, 2024

Red-Portal commented Jul 15, 2024

torfjelde commented Jul 16, 2024

DominiqueMakowski commented Jul 16, 2024

torfjelde commented Jul 18, 2024

sunxd3 commented Jul 18, 2024

torfjelde commented Jul 25, 2024

DominiqueMakowski commented Jul 31, 2024

DominiqueMakowski commented Aug 1, 2024

sunxd3 commented Aug 2, 2024

torfjelde commented Aug 2, 2024

DominiqueMakowski commented Aug 2, 2024

Allow for a named tuple for `initial_params` #2286

Allow for a named tuple for `initial_params` #2286