Add probs methods to discrete distributions #305

lindahua · 2014-11-08T09:17:52Z

Computing the pmf over a contiguous range can take advantage of the recurrence relations in the pmf formula, and can thus be much more efficient than evaluating each point individually with pdf.

Hence, I introduce the probs function for discrete distributions, with the following semantics:

probs(d, a:b)   # return a vector of probabilities corresponding to each value in the range a:b
probs(d)   # only for bounded distributions, equivalent to probs(d, minimum(d):maximum(d))

This function can be used to get an entire probability vector efficiently.
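To make the recurrence idea concrete, here is a minimal sketch for the Binomial case only. It is not the code in this PR: `binomial_pmf_range` and `binomial_pmf` are hypothetical names made up for illustration, the sketch assumes Julia 1.x and 0 < p < 1, and it uses the standard identity pmf(k + 1) = pmf(k) * (n - k) / (k + 1) * p / (1 - p), so that only the left end of the range is evaluated directly.

```julia
# Hypothetical helper for illustration; not the Distributions.jl implementation.
# Returns the Binomial(n, p) pmf at every k in the range r, assuming 0 < p < 1
# and that r lies within 0:n.
function binomial_pmf_range(n::Integer, p::Real, r::UnitRange{<:Integer})
    isempty(r) && return Float64[]
    (0 <= first(r) && last(r) <= n) || throw(ArgumentError("range must lie within 0:n"))
    (0 < p < 1) || throw(ArgumentError("this sketch assumes 0 < p < 1"))

    out = Vector{Float64}(undef, length(r))
    k = first(r)
    # One direct evaluation at the left end of the range.
    v = binomial(n, k) * p^k * (1 - p)^(n - k)
    out[1] = v
    odds = p / (1 - p)
    for i in 2:length(r)
        # Recurrence: pmf(k + 1) = pmf(k) * (n - k) / (k + 1) * p / (1 - p)
        v *= (n - k) / (k + 1) * odds
        k += 1
        out[i] = v
    end
    return out
end

# Direct evaluation at a single point, used only for comparison.
binomial_pmf(n, p, k) = binomial(n, k) * p^k * (1 - p)^(n - k)

ps = binomial_pmf_range(10, 0.3, 0:10)   # whole probability vector in one pass
```

Each step of the loop costs a couple of multiplications, instead of a fresh binomial coefficient and two powers per point. A production version would also need to handle p = 0, p = 1, and underflow at the left end of the range, which this sketch ignores.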

coveralls · 2014-11-08T09:22:46Z

Coverage decreased (-0.04%) when pulling 1efccbe on dh/probs into 67d5eae on master.

coveralls · 2014-11-08T09:28:01Z

Coverage decreased (-0.04%) when pulling 1efccbe on dh/probs into 67d5eae on master.

StefanKarpinski · 2014-11-08T10:35:16Z

I'm wondering if this couldn't reasonably be additional methods of the pdf function.

kmsquire · 2014-11-08T15:03:35Z

I was thinking the same thing as Stefan.

lindahua · 2014-11-08T15:57:44Z

The idea of using pdf did come to mind, but somehow I feel that pdf(d) producing a probability vector does not look very natural.

If the consensus is that it is ok to have pdf(d) yield a probability vector (for finite distributions), I can change it to pdf. I am pretty open to this.

kmsquire · 2014-11-08T17:47:34Z

To me this just seems like a vectorized version of the pdf function, which seems fine to me.

StefanKarpinski · 2014-11-08T18:07:37Z

The pdf(d) form does seem a little odd – and there's the issue of what, if anything, it should do for continuous distributions – but I think once you see it, it makes sense, and you'd have to look up probs anyway, so there seems to be little point in having an additional generic function for this.

johnmyleswhite · 2014-11-08T22:05:47Z

Since this special case would only make sense for discrete distributions, perhaps we should use pmf, which I believe we already support for evaluation at a point.

lindahua · 2014-11-09T02:32:55Z

Based on the feedback, I think I can take the following steps at this point:

1. Use pdf(d, a:b) instead of probs(d, a:b) (for this case, their semantics are essentially the same), but with an efficient implementation that takes advantage of the relation between pdf(x) and pdf(x + 1).
2. Keep probs(d) as it is for now. This method was originally only for Categorical, Multinomial and MixtureModel, to retrieve the associated probability vector. However, other bounded discrete distributions can be considered special cases of the Categorical distribution, so I extend the probs method to them.

For Categorical, probs is the same as pdf. However, this is no longer the case for Multinomial and MixtureModel.

lindahua · 2014-11-09T06:08:41Z

Thinking it all over, I lean more towards just using pdf.

It might be a bit too cute to have pdf(d), but it doesn't seem that it will cause ambiguity or confusion.

probs(d) is then reserved for the case where there is a probability vector as a parameter, and one can use probs(d) to retrieve that parameter. This applies to distributions like Categorical, Multinomial, and MixtureModel. In such cases probs is not necessarily the same as pdf.

lindahua · 2014-11-15T09:46:21Z

As of v0.6.1, we are now using pdf in all these cases.

lindahua added 6 commits November 8, 2014 16:26

add probs for Bernoulli and Binomial

d62567d

add probs for DiscreteUniform

5d78b97

add probs for Geometric

901e39a

add probs to Hypergeometric

6c209dc

add probs to NegativeBinomial

b9d6e24

add probs to lambda

2afa38d

lindahua added this to the v0.6 milestone Nov 8, 2014

minor update of testutils

1efccbe

lindahua added a commit that referenced this pull request Nov 8, 2014

Merge pull request #305 from JuliaStats/dh/probs

0792898

Add probs methods to discrete distributions

lindahua merged commit 0792898 into master Nov 8, 2014

ararslan deleted the dh/probs branch September 29, 2016 18:17
