Logpdf support #762

mborland · 2022-02-15T17:00:02Z

No description provided.

include/boost/math/distributions/arcsine.hpp

jzmaddock · 2022-02-15T18:38:41Z

Thanks Matt, that's more or less along the lines I was thinking too.

I think we can safely add a default implementation:

template <class Distribution>
inline typename Distribution::value_type logpdf(const Distribution& dist, const typename Distribution::value_type& x)
{
  using std::log;
   return log(pdf(dist, x));
}

For those distributions that do not have exponential-like pdf's. The Arcsine is actually a good case in point, do we need an actual implementation here, or is the default good enough? I can see some arguments either way: if this->hi is super large then (x-lo)(hi-x) could overflow (but no one has ever complained), on the other hand, evaluating by logs as you do, may lead to cancellation error when the pdf is close to 1. And I doubt there is much performance difference between them.

With regard to testing... we already have decent tests for the pdf's, so I suspect that a basic sanity check against the pdf may be good enough in thee cases?

Fails with long doubles

[ci skip]

test/test_normal.cpp

mborland · 2022-02-25T08:34:02Z

@jzmaddock This is clean and good for review. Logpdf has been specialized when there are exponentials in the pdf and there rest will default to the naive log(pdf). The only place where I had to adjust tolerances was in the test of long doubles on the normal distribution. The naive, specialized, nor hard coding the result from WolframAlpha got me into the existing tolerance band.

include/boost/math/distributions/extreme_value.hpp

include/boost/math/distributions/chi_squared.hpp

include/boost/math/distributions/gamma.hpp

include/boost/math/distributions/exponential.hpp

include/boost/math/distributions/extreme_value.hpp

include/boost/math/distributions/gamma.hpp

include/boost/math/distributions/inverse_gamma.hpp

include/boost/math/distributions/inverse_gaussian.hpp

include/boost/math/distributions/laplace.hpp

include/boost/math/distributions/normal.hpp

include/boost/math/distributions/poisson.hpp

include/boost/math/distributions/rayleigh.hpp

include/boost/math/distributions/weibull.hpp

jzmaddock · 2022-02-28T18:59:52Z

There's a few comments above, I haven't checked the details of the formulas, and am replying on your tests.

We should also add logpdf to the DistributionConcept checks in test/compile_test/test_compile_result.hpp

Otherwise, looks good, much thanks for taking this on!

NAThompson · 2022-02-28T21:44:25Z

test/test_arcsine.cpp

@@ -292,6 +301,16 @@ void test_spots(RealType)
    BOOST_CHECK_CLOSE_FRACTION(pdf(arcsine_01, static_cast<RealType>(1) - tolerance),
      1 /(sqrt(tolerance) * boost::math::constants::pi<RealType>()), 2 * tolerance); //

+    // Log PDF
+    BOOST_CHECK_CLOSE_FRACTION(logpdf(arcsine_01, 0.000001), static_cast<RealType>(5.7630258931329868780772138043668005779060097243996L), tolerance);


One thing I worry about these tests is that if they fail at some later date, it's not clear why (say) 5.7630258931329868780772138043668005779060097243996L is the correct value. Is there a property that can be tested instead?

test/test_gamma_dist.cpp

mborland · 2022-04-17T19:19:53Z

@jzmaddock Your review comments have been addressed; I apologize for the delay. I also ran the sonar lint and clang-tidy combs through this as suggested by @ckormanyos.

jzmaddock · 2022-04-18T16:12:32Z

Hi Matt, thanks again for this, I just spotted a couple of location where you're still using log(tgamma(x)) rather than lgamma(x) but other than that it all looks good to go to me.

NAThompson · 2022-04-18T16:24:44Z

@HDembinski: Looks like this is basically finished. Let us know if you hit any bugs in the usage.

mborland · 2022-04-19T01:45:39Z

CI is clean so merging and closing linked issue.

HDembinski · 2022-04-21T08:34:07Z

Thanks for this. It looked fine, but perhaps you can remove some of the now duplicated code between pdf and logpdf. For some distributions (poisson, normal, exponential), the pdf can be computed from the logpdf.

jzmaddock · 2022-04-21T10:01:16Z

Thanks for this. It looked fine, but perhaps you can remove some of the now duplicated code between pdf and logpdf. For some distributions (poisson, normal, exponential), the pdf can be computed from the logpdf.

Technically yes, but would tend to loose precision due to cancellation error in the logpdf calculation?

NAThompson · 2022-04-21T14:44:13Z

Technically yes, but would tend to loose precision due to cancellation error in the logpdf calculation?

Yeah I'd like to see an ulps plot before going for that . . . also log and exp are not particularly cheap.

HDembinski · 2022-04-21T17:59:12Z

Fair enough, but you can do it at least for the exponential distribution, no? In case of the normal, you may be right. In case of Poisson, I don't know.

NAThompson · 2022-04-21T18:48:02Z

Fair enough, but you can do it at least for the exponential distribution, no?

For the rate λ, we'd have log(pdf(x)) = log(λ) - λx, and then exp(log(pdf)) = exp(log(λ) - λx). That gives two transcendental function calls over 1 using the code-duplicated λexp(-λx). Losing (or gaining!) precision on the exp(log(λ)) also seems possible, although generally you have half-ulp guarantees with those functions. Maybe I'm overthinking it but I just feel like an ulps plot would help convince me.

Arcsine distribution logpdf

e367d3c

mborland linked an issue Feb 15, 2022 that may be closed by this pull request

Log pdf support? #525

Closed

jzmaddock reviewed Feb 15, 2022

View reviewed changes

include/boost/math/distributions/arcsine.hpp Outdated Show resolved Hide resolved

Add tests for arcsine logpdf and fix definition [ci skip]

2512a13

mborland added 14 commits February 16, 2022 13:54

Remove arcsine logpdf specialization and add default implementation

d739e1d

Add logpdf to normal distribution

18c3347

Add logpdf to poisson distribution

eb55d0a

Add logpdf to exponential distribution

5986233

Add logpdf to gamma distribution

68d000c

Attempt to increase resolution of logpdf for normal distribution

079ebca

Fails with long doubles

Remove normal dist logpdf in favor of default

4bd4951

Loosen tolerance on logpdf for long doubles for normal dist

6f1cda5

Add logpdf to chi squared distribution

93593fb

[ci skip]

Add logpdf to extreme value distribution

208a1be

[ci skip]

Add logpdf to inverse gamma distribution

570b40e

[ci skip]

Add logpdf to inverse gaussian distribution

269bf59

[ci skip]

Add logpdf to laplace distribution

cd4f2b7

[ci skip]

Add logpdf to rayleigh distribution

7681a4e

[ci skip]

mborland marked this pull request as ready for review February 24, 2022 14:58

Add logpdf to weibull distribution

2a3cb31

NAThompson reviewed Feb 24, 2022

View reviewed changes

test/test_normal.cpp Show resolved Hide resolved