Fix bug in which TruncatedNormal returns -inf for all values if any value is out of bounds #6128

adrn · 2022-09-14T16:23:47Z

What is this PR about?

With pymc v4.1.7, I found that evaluating TruncatedNormal's logp with an array of values was returning all -inf values -- for example:

import numpy as np
import pymc as pm

p = {'mu': 1, 'sigma': 0.1}
with pm.Model() as model:    
    dist1 = pm.TruncatedNormal.dist(**p, lower=0.)
    dist2 = pm.Normal.dist(**p)
    
    x_grid = np.linspace(-5, 10, 1024)
    dist1_logp = pm.Deterministic('dist1', pm.logp(dist1, x_grid))
    dist2_logp = pm.Deterministic('dist2', pm.logp(dist2, x_grid))

func1 = model.compile_fn(dist1_logp, inputs=[])
func2 = model.compile_fn(dist2_logp, inputs=[])

print(func1({}))
print(func2({}))

Output:

[-inf -inf -inf ... -inf -inf -inf]
[-1798.61635344 -1789.8294493  -1781.06404481 ... -4022.26639085
 -4035.43062232 -4048.61635344]

In the output above, the first line should not be -inf everywhere, as the grid we evaluate on includes values in the allowed range of values.

With @ricardoV94's help, we tracked this down to the way that TruncatedNormal.logp was enforcing the value bounds:
https://github.com/pymc-devs/pymc/blob/main/pymc/distributions/continuous.py#L779

I noticed this comment in the check_parameters() docstring: "Note that check_parameter should not be used to enforce the logic of the logp expression under the normal parameter support as it can be disabled by the user via check_bounds = False in pm.Model()" and indeed the above example works as expected with check_bounds=False.

This PR instead follows the implementation in other truncated distributions, for example, HalfStudentT to use a switch statement instead. I also added a regression test for the example case above.

See https://discourse.pymc.io/t/truncatednormal-logp-returning-all-inf/10398 for more context.

Checklist

Explain important implementation details 👆
Make sure that the pre-commit linting/style checks pass.
Link relevant issues (preferably in nice commit messages)
Are the changes covered by tests and docstrings?
Fill out the short summary sections 👇

Major / Breaking Changes

n/a

Bugfixes / New features

Fixed a bug in which TruncatedNormal would return -inf for all logp values if any input value was outside of the bounds.

Docs / Maintenance

n/a

pymc/distributions/continuous.py

pymc/tests/distributions/test_continuous.py

Co-authored-by: Ricardo Vieira <28983449+ricardoV94@users.noreply.github.com>

codecov · 2022-09-14T16:38:55Z

Codecov Report

Merging #6128 (8c9dbd2) into main (ec27b5c) will increase coverage by 1.14%.
The diff coverage is 100.00%.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #6128      +/-   ##
==========================================
+ Coverage   90.90%   92.05%   +1.14%     
==========================================
  Files          99      102       +3     
  Lines       20543    21299     +756     
==========================================
+ Hits        18675    19607     +932     
+ Misses       1868     1692     -176

Impacted Files	Coverage Δ
pymc/distributions/continuous.py	`97.50% <100.00%> (-0.01%)`	⬇️
pymc/tests/distributions/test_continuous.py	`99.76% <100.00%> (+<0.01%)`	⬆️
pymc/tests/distributions/test_shape_utils.py	`99.73% <0.00%> (-0.01%)`	⬇️
pymc/__init__.py	`100.00% <0.00%> (ø)`
pymc/exceptions.py	`100.00% <0.00%> (ø)`
pymc/distributions/bound.py	`100.00% <0.00%> (ø)`
pymc/distributions/__init__.py	`100.00% <0.00%> (ø)`
pymc/tests/distributions/test_bound.py	`100.00% <0.00%> (ø)`
pymc/tests/distributions/test_distribution.py	`97.83% <0.00%> (ø)`
pymc/distributions/truncated.py	`99.30% <0.00%> (ø)`
... and 19 more

ricardoV94 · 2022-09-14T16:54:03Z

pymc/distributions/continuous.py

@@ -777,11 +777,13 @@ def logp(
            norm = 0.0

        logp = _logprob(normal, (value,), None, None, None, mu, sigma) - norm
+        logp = at.switch(


Actually to be equivalent to what we had before we should do something like this:

pymc/pymc/distributions/truncated.py

Lines 316 to 320 in c53cd2f

if is_lower_bounded:

logp = at.switch(value < lower, -np.inf, logp)

if is_upper_bounded:

logp = at.switch(value <= upper, logp, -np.inf)

Isn't that equivalent to what is implemented here because of the default values for lower and upper?

pymc/pymc/distributions/continuous.py

Lines 714 to 715 in 91cbebd

lower = at.as_tensor_variable(floatX(lower)) if lower is not None else at.constant(-np.inf)

upper = at.as_tensor_variable(floatX(upper)) if upper is not None else at.constant(np.inf)

We retrieve the None case here:

pymc/pymc/distributions/continuous.py

Lines 763 to 764 in 91cbebd

unbounded_lower = isinstance(lower, TensorConstant) and np.all(lower.value == -np.inf)

unbounded_upper = isinstance(upper, TensorConstant) and np.all(upper.value == np.inf)

So in those cases we avoid introducing the useless switch. It's a small optimization but I don't see any reason yo modify it.

Oh I see. OK

The last commit makes the implementation here more analogous to the general truncated case you linked above - thanks for that pointer!

Thanks @ricardoV94 for taking a look and helping out already! Let me know if the implementation in the latest few commits looks ok. Also, it looks like this PR is waiting for approval to run the full test suite.

ricardoV94 · 2022-09-16T07:13:36Z

Pre-commit is complaining, otherwise looks good

adrn · 2022-09-16T13:09:46Z

Cool - I forgot to pre-commit install so just ran it manually and pushed up the changes. I think the workflows need to be approved to run again?

ricardoV94 · 2022-09-17T10:17:35Z

Thanks @adrn !

adrn added 2 commits September 14, 2022 11:24

use switch instead of relying on check_parameters

2316b90

add small regression test of evaluating logp for truncatednormal

a8c104f

ricardoV94 requested changes Sep 14, 2022

View reviewed changes

pymc/distributions/continuous.py Outdated Show resolved Hide resolved

pymc/tests/distributions/test_continuous.py Outdated Show resolved Hide resolved

adrn and others added 2 commits September 14, 2022 12:37

Update pymc/distributions/continuous.py

4545894

Co-authored-by: Ricardo Vieira <28983449+ricardoV94@users.noreply.github.com>

Update pymc/tests/distributions/test_continuous.py

0c5f742

Co-authored-by: Ricardo Vieira <28983449+ricardoV94@users.noreply.github.com>

move test by @ricardoV94's suggestion

d6621de

ricardoV94 requested changes Sep 14, 2022

View reviewed changes

only use switch logic if needed, as in general truncated distributions

d0a5626

adrn added 2 commits September 16, 2022 09:07

use isinf, isneginf

f9b90ba

apply pre-commit

8c9dbd2

adrn force-pushed the truncatednormal-bug branch from 05c5981 to 8c9dbd2 Compare September 16, 2022 13:08

ricardoV94 approved these changes Sep 16, 2022

View reviewed changes

ricardoV94 merged commit 5236d3e into pymc-devs:main Sep 17, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix bug in which TruncatedNormal returns -inf for all values if any value is out of bounds #6128

Fix bug in which TruncatedNormal returns -inf for all values if any value is out of bounds #6128

adrn commented Sep 14, 2022 •

edited

Loading

codecov bot commented Sep 14, 2022 •

edited

Loading

ricardoV94 Sep 14, 2022

adrn Sep 14, 2022 •

edited

Loading

ricardoV94 Sep 14, 2022

adrn Sep 14, 2022

adrn Sep 14, 2022

adrn Sep 15, 2022

ricardoV94 commented Sep 16, 2022

adrn commented Sep 16, 2022

ricardoV94 commented Sep 17, 2022

	if is_lower_bounded:
	logp = at.switch(value < lower, -np.inf, logp)

	if is_upper_bounded:
	logp = at.switch(value <= upper, logp, -np.inf)

	lower = at.as_tensor_variable(floatX(lower)) if lower is not None else at.constant(-np.inf)
	upper = at.as_tensor_variable(floatX(upper)) if upper is not None else at.constant(np.inf)

	unbounded_lower = isinstance(lower, TensorConstant) and np.all(lower.value == -np.inf)
	unbounded_upper = isinstance(upper, TensorConstant) and np.all(upper.value == np.inf)

Fix bug in which TruncatedNormal returns -inf for all values if any value is out of bounds #6128

Fix bug in which TruncatedNormal returns -inf for all values if any value is out of bounds #6128

Conversation

adrn commented Sep 14, 2022 • edited Loading

Major / Breaking Changes

Bugfixes / New features

Docs / Maintenance

codecov bot commented Sep 14, 2022 • edited Loading

Codecov Report

ricardoV94 Sep 14, 2022

Choose a reason for hiding this comment

adrn Sep 14, 2022 • edited Loading

Choose a reason for hiding this comment

ricardoV94 Sep 14, 2022

Choose a reason for hiding this comment

adrn Sep 14, 2022

Choose a reason for hiding this comment

adrn Sep 14, 2022

Choose a reason for hiding this comment

adrn Sep 15, 2022

Choose a reason for hiding this comment

ricardoV94 commented Sep 16, 2022

adrn commented Sep 16, 2022

ricardoV94 commented Sep 17, 2022

adrn commented Sep 14, 2022 •

edited

Loading

codecov bot commented Sep 14, 2022 •

edited

Loading

adrn Sep 14, 2022 •

edited

Loading