Derive logprob of `<` and `>` operations #6662

shreyas3156 · 2023-04-09T06:18:54Z

This PR implements the logprob inference for binary comparison Ops > and <.

It creates a MeasurableComparison variable and evaluates the logprob based on the truth value of the condition.

Addresses #6633.

Checklist

Explain important implementation details 👆
Make sure that the pre-commit linting/style checks pass.
Link relevant issues (preferably in nice commit messages)
Are the changes covered by tests and docstrings?
Fill out the short summary sections 👇

New features

Logprob inference for GT and LT Ops.

📚 Documentation preview 📚: https://pymc--6662.org.readthedocs.build/en/6662/

codecov · 2023-04-09T06:31:44Z

Codecov Report

Merging #6662 (19ab8ad) into main (5d68bf3) will increase coverage by 0.01%.
The diff coverage is 98.24%.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #6662      +/-   ##
==========================================
+ Coverage   91.97%   91.99%   +0.01%     
==========================================
  Files          94       95       +1     
  Lines       15942    16000      +58     
==========================================
+ Hits        14663    14719      +56     
- Misses       1279     1281       +2

Impacted Files	Coverage Δ
pymc/logprob/binary.py	`98.00% <98.00%> (ø)`
pymc/logprob/__init__.py	`100.00% <100.00%> (ø)`
pymc/logprob/transforms.py	`95.67% <100.00%> (-0.21%)`	⬇️
pymc/logprob/utils.py	`100.00% <100.00%> (ø)`

... and 3 files with indirect coverage changes

ricardoV94

This is awesome!

I left some comments below, but I think it's nearly there.

pymc/logprob/binary.py

tests/logprob/test_binary.py

pymc/logprob/binary.py

shreyas3156 · 2023-04-11T22:49:04Z

@ricardoV94 I have made the suggested changes and I just wanted to confirm the approach before adding a test for the separate function logprob/utils.py. (As mentioned in a comment above)

tests/logprob/test_binary.py

ricardoV94

Looks good! Wanna tackle >= and <= next?

shreyas3156 · 2023-04-17T16:21:41Z

Looks good! Wanna tackle >= and <= next?

Definitely! Although I was wondering if >= and <= can be canonicalized to LT and GT respectively?

ricardoV94 · 2023-04-17T16:23:39Z

Looks good! Wanna tackle >= and <= next?

Definitely! Although I was wondering if >= and <= can be canonicalized to LT and GT respectively?

Yes if you allow either direction in const<dist and const>dist. In this PR only dist<const and dist>const works no?

shreyas3156 · 2023-04-17T18:23:27Z

Yes if you allow either direction in const<dist and const>dist. In this PR only dist<const and dist>const works no?

We can make this PR process expressions like const<dist by making the node rewriter check which input holds the base_rv. We can then assign the outputs to the rewritten node in a fixed order for all the comparison operations. (e.g. (base_rv, const))

But this may require adding a transform for <= and >= to GT and LT.

I think it may be better to simply add separate derivations of LE and GE like in this PR.

ricardoV94 · 2023-04-18T06:59:53Z

Yes if you allow either direction in const<dist and const>dist. In this PR only dist<const and dist>const works no?

We can make this PR process expressions like const<dist by making the node rewriter check which input holds the base_rv. We can then assign the outputs to the rewritten node in a fixed order for all the comparison operations. (e.g. (base_rv, const))

But this may require adding a transform for <= and >= to GT and LT.

I think it may be better to simply add separate derivations of LE and GE like in this PR.

You can see that for other MeasurableElemwise like Add we store a measurable_input_idx, so that we can have the right logp later on, regardless of the order in which the dist and consts came in. So maybe something like that would be simpler?

pymc/pymc/logprob/transforms.py

Line 348 in 1ed4475

measurable_input_idx: int

shreyas3156 · 2023-04-18T10:27:31Z

You can see that for other MeasurableElemwise like Add we store a measurable_input_idx, so that we can have the right logp later on, regardless of the order in which the dist and consts came in. So maybe something like that would be simpler?

Sure, I'll look into this approach.

Meanwhile, I had a question about the expression used in this PR to calculate logprob for discrete distributions. I think I should correct it to the following (please correct me if I've misunderstood)

For <: CDF - PMF (x = const) when True; 1 - CDF + PMF (x = const) when False
For >: 1-CDF when True; CDF when False

ricardoV94 · 2023-04-18T10:32:48Z

You can see that for other MeasurableElemwise like Add we store a measurable_input_idx, so that we can have the right logp later on, regardless of the order in which the dist and consts came in. So maybe something like that would be simpler?

Sure, I'll look into this approach.

Meanwhile, I had a question about the expression used in this PR to calculate logprob for discrete distributions. I think I should correct it to the following (please correct me if I've misunderstood)
1. For `<`: `CDF - PMF (x = const) when True; 1 - CDF + PMF (x = const) when False`

2. For `>`: `1-CDF when True; CDF when False`

For discrete variables the CDF gives us the probability <=, so yes. You can do
logp(dist < const, True) = logcdf(dist, const - 1).

ricardoV94

Flagging it so we don't merge with the wrong logp by accident

pymc/logprob/binary.py

ricardoV94

Looks great

shreyas3156 · 2023-04-20T00:34:01Z

You can see that for other MeasurableElemwise like Add we store a measurable_input_idx, so that we can have the right logp later on, regardless of the order in which the dist and consts came in. So maybe something like that would be simpler?

I went through transforms.py and could not understand why using the index is required. We already extract the measurable_input from the list of inputs first.

pymc/pymc/logprob/transforms.py

Lines 556 to 562 in 61be336

    
           measurable_inputs = [ 
        
               inp 
        
               for idx, inp in enumerate(node.inputs) 
        
               if inp.owner 
        
               and isinstance(inp.owner.op, MeasurableVariable) 
        
               and inp not in rv_map_feature.rv_values 
        
           ]

We then make a MeasurableTransform node from transform_inputs which always has measurable_input as its first element, which is also why the measurable_input_idx always seems to have the value 0.

pymc/pymc/logprob/transforms.py

Lines 585 to 586 in 61be336

    
           measurable_input_idx = 0 
        
           transform_inputs: Tuple[TensorVariable, ...] = (measurable_input,)

ricardoV94 · 2023-04-20T07:16:50Z

You're right, so maybe we don't need it at all? Just make it so the first input is always the measurable one?

ricardoV94 requested changes Apr 9, 2023

View reviewed changes

pymc/logprob/binary.py Show resolved Hide resolved

ricardoV94 requested changes Apr 9, 2023

View reviewed changes

ricardoV94 reviewed Apr 9, 2023

View reviewed changes

pymc/logprob/binary.py Outdated Show resolved Hide resolved

ricardoV94 changed the title ~~Logprob for binary comparison operations~~ Derive logprob of binary comparison operations Apr 9, 2023

ricardoV94 added enhancements logprob labels Apr 9, 2023

ricardoV94 changed the title ~~Derive logprob of binary comparison operations~~ Derive logprob of less and greater than comparisons Apr 9, 2023

ricardoV94 requested a review from larryshamalama April 9, 2023 08:05

shreyas3156 added 2 commits April 10, 2023 19:33

Implement logprob for binary ops

7540d9e

Tests for binary comparison ops lopgprob

6b40e55

shreyas3156 force-pushed the logprob-binary-operations-6633 branch from 04998ec to 6b40e55 Compare April 10, 2023 14:04

shreyas3156 added 4 commits April 10, 2023 19:36

Add test to github workflows

d99d66c

Use logprob and logcdf helpers

6aeb191

Combine lt and gt tests and add gt test for discrete

92bacb8

Refactor and check for potential measurability of const

cca969b

shreyas3156 added 2 commits April 14, 2023 23:32

Add test for when const is measurable

db7e0c0

Add expected mypy failure of logprob/binary.py

0900485

ricardoV94 reviewed Apr 17, 2023

View reviewed changes

tests/logprob/test_binary.py Show resolved Hide resolved

Add failed test for logp when const is measurable

597e44e

ricardoV94 approved these changes Apr 17, 2023

View reviewed changes

ricardoV94 requested changes Apr 18, 2023

View reviewed changes

shreyas3156 requested a review from ricardoV94 April 18, 2023 20:56

ricardoV94 reviewed Apr 19, 2023

View reviewed changes

pymc/logprob/binary.py Outdated Show resolved Hide resolved

Correction in logprob derivation of discrete distributions

19ab8ad

shreyas3156 force-pushed the logprob-binary-operations-6633 branch from a5c48c2 to 19ab8ad Compare April 19, 2023 08:07

ricardoV94 approved these changes Apr 19, 2023

View reviewed changes

ricardoV94 merged commit 9b712bf into pymc-devs:main Apr 19, 2023

shreyas3156 mentioned this pull request Apr 19, 2023

Derive logprob of >= and <= operations #6680

Merged

5 tasks

ricardoV94 changed the title ~~Derive logprob of less and greater than comparisons~~ Derive logprob of < and > than comparisons Apr 26, 2023

ricardoV94 changed the title ~~Derive logprob of < and > than comparisons~~ Derive logprob of < and > comparisons Apr 26, 2023

ricardoV94 changed the title ~~Derive logprob of < and > comparisons~~ Derive logprob of < and > operations Apr 26, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Derive logprob of `<` and `>` operations #6662

Derive logprob of `<` and `>` operations #6662

shreyas3156 commented Apr 9, 2023 •

edited by github-actions bot

Loading

codecov bot commented Apr 9, 2023 •

edited

Loading

ricardoV94 left a comment

shreyas3156 commented Apr 11, 2023

ricardoV94 left a comment

shreyas3156 commented Apr 17, 2023

ricardoV94 commented Apr 17, 2023

shreyas3156 commented Apr 17, 2023

ricardoV94 commented Apr 18, 2023 •

edited

Loading

shreyas3156 commented Apr 18, 2023

ricardoV94 commented Apr 18, 2023

ricardoV94 left a comment

ricardoV94 left a comment

shreyas3156 commented Apr 20, 2023

ricardoV94 commented Apr 20, 2023

Derive logprob of < and > operations #6662

Derive logprob of < and > operations #6662

Conversation

shreyas3156 commented Apr 9, 2023 • edited by github-actions bot Loading

New features

codecov bot commented Apr 9, 2023 • edited Loading

Codecov Report

ricardoV94 left a comment

Choose a reason for hiding this comment

shreyas3156 commented Apr 11, 2023

ricardoV94 left a comment

Choose a reason for hiding this comment

shreyas3156 commented Apr 17, 2023

ricardoV94 commented Apr 17, 2023

shreyas3156 commented Apr 17, 2023

ricardoV94 commented Apr 18, 2023 • edited Loading

shreyas3156 commented Apr 18, 2023

ricardoV94 commented Apr 18, 2023

ricardoV94 left a comment

Choose a reason for hiding this comment

ricardoV94 left a comment

Choose a reason for hiding this comment

shreyas3156 commented Apr 20, 2023

ricardoV94 commented Apr 20, 2023

Derive logprob of `<` and `>` operations #6662

Derive logprob of `<` and `>` operations #6662

shreyas3156 commented Apr 9, 2023 •

edited by github-actions bot

Loading

codecov bot commented Apr 9, 2023 •

edited

Loading

ricardoV94 commented Apr 18, 2023 •

edited

Loading