Support scan markov with history > 1 #848
Conversation
@eb8680 Could you help me review this PR? Most of the changes are generalized from history=1 to history>1. There is a complicated issue around returning the correct shape for the last carry: assume history=2; then we run step 0 and step 1 outside of `lax.scan`. We can instead run step 0, step 1, step 2 outside of `lax.scan`, then use the carry at the last unrolled step, which has the correct shape.
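For context, a minimal usage sketch of the feature (assuming the `history` keyword this PR adds to `numpyro.contrib.control_flow.scan`; the transition probabilities and site names below are hypothetical): a second-order chain whose state depends on the two previous states, with the user-managed carry holding those two states.

```python
import jax.numpy as jnp
import numpyro
import numpyro.distributions as dist
from numpyro.contrib.control_flow import scan

def model(ys):
    # trans[i, j] gives the transition probabilities given x_{t-2} = i, x_{t-1} = j
    trans = jnp.array([[[0.9, 0.1], [0.4, 0.6]],
                       [[0.3, 0.7], [0.8, 0.2]]])

    def transition_fn(carry, y):
        x_prev, x_curr = carry  # last two states; shapes must match across steps
        x = numpyro.sample("x", dist.Categorical(trans[x_prev, x_curr]))
        numpyro.sample("y", dist.Normal(x.astype(jnp.float32), 1.0), obs=y)
        return (x_curr, x), None

    init = (jnp.asarray(0), jnp.asarray(1))
    scan(transition_fn, init, ys, history=2)
```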
numpyro/contrib/control_flow/scan.py
Outdated
return wrapped_carry, (PytreeTrace({}), ys)
wrapped_carry, (pytree_trace, ys) = lax.scan(body_fn, wrapped_carry, xs_, length - 1, reverse)
y0s = []
for i in range(history):
For clarification, after changing this to `for i in range(2 * history)`, what other changes would be necessary to get the correct final `carry_shape`?
numpyro/contrib/control_flow/scan.py
Outdated
# XXX: unless we unroll more steps, we only know the correct shapes starting from
# the `history`-th iteration; that means we only know carry shapes when
# i == history - 1 or i == history, which corresponds to input carry or output carry
# at the `history`-th iteration.
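For readers less familiar with JAX, here is a minimal illustration (not the PR's code) of why carry shapes matter at all: `jax.lax.scan` requires the carry to keep exactly the same structure and shapes at every iteration, so the scanned body can only take over once the carry shape has stabilized, and the first iterations must be unrolled in Python.

```python
import jax
import jax.numpy as jnp

def bad_body(carry, _):
    # the carry doubles in size each step, which lax.scan rejects
    return jnp.concatenate([carry, carry]), None

try:
    jax.lax.scan(bad_body, jnp.zeros(2), None, length=3)
except TypeError as err:
    print("lax.scan rejects a shape-changing carry:", err)
```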
@fehiepsi I think I understand this issue, thanks for explaining. How much compilation overhead does unrolling `2 * history` steps add in the HMM examples? Unrolling `history` steps in exchange for incorrect output carry shapes is a very subtle optimization that only produces constant-time savings in program length, and may cause maintenance difficulties down the line, so it's important to be sure that it's worth it.
Thinking about it again, I guess it won't take much compilation time because history is usually small (1 or 2). Let me address the issue in this PR.
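A rough, hypothetical way to check that claim is to time the first jitted call (which includes XLA compilation) for different unroll lengths; the model below is a stand-in, not the PR's HMM examples.

```python
import time
import jax
import jax.numpy as jnp

def make_fn(unroll_steps, length=100):
    def fn(x):
        for _ in range(unroll_steps):        # unrolled prefix, traced step by step
            x = jnp.tanh(x) + 1.0
        x, _ = jax.lax.scan(                 # remainder handled by one scanned body
            lambda c, _: (jnp.tanh(c) + 1.0, None), x, None,
            length=length - unroll_steps)
        return x
    return jax.jit(fn)

for k in (1, 3):  # e.g. history vs 2 * history - 1 for history = 2
    f = make_fn(k)
    t0 = time.perf_counter()
    f(jnp.zeros(3)).block_until_ready()      # first call includes compilation
    print(f"unroll_steps={k}: {time.perf_counter() - t0:.3f}s")
```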
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM after latest changes.
Thanks for reviewing, @eb8680! Looks like there is a bug when I unrolled more steps. Looking into it...
# + msg: fn.batch_shape = (3,), value.shape = (2, 3) + fn.event_shape
# process_message(msg): promote fn so that fn.batch_shape = (1, 3).
def process_message(self, msg):
def postprocess_message(self, msg):
we only want to do this right before getting the trace.
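A stripped-down sketch of the hook choice (a hypothetical handler, not the PR's actual `promote_shapes`): `process_message` runs before a site's value is computed, while `postprocess_message` runs after the value exists, i.e. right before the trace records it, which is when the promotion described in the quoted comment can be applied.

```python
import jax.numpy as jnp
from numpyro.primitives import Messenger

class promote_shapes_sketch(Messenger):
    def postprocess_message(self, msg):
        if msg["type"] == "sample" and msg["value"] is not None:
            fn, value = msg["fn"], msg["value"]
            # left-pad fn.batch_shape with 1s so it lines up with value's shape,
            # e.g. batch_shape (3,) vs value shape (2, 3) -> batch_shape (1, 3)
            extra_dims = jnp.ndim(value) - len(fn.event_shape) - len(fn.batch_shape)
            if extra_dims > 0:
                msg["fn"] = fn.expand((1,) * extra_dims + tuple(fn.batch_shape))
```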
trace = {}
else:
with handlers.block(), packed_trace() as trace, promote_shapes(), enum(), markov():
The usage of `enum()`, `markov()` here still gives correct log density but does not recycle the dimensions at this step. To resolve this, I have moved `block` right before the generator `markov(unroll_steps + 1, history)` and removed the `markov` handler here.
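A hedged sketch of what "recycling dimensions" means (hypothetical model; requires the funsor backend, and it assumes `markov` accepts an iterable as in Pyro): under `markov(history=1)`, enumerated sites more than one step apart may reuse the same enumeration dim, so the number of dims stays bounded by roughly `history + 1` instead of growing with the chain length.

```python
import jax.numpy as jnp
import numpyro
import numpyro.distributions as dist
from numpyro import handlers
from numpyro.contrib.funsor import config_enumerate, enum, markov

@config_enumerate
def chain(T=5):
    probs = jnp.array([[0.8, 0.2], [0.3, 0.7]])
    x = 0
    for t in markov(range(T), history=1):
        x = numpyro.sample(f"x_{t}", dist.Categorical(probs[x]))

with handlers.seed(rng_seed=0):
    tr = handlers.trace(enum(chain, first_available_dim=-1)).get_trace()
# the value shapes reveal the enum dims in use; with history=1 only two dims alternate
print({name: site["value"].shape for name, site in tr.items()})
```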
# Note that `funsor.sum_product.sarkka_bilmes_product` does support history > 1.
# number of steps to unroll
history = min(history, length)
unroll_steps = min(2 * history - 1, length)
This is the minimum number of `unroll_steps` to get a correct last carry shape, but it is safe to use any number greater than this.
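A quick worked example of that formula (values chosen for illustration):

```python
history, length = 2, 10
unroll_steps = min(2 * history - 1, length)   # = 3: iterations 0, 1, 2 run unrolled
assert unroll_steps == 3
# the scanned body then handles the remaining length - unroll_steps = 7 iterations
```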
@eb8680 Though the previous implementation gives correct log density, the enumeration dimensions are not recycled with the

    with markov(history=history):
        for i in range(unroll_steps):
            transition_fn()
        lax.scan(with markov(): transition_fn())  # pseudocode

logic. I have revised it to

    for i in markov(range(unroll_steps + 1)):
        if i < unroll_steps:
            transition_fn()
        else:
            lax.scan(transition_fn())  # pseudocode

Previously, this issue was not detected due to a typo in the test, where I computed

    expected_x_prev, expected_x_curr = enum(config_enumerate(model))()
    actual_x_prev, actual_x_curr = enum(config_enumerate(model))()  # typo: should be fun_model
LGTM. Now that `sarkka_bilmes_product` is actually being used, I guess we should probably try to move away from the `P` prefix naming scheme upstream in `sarkka_bilmes_product` to a safer interface, so no non-Markov names from user code with `P` in them get incorrectly mangled. Maybe the simplest thing to do would be to make the prefixes longer (e.g. `_PREV_` instead of `P`).
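To make the concern concrete, a tiny hypothetical illustration of the collision the single-character prefix allows (site names invented here):

```python
prefix = "P"
markov_site, user_site = "rate", "Prate"
# under the "P" scheme the time-shifted Markov name collides with a user name
assert prefix + markov_site == user_site
prefix = "_PREV_"
assert prefix + markov_site != user_site   # a longer prefix removes the ambiguity
```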
Good point!
Resolves #702.
Tasks: