
Conversation

@ealt ealt (Collaborator) commented May 23, 2025

No description provided.

@ealt ealt requested a review from hrbigelow May 23, 2025 23:37
@ealt ealt marked this pull request as ready for review May 23, 2025 23:39
Copilot AI review requested due to automatic review settings May 23, 2025 23:39

Copilot AI left a comment


Pull Request Overview

This PR introduces a new generate_with_obs_dist method to both HMM implementations and adds tests to verify its output shapes.

  • Implements generate_with_obs_dist in GeneralizedHiddenMarkovModel for batch-wise sequence generation with returned observation probabilities.
  • Adds shape-based tests for generate_with_obs_dist in both test_hidden_markov_model.py and test_generalized_hidden_markov_model.py.
  • Extends the existing HMM tests to ensure both state and observation outputs match expected dimensions.
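The PR's actual implementation is not shown in this thread; as a rough illustration of the pattern the overview describes (per-sample generation vmapped across a batch, returning per-step observation distributions), a sketch under assumed names and shapes might look like:

```python
import jax
import jax.numpy as jnp

# Hypothetical sketch -- names, signatures, and shapes are assumptions,
# not the PR's actual code.
def generate_with_obs_dist(transition_matrices, state, key, sequence_len):
    """Generate one sequence plus its per-step observation distributions.

    transition_matrices: (vocab_size, num_states, num_states); summing over
        the vocab axis yields a row-stochastic transition matrix.
    state: (num_states,) belief state for a single sample (no batch dim).
    key: PRNG key for a single sample.
    """
    def step(carry, _):
        state, key = carry
        key, subkey = jax.random.split(key)
        # Observation distribution implied by the current belief state.
        obs_probs = jnp.einsum("onm,n->o", transition_matrices, state)
        obs_probs = obs_probs / obs_probs.sum()
        obs = jax.random.categorical(subkey, jnp.log(obs_probs))
        # Belief-state update given the sampled observation.
        new_state = state @ transition_matrices[obs]
        new_state = new_state / new_state.sum()
        return (new_state, key), (new_state, obs, obs_probs)

    _, (states, obs, obs_probs) = jax.lax.scan(
        step, (state, key), None, length=sequence_len
    )
    return states, obs, obs_probs

# vmap turns the per-sample function into a batched one.
batched_generate = jax.vmap(generate_with_obs_dist, in_axes=(None, 0, 0, None))

# Tiny demo: vocab_size=2, num_states=2, batch of 3, sequences of length 5.
T = jnp.array([[[0.5, 0.0], [0.0, 0.5]],
               [[0.0, 0.5], [0.5, 0.0]]])
init_states = jnp.full((3, 2), 0.5)
keys = jax.random.split(jax.random.PRNGKey(0), 3)
states, obs, obs_probs = batched_generate(T, init_states, keys, 5)
```

The shape-based tests described above would then assert that `states`, `obs`, and `obs_probs` come back as (batch_size, sequence_len, …) arrays.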

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.

  • tests/generative_processes/test_hidden_markov_model.py — Added test_generate_with_obs_dist and related shape assertions
  • tests/generative_processes/test_generalized_hidden_markov_model.py — Parametrized test_generate_with_obs_dist for both model fixtures
  • simplexity/generative_processes/generalized_hidden_markov_model.py — Introduced generate_with_obs_dist method with vmapped vectorization
Comments suppressed due to low confidence (1)

tests/generative_processes/test_hidden_markov_model.py:127

  • Consider adding an assertion to verify that intermediate_obs_probs sums to 1 along the vocabulary axis (e.g., jnp.allclose(intermediate_obs_probs.sum(-1), 1.0)), ensuring they form valid probability distributions.
assert intermediate_obs_probs.shape == (batch_size, sequence_len, z1r.vocab_size)
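The suggested normalization check, sketched with hypothetical NumPy data standing in for the real model output (Dirichlet samples are valid probability distributions by construction):

```python
import numpy as np

# Hypothetical stand-in for intermediate_obs_probs; shapes are assumptions.
batch_size, sequence_len, vocab_size = 2, 4, 3
rng = np.random.default_rng(0)
intermediate_obs_probs = rng.dirichlet(
    np.ones(vocab_size), size=(batch_size, sequence_len)
)

assert intermediate_obs_probs.shape == (batch_size, sequence_len, vocab_size)
# The suggested check: each per-step distribution sums to 1 over the vocab axis.
assert np.allclose(intermediate_obs_probs.sum(-1), 1.0)
```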

Comment on lines +75 to +80
"""Generate a batch of sequences of observations from the generative process.
Inputs:
state: (batch_size, num_states)
key: (batch_size, 2)
Returns: tuple of (belief states, observations, observation probabilities) where:

Copilot AI May 23, 2025


[nitpick] The docstring describes batch input shapes, but this method is vmapped over a per-sample state and key. Clarify that inputs are single-example (no batch dim) and outputs are vectorized across the batch.

Suggested change
"""Generate a batch of sequences of observations from the generative process.
Inputs:
state: (batch_size, num_states)
key: (batch_size, 2)
Returns: tuple of (belief states, observations, observation probabilities) where:
"""Generate sequences of observations from the generative process.
Inputs (per-sample, no batch dimension):
state: (num_states,)
key: (2,)
Returns (vectorized across the batch):

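The distinction the reviewer is drawing — per-sample inputs, batched outputs — is the standard jax.vmap contract. A toy illustration (all names here are hypothetical, not the PR's code):

```python
import jax
import jax.numpy as jnp

# The wrapped function sees a single sample (no batch dim);
# vmap supplies the batch dimension.
def per_sample(state, key):
    # state: (num_states,), key: a single PRNG key
    return state * jax.random.uniform(key)

batched = jax.vmap(per_sample)
states = jnp.ones((8, 4))                          # (batch_size, num_states)
keys = jax.random.split(jax.random.PRNGKey(0), 8)  # (batch_size, 2)
out = batched(states, keys)                        # (batch_size, num_states)
```

This is why the docstring should describe the per-sample shapes even though callers see batched arrays.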
@hrbigelow hrbigelow (Collaborator) left a comment


I did a basic wall-clock profiling comparison between generate_with_obs_dist and my original implementation, which does not use the simplexity API functions. I see roughly a 2.5x speedup: 18 seconds for my implementation versus 45 seconds for this one.

I believe this is mostly due to the redundant calculations in your transition_states and observation_probability_distribution functions.

As we discussed, I'm okay if you want to keep this as a reference implementation for the sake of code clarity, but in that case I will probably stick with my side implementation.

https://github.com/Astera-org/simplex-research/blob/henry/mods/in-context-learning/tests/process_gen_profile.py

[attached image: profiling comparison results]
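The actual benchmark is the linked process_gen_profile.py; a generic best-of-N wall-clock pattern for this kind of comparison might look like the sketch below (note the caveat about JAX's asynchronous dispatch):

```python
import time

def wall_clock(fn, *args, repeats=3):
    """Best-of-N wall-clock time for fn(*args), in seconds.

    For JAX code, fn should call .block_until_ready() on its outputs;
    otherwise asynchronous dispatch makes the timing meaningless.
    """
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        fn(*args)
        best = min(best, time.perf_counter() - start)
    return best

# Trivial demo with a stdlib call in place of the real generation functions.
elapsed = wall_clock(sum, range(100_000))
```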
