Add subspace orthogonality analysis for factored processes #136
Conversation
Pull request overview
This PR adds functionality to compute orthogonality metrics between activation subspaces when models are trained on factored processes, where belief states are Cartesian products of subprocess belief states.
Key Changes:
- Implements subspace orthogonality computation using QR decomposition and SVD to measure overlap between learned coefficient subspaces
- Refactors API: replaces
to_factorswithconcat_belief_states, renamesprojectionstoarrays, separatescoeffsandinterceptin return values - Unifies SVD functionality through a
use_svdflag for consistent access to both regression methods
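The orthogonality computation described above can be sketched with a small self-contained helper: QR gives an orthonormal basis for each coefficient subspace, and the singular values of the product of the bases are the cosines of the principal angles between them (the function name and metric choices here are illustrative, not the PR's actual API):

```python
import numpy as np

def subspace_orthogonality(A: np.ndarray, B: np.ndarray) -> dict:
    """Overlap between the column spaces of A and B via principal angles.

    QR yields orthonormal bases Q_a, Q_b; the singular values of
    Q_a.T @ Q_b are the cosines of the principal angles:
    1.0 = fully aligned directions, 0.0 = orthogonal.
    """
    Qa, _ = np.linalg.qr(A)  # reduced QR: orthonormal basis of col(A)
    Qb, _ = np.linalg.qr(B)
    cosines = np.linalg.svd(Qa.T @ Qb, compute_uv=False)
    return {
        "max_cosine": float(cosines.max()),    # worst-case alignment
        "mean_cosine": float(cosines.mean()),
        "singular_values": cosines,
    }

# Spans of disjoint coordinate axes are exactly orthogonal:
A = np.eye(4)[:, :2]
B = np.eye(4)[:, 2:]
print(subspace_orthogonality(A, B)["max_cosine"])  # → 0.0
```

For a contained subspace (columns of one matrix lying inside the other's span) the largest cosine is 1.0 even though the dimensions differ, which is why the per-angle singular values are worth returning alongside the scalar summaries.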
Reviewed changes
Copilot reviewed 4 out of 4 changed files in this pull request and generated 8 comments.
| File | Description |
|---|---|
| `tests/analysis/test_linear_regression.py` | Adds 9 orthogonality tests covering orthogonal/aligned/contained subspaces, multi-factor scenarios, and edge cases; updates existing tests for the new API structure |
| `tests/analysis/test_layerwise_analysis.py` | Adds validation tests for the new parameters (`concat_belief_states`, `compute_subspace_orthogonality`, `use_svd`) and verifies proper default handling |
| `simplexity/analysis/linear_regression.py` | Implements the core orthogonality computation functions, refactors regression to support both concatenated and separate factor processing, and separates coefficients from intercepts in return values |
| `simplexity/analysis/layerwise_analysis.py` | Updates parameter validators to accept the new parameters and forward them to the regression functions |
Pull request overview
Copilot reviewed 4 out of 4 changed files in this pull request and generated 14 comments.
Copilot AI · Dec 12, 2025

```python
"Degenerate subspace detected during orthogonality computation."
"All singular values are zero."
"Setting probability values to zero."
```

These adjacent string literals are concatenated with no separator, so the warning message is missing spacing between sentences. Each literal except the last should end with a trailing space:

```python
"Degenerate subspace detected during orthogonality computation. "
"All singular values are zero. "
"Setting probability values to zero."
```
Copilot AI · Dec 12, 2025

```python
if compute_subspace_orthogonality:
    SIMPLEXITY_LOGGER.warning(
        "Subspace orthogonality requires multiple factors."
        "Received single factor of type %s; skipping orthogonality metrics.",
```

The warning message is missing a space between the sentences. Suggested change:

```python
        " Received single factor of type %s; skipping orthogonality metrics.",
```
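The underlying pitfall in both of these warnings is Python's implicit adjacent-string-literal concatenation, which joins the pieces with no separator. A minimal illustration (the process name is a placeholder):

```python
# Adjacent string literals fuse with no space between them:
msg = (
    "Subspace orthogonality requires multiple factors."
    "Received single factor of type %s; skipping orthogonality metrics."
)
# The sentences run together: "...multiple factors.Received single factor..."
print(msg % "MyProcess")

# Adding an explicit leading space to the second literal fixes it:
fixed = (
    "Subspace orthogonality requires multiple factors."
    " Received single factor of type %s; skipping orthogonality metrics."
)
print(fixed % "MyProcess")
```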
Copilot AI · Dec 12, 2025

```python
assert scalars.keys() == scalars_non_tuple.keys()
assert arrays.keys() == arrays_non_tuple.keys()
for key in scalars.keys():
    assert scalars[key] == pytest.approx(scalars_non_tuple[key])
for key in arrays.keys():
    assert arrays[key] == pytest.approx(arrays_non_tuple[key])
```

Comparing dictionary keys and floating point values using direct equality and `pytest.approx` can be fragile. Consider using `chex.assert_trees_all_close` for both scalars and arrays, which provides a more robust comparison in one call. Additionally, the trailing whitespace on line 336 should be removed. Suggested change:

```python
chex.assert_trees_all_close(scalars, scalars_non_tuple)
chex.assert_trees_all_close(arrays, arrays_non_tuple)
```
Copilot AI · Dec 12, 2025

```python
# Compute the entropy
probs = singular_values**2 / probs_denominator
num_zeros = jnp.sum(probs == 0)
if num_zeros > 0:
```

The comparison `num_zeros > 0` may not behave correctly when `num_zeros` is a traced JAX array: under `jit`, tracers cannot be converted to Python booleans, so the `if` raises at trace time (a concrete array outside `jit` does convert, but an explicit conversion makes the intent clear). Use `.item()` or `int(...)` to obtain a Python scalar, or restructure the branch with `jnp.where`. Suggested change:

```python
if num_zeros.item() > 0:
```
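To make the failure mode concrete: the truthiness problem only bites under tracing. A minimal sketch (assuming `jax` is installed; the entropy expression is illustrative, not the PR's exact code) contrasting the eager behavior with a trace-safe, branchless alternative:

```python
import jax
import jax.numpy as jnp

def count_zeros(probs):
    return jnp.sum(probs == 0)

probs = jnp.array([0.0, 0.5, 0.0, 0.5])

# Eagerly, the 0-d result is concrete, so int()/.item() works as expected:
assert int(count_zeros(probs)) == 2

@jax.jit
def entropy_branchless(probs):
    # Inside jit, `if count_zeros(probs) > 0:` would raise, because the
    # comparison yields a tracer with no concrete boolean value. Instead,
    # guard the zeros branchlessly: log(1.0) = 0 kills the zero terms.
    safe = jnp.where(probs == 0, 1.0, probs)
    return -jnp.sum(probs * jnp.log(safe))

print(float(entropy_branchless(probs)))  # → ~0.693 (= ln 2)
```

The `jnp.where` guard also keeps the function fully differentiable and jit-compatible, whereas `.item()` forces a host-side sync and only works outside traced code.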
Copilot AI · Dec 12, 2025

```python
        scalars.update(orthogonality_scalars)
        arrays.update(orthogonality_singular_values)
    else:
        SIMPLEXITY_LOGGER.warning("Subspace orthogonality cannot be computed for a single belief state")
```

This warning message is inconsistent with the one at lines 467–469: line 424 says "cannot be computed for a single belief state" while lines 467–469 say "requires multiple factors". These should use consistent phrasing. Suggested change:

```python
        SIMPLEXITY_LOGGER.warning("Subspace orthogonality requires multiple factors. Received single factor; skipping orthogonality metrics.")
```
Copilot AI · Dec 12, 2025

```python
allowed = {"fit_intercept", "concat_belief_states", "compute_subspace_orthogonality", "use_svd", "rcond_values"}
unexpected = set(provided) - allowed
if unexpected:
    raise ValueError(f"Unexpected linear_regression_svd kwargs: {sorted(unexpected)}")
```

The error message says "Unexpected linear_regression_svd kwargs", but this is now a base validator used by both `linear_regression` and `linear_regression_svd`. The message should be generic to accurately reflect both callers. Suggested change:

```python
    raise ValueError(f"Unexpected linear regression kwargs: {sorted(unexpected)}")
```
Pull request overview
Copilot reviewed 6 out of 6 changed files in this pull request and generated 8 comments.
Copilot AI · Dec 12, 2025

```python
singular_values = {
    "singular_values": singular_values,
}
return scalars, singular_values
```

The name `singular_values` is reused for the returned dictionary, shadowing the array computed earlier at line 292; readers must track which binding is live. Use a more descriptive name such as `singular_values_dict` or `arrays`. Suggested change:

```python
singular_values_dict = {
    "singular_values": singular_values,
}
return scalars, singular_values_dict
```
Copilot AI · Dec 12, 2025

```python
probs = singular_values**2 / probs_denominator_safe

def execute_some_zeros_warning_branch(x):
    # This correctly calls the log_some_zeros function
```

The comment "This correctly calls the log_some_zeros function" is redundant and adds no value; the code is self-explanatory. Consider removing it.
Copilot AI · Dec 12, 2025

```python
    kwargs = _base_validate_linear_regression_kwargs(kwargs)
    kwargs.pop("rcond_values")
    return kwargs


def _validate_linear_regression_svd_kwargs(kwargs: Mapping[str, Any] | None) -> dict[str, Any]:
    kwargs = _base_validate_linear_regression_kwargs(kwargs)
    kwargs.pop("use_svd")
    return kwargs
```

These validators mutate the dictionary returned by `_base_validate_linear_regression_kwargs` via `pop()`. While this works, it creates an implicit dependency on the base validator returning a mutable dict. Building a new dictionary by filtering out the unwanted key makes the intent clearer and avoids mutation. Suggested change:

```python
    base_kwargs = _base_validate_linear_regression_kwargs(kwargs)
    return {k: v for k, v in base_kwargs.items() if k != "rcond_values"}


def _validate_linear_regression_svd_kwargs(kwargs: Mapping[str, Any] | None) -> dict[str, Any]:
    base_kwargs = _base_validate_linear_regression_kwargs(kwargs)
    return {k: v for k, v in base_kwargs.items() if k != "use_svd"}
```
```diff
 fn=partial(layer_linear_regression, use_svd=True),
 requires_belief_states=True,
-validator=_validate_linear_regression_svd_kwargs,
+validator=_validate_linear_regression_kwargs,
```

Bug: pre-bound `use_svd` keyword silently overridden

The `linear_regression_svd` registry entry uses `partial(layer_linear_regression, use_svd=True)`, which pre-binds `use_svd=True`. However, `_validate_linear_regression_kwargs` always includes `use_svd` in its return dictionary (defaulting to `False`). When `LayerwiseAnalysis.analyze()` calls `self._analysis_fn(..., **self._analysis_kwargs)`, the call-time keyword overrides the partial's pre-bound value (`functools.partial` merges call-time keywords over pre-bound ones rather than raising), so the SVD path is silently disabled.

Additional Locations (1)
```python
"concat_belief_states": concat_belief_states,
"compute_subspace_orthogonality": compute_subspace_orthogonality,
"use_svd": use_svd,
"rcond_values": rcond_values,
```

Bug: SVD regression never used due to kwarg override

The validator `_validate_linear_regression_kwargs` always returns `use_svd` in its output (defaulting to `False`). When the `linear_regression_svd` registry entry uses `partial(layer_linear_regression, use_svd=True)` and `analyze()` calls it with `**self._analysis_kwargs` containing `use_svd=False`, the call-time kwarg overrides the partial's pre-bound value. This means `linear_regression_svd` will actually run non-SVD regression, breaking features like the `best_rcond` output and SVD-based regularization.

Additional Locations (1)
```diff
 ),
 "linear_regression_svd": AnalysisRegistration(
-    fn=layer_linear_regression_svd,
+    fn=partial(layer_linear_regression, use_svd=True),
```

Bug: pre-bound keyword overridden at call time for SVD analysis

The `linear_regression_svd` registry entry uses `partial(layer_linear_regression, use_svd=True)`, binding `use_svd=True` as a keyword argument. However, the validator `_validate_linear_regression_kwargs` always sets `resolved_kwargs["use_svd"]` at line 54. When the analysis is called, the validator-supplied keyword overrides the partial's pre-bound `use_svd=True` (call-time keywords win in `functools.partial`), so the SVD path never runs.

Additional Locations (1)
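The root cause in both reports is how `functools.partial` merges keywords: keyword arguments supplied at call time silently override pre-bound ones rather than raising. A minimal demonstration with a stub standing in for the analysis function:

```python
from functools import partial

def layer_linear_regression_stub(*, use_svd: bool = False) -> str:
    # Stand-in for the real analysis function; reports which path would run.
    return "svd" if use_svd else "lstsq"

svd_fn = partial(layer_linear_regression_stub, use_svd=True)
print(svd_fn())               # → svd
# Kwargs unpacked at call time win over the partial's pre-bound value:
print(svd_fn(use_svd=False))  # → lstsq  (the SVD path is silently disabled)
```

A `TypeError: got multiple values for keyword argument` only occurs when an argument bound *positionally* in the partial is re-passed by keyword; for keyword-bound arguments the override is silent, which is exactly why the validator must not re-emit `use_svd` for the `linear_regression_svd` entry.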
…ise subspace orthogonality metrics
- Separate coeffs/intercept in return structure (omit intercept key when `fit_intercept=False`)
- Rename `to_factors` → `concat_belief_states` for clarity
- Add 9 orthogonality tests with principled numerical thresholds (`safety_factor=10`)
- Test orthogonal, aligned, contained subspaces; multi-factor scenarios; edge cases
- Update validators and existing tests for new parameter structure
- Add informative assertion messages for debugging numerical precision
…iple belief states. Log a warning if only one belief state is present, preventing unnecessary calculations.
…ove redundant orthogonality computations warning
…ank line after docstring in test_layerwise_analysis
…ogonality function
…ession.py to prevent crashes. Remove deprecated layer_linear_regression_svd function for cleaner code and encourage use of layer_linear_regression with use_svd=True.
…cation of layer_linear_regression with use_svd=True, removing the deprecated layer_linear_regression_svd function for improved clarity and consistency.
…pecifying return values and their meanings for improved clarity and documentation.
…True and exclude it from output. Enhance tests to validate behavior.
…ing bases to subspace analysis
Force-pushed c37d006 to 69ff3e4 (Compare)
Summary
Adds functionality to compute orthogonality metrics between activation
subspaces when models are trained on factored processes (processes whose
belief state is a Cartesian product of subprocess belief states).
Key Features
- Computes orthogonality metrics between learned coefficient subspaces using QR decomposition and SVD
- Unified handling of regression and orthogonality
- Separates `coeffs` (linear transformation) and `intercept` (translation) in the return structure
Implementation

Core functionality (`linear_regression.py`):
- Adds a `compute_subspace_orthogonality` parameter to `layer_linear_regression()`
- Adds `_compute_all_pairwise_orthogonality()` and `_compute_subspace_orthogonality()` for pairwise subspace metrics in multi-factor scenarios
- Unifies both regression methods behind a `use_svd` flag

`layerwise_analysis.py`:
- Forwards the new `concat_belief_states`, `compute_subspace_orthogonality`, and `use_svd` parameters

API improvements:
- Removes the `to_factors` flag and introduces a `concat_belief_states` flag
- Uses `concat_belief_states` to determine whether regression runs on concatenated beliefs or factored beliefs
- Renames the `projections` dictionary to `arrays` (more descriptive)
- Splits `arrays` into `coeffs` and `intercept`; omits the `intercept` key when `fit_intercept=False`

Testing:
- Orthogonality computation tests (`test_linear_regression.py`)
- Parameter validation tests (`test_layerwise_analysis.py`): covers the new parameters (`concat_belief_states`, `compute_subspace_orthogonality`, `use_svd`) and the returned `coeffs`/`intercept` keys

Note

Adds pairwise subspace-orthogonality metrics for factored beliefs, separates coeffs/intercept in outputs, and unifies linear regression/SVD via a `use_svd` flag with updated validators and tests.
- Returns `coeffs` and `intercept` (omits `intercept` when `fit_intercept=False`)
- Uses `concat_belief_states` to fit jointly and split back; reuses params for metrics
- Supports `use_svd` (with optional `rcond_values`); exposes the best rcond appropriately (concat vs per-factor)
- Validates `concat_belief_states`, `compute_subspace_orthogonality`, `use_svd`, and `rcond_values`; enforces constraints and defaults
- Registers `linear_regression_svd` via `partial(layer_linear_regression, use_svd=True)` and guards `use_svd` in the validator
- Replaces `to_factors` with `concat_belief_states`
- Returns arrays under `projected`, `coeffs`, and `intercept`

Written by Cursor Bugbot for commit 51283f7. This will update automatically on new commits.