[TEST PR] Script for CI cosine similarity comparisons by leesharkey · Pull Request #277 · goodfire-ai/spd

leesharkey · 2025-11-25T06:07:36Z

Description

Enhances the model comparison script with CI-based cosine similarity metrics for
more meaningful analysis of learned component alignment between SPD model runs.

Key Changes:

Metric Improvements:

- Replaced activation density with mean causal importance (CI) as the component filtering criterion
- Added CI cosine similarity metrics to measure component alignment between models
- Renamed density_threshold → mean_ci_threshold with proper validation (0.0-1.0 range)
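The metric changes above can be illustrated with a minimal, stdlib-only sketch. It is not the code in compare_models.py: the function names, the nested-list layout (rows are samples, columns are components), and the "mean of best match" aggregation are all assumptions made for illustration. Only the `mean_ci > threshold` filter and the 0.0-1.0 validation come from this PR's description.

```python
import math

# Hypothetical layout: ci_rows[sample][component] holds one CI value in [0, 1].
CIMatrix = list[list[float]]


def mean_ci(ci_rows: CIMatrix) -> list[float]:
    """Average CI of each component over all samples."""
    n_samples = len(ci_rows)
    n_components = len(ci_rows[0])
    return [sum(row[c] for row in ci_rows) / n_samples for c in range(n_components)]


def alive_components(ci_rows: CIMatrix, mean_ci_threshold: float) -> list[int]:
    """Indices of components whose mean CI exceeds the threshold."""
    if not 0.0 <= mean_ci_threshold <= 1.0:
        raise ValueError(f"mean_ci_threshold must be in [0.0, 1.0], got {mean_ci_threshold}")
    return [c for c, m in enumerate(mean_ci(ci_rows)) if m > mean_ci_threshold]


def cosine(u: list[float], v: list[float]) -> float:
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0 else 0.0


def mean_max_ci_cosine(ci_a: CIMatrix, ci_b: CIMatrix, mean_ci_threshold: float) -> float:
    """For each alive component of model A, take its best CI-pattern match among
    the alive components of model B, then average those best matches.
    Assumes both models were evaluated on the same samples in the same order."""
    alive_a = alive_components(ci_a, mean_ci_threshold)
    alive_b = alive_components(ci_b, mean_ci_threshold)
    if not alive_a or not alive_b:
        return 0.0  # nothing to compare (a choice made for this sketch)
    cols_a = {i: [row[i] for row in ci_a] for i in alive_a}
    cols_b = {j: [row[j] for row in ci_b] for j in alive_b}
    best = [max(cosine(cols_a[i], cols_b[j]) for j in alive_b) for i in alive_a]
    return sum(best) / len(best)
```

Under these assumptions, two runs that learn the same components in a different order score close to 1.0, because each component is matched to its best counterpart rather than to the component at the same index.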

Code Quality:

- Refactored compute_activation_densities() → compute_ci_statistics() with better error handling
- Added comprehensive shape mismatch checking with detailed warnings
- Improved batch handling with StopIteration safeguards
- Enhanced logging with component-level statistics
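As a rough illustration of the batch-handling and shape-checking items above, the sketch below shows one way such safeguards might look. The helper names `take_batches` and `find_shape_mismatches` are hypothetical and do not come from the PR; the actual compute_ci_statistics() may be structured quite differently.

```python
import logging
from collections.abc import Iterable, Iterator

logger = logging.getLogger(__name__)


def take_batches(loader: Iterable, n_batches: int) -> Iterator:
    """Yield at most n_batches items, warning instead of crashing if the
    loader is exhausted early."""
    it = iter(loader)
    for i in range(n_batches):
        try:
            batch = next(it)
        except StopIteration:
            logger.warning("Data ran out after %d of %d requested batches", i, n_batches)
            return
        yield batch


def shape_of(rows: list[list[float]]) -> tuple[int, int]:
    """(n_samples, n_components) of a nested-list CI matrix."""
    return (len(rows), len(rows[0]) if rows else 0)


def find_shape_mismatches(
    ci_a: dict[str, list[list[float]]],
    ci_b: dict[str, list[list[float]]],
) -> dict[str, tuple[tuple[int, int], tuple[int, int]]]:
    """Compare per-layer CI shapes for layers present in both models and
    return the layers whose shapes differ, logging a warning for each."""
    mismatches = {}
    for layer in sorted(ci_a.keys() & ci_b.keys()):
        sa, sb = shape_of(ci_a[layer]), shape_of(ci_b[layer])
        if sa != sb:
            logger.warning("Shape mismatch in %s: %s vs %s", layer, sa, sb)
            mismatches[layer] = (sa, sb)
    return mismatches
```

Returning the mismatches as well as logging them lets a caller decide whether to skip the affected layers or abort the comparison.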

Configuration Updates:

- Updated compare_models_config.yaml with new semantic parameter names
- Adjusted default threshold value for CI-based filtering
- Updated example model paths and batch size

Related Issue

N/A - Enhancement to post-hoc analysis tooling

Motivation and Context

The CI cosine similarity metrics provide better insight into how learned components
align between different model runs. Mean CI is a more meaningful measure of
component importance than activation density, as it directly quantifies each
component's causal contribution to model outputs.

This complements the existing geometric similarity metrics (which compare component
subspace geometry) with a functional similarity metric (which compares component
usage patterns on actual data).

How Has This Been Tested?

✅ All formatting checks pass (make check)
✅ All type checks pass (basedpyright, 0 errors)
✅ All unit tests pass (200 passed, 11 skipped)
✅ Code reviewed against CLAUDE_CHECKLIST.md standards
✅ Removed obvious comment per style guide

Does this PR introduce a breaking change?

Minor breaking change in compare_models.py:

- Config parameter renamed: density_threshold → mean_ci_threshold
- Users with existing compare_models_config.yaml files will need to update this parameter name
- Impact is minimal: The script is for post-hoc analysis only, not part of the core SPD training pipeline
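For users with an older config, the rename could be handled with a small migration step along these lines. This is a hedged sketch: `migrate_compare_config` is a hypothetical helper, not part of the PR, and it operates on an already-parsed config dict rather than on the YAML file itself.

```python
def migrate_compare_config(raw: dict) -> dict:
    """Return a copy of the config with density_threshold renamed to
    mean_ci_threshold and the threshold checked against the 0.0-1.0 range."""
    cfg = dict(raw)  # leave the caller's dict untouched
    if "density_threshold" in cfg:
        if "mean_ci_threshold" in cfg:
            raise ValueError("Config sets both density_threshold and mean_ci_threshold")
        cfg["mean_ci_threshold"] = cfg.pop("density_threshold")
    threshold = cfg.get("mean_ci_threshold")
    if threshold is not None and not 0.0 <= threshold <= 1.0:
        raise ValueError(f"mean_ci_threshold must be in [0.0, 1.0], got {threshold}")
    return cfg
```

Note that carrying the old value over only fixes the key name. Because the filtering criterion changed from activation density to mean CI and the default threshold was adjusted, the value itself likely needs to be reviewed.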


Added two documentation files to help AI assistants work effectively with the SPD codebase:

- CLAUDE_COMPREHENSIVE.md: Complete reference guide covering development philosophy, coding standards, architecture patterns, workflows, and collaboration practices
- CLAUDE_CHECKLIST.md: Pre-submission checklist for verifying code changes meet SPD standards before committing

These documents ensure consistent code quality and help future AI assistants understand project conventions, reducing onboarding time and maintaining codebase consistency.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

Added two checklist items to prevent future AI assistants from forgetting important steps:

- "Checked existing patterns" item to ensure new files follow existing conventions
- "Restarted checklist after any changes" with explicit STOP instruction to prevent incomplete verification

Also fixed references from "dev branch" to "main branch" throughout both documentation files, as the repository uses main as the primary development branch.

These changes address feedback from PR review process where these steps were accidentally omitted.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

Per CLAUDE_CHECKLIST.md, removed redundant comment that was obvious from the code itself. The line `alive_mask = mean_component_cis[layer_name] > self.mean_ci_threshold` is self-explanatory.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

leesharkey and others added 30 commits September 16, 2025 18:07

Geometric similarity comparison made consistent with other evals and tested

b93b9d6

Replaced mean max cosine sim with mean max ABS cosine sim

cd5fda2

Configs for geom comparison runs

61d3408

Merge remote-tracking branch 'origin/main' into feature/geom_sim_compar

63c85f0

Minor modifications to make PR-ready

770a5c5

Merge remote-tracking branch 'origin/main' into feature/geom_sim_compar

49ba925

Update seed to be consistent with other configs again

364198e

Cleaned up some comments and other bits

57c2c76

Major update of PR following review: Now implemented as script rather than eval

2e7752d

Merge remote-tracking branch 'origin/main' into feature/geom_sim_compar

4fbf807

Updated registry to delete old obsolete experiments

98a6620

Merge branch 'main' into feature/geom_sim_compar

bede346

Merge branch 'main' into feature/geom_sim_compar

acc04f1

Reorganized compare_models into subdirectory and cleaned up config code

62bd77e

Merging

b84814a

Updated README.md

5173a6a

Added some example models to the config

181cac8

Getting rid of newline

8db7559

Minor changes to make the PR mergeable

0d05f0a

Merge branch 'main' of https://github.com/goodfire-ai/spd

8767194

Merge branch 'main' of https://github.com/goodfire-ai/spd

019eb2d

Merge branch 'main' of https://github.com/goodfire-ai/spd

b935b4c

Merge branch 'main' of github.com:goodfire-ai/spd

3d1edeb

Merge branch 'main' of github.com:goodfire-ai/spd

1dd738d

Merge branch 'main' of github.com:goodfire-ai/spd

956f3d4

Merge branch 'main' of github.com:goodfire-ai/spd

f7ad411

Merge branch 'main' of github.com:goodfire-ai/spd

ade1377

Merge branch 'main' of github.com:goodfire-ai/spd

08875a9

Merge branch 'main' of github.com:goodfire-ai/spd

7ca7037

Merge branch 'main' of github.com:goodfire-ai/spd

cbbdb61

leesharkey and others added 14 commits October 28, 2025 14:00

Merge branch 'main' of github.com:goodfire-ai/spd

267deb6

Merge branch 'main' of github.com:goodfire-ai/spd

f49e9e0

Merge branch 'main' of github.com:goodfire-ai/spd

22f7cfc

Merge branch 'main' of github.com:goodfire-ai/spd

ab5346d

Merge branch 'main' of github.com:goodfire-ai/spd

7cb528f

wip: Add gradient clipping support to SPD optimizer

bf4048c

Merge branch 'main' into feature/gradient-clipping

e83ba79

Merge branch 'main' of github.com:goodfire-ai/spd

01d1b6b

Update compare models script and config

ae3a635

Merge branch 'main' of github.com:goodfire-ai/spd

a78fdc5

Merge feature/claude_mds into feature/ci-cosine-sim copy branch

a4821b2

leesharkey requested a review from Laplace418 November 25, 2025 06:08

leesharkey wants to merge 44 commits into main from feature/ci-cosine-sim-merge-claude-mds
