
Speedup with torch compile #322

Draft
danbraunai-goodfire wants to merge 1 commit into main from play/compile

Conversation


@danbraunai-goodfire (Collaborator) commented Jan 4, 2026

An extremely messy test of whether we can speed things up using torch.compile(). This requires getting rid of our hooks and monkey-patching the target model to insert the components directly (see the sketch at the end of this comment). Not sure it'll work with identity components. Anyway, the results are apparently:

| Batch | Seq Len | Mode            | Eager    | Compiled | Speedup       |
|-------|---------|-----------------|----------|----------|---------------|
| 16    | 128     | reduce-overhead | 10.56 ms | 9.64 ms  | 1.09x (9.5%)  |
| 32    | 256     | reduce-overhead | 12.61 ms | 11.20 ms | 1.13x (12.6%) |
| 64    | 256     | reduce-overhead | 20.70 ms | 15.96 ms | 1.30x (30%)   |
| 128   | 256     | reduce-overhead | 36.19 ms | 28.34 ms | 1.28x (28%)   |
| 64    | 256     | max-autotune    | 20.70 ms | 15.38 ms | 1.35x (35%)   |

These numbers are for a very stripped-down version of our computation that does just masked forward and backward passes through the full ss_llama_simple_mlp (4L); the timing methodology is sketched below.
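
A minimal sketch of how timings like these are typically gathered (CUDA events after warm-up); `run_step` and `batch` are hypothetical stand-ins, not the actual harness:

```python
import torch

def benchmark(fn, *args, warmup: int = 10, iters: int = 50) -> float:
    """Mean wall time per call in ms, measured with CUDA events after warm-up."""
    for _ in range(warmup):
        fn(*args)
    torch.cuda.synchronize()
    start = torch.cuda.Event(enable_timing=True)
    end = torch.cuda.Event(enable_timing=True)
    start.record()
    for _ in range(iters):
        fn(*args)
    end.record()
    torch.cuda.synchronize()
    return start.elapsed_time(end) / iters

# `run_step` is a hypothetical masked forward+backward step:
# eager_ms = benchmark(run_step, batch)
# compiled_ms = benchmark(torch.compile(run_step, mode="reduce-overhead"), batch)
```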
I think our batch size will be <64 in practice to avoid OOMs in our full training setup.
Probably going to leave this on the shelf: not keen on the drastic core-code change, and skeptical that we could get a >1.2x speedup for our workflows. Might be worth picking up when we move to bigger models, though.
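
For context, the shape of the monkey-patching change: instead of running components via forward hooks (which torch.compile has historically handled poorly), the target submodule is swapped for a wrapper that runs the component inline, so the whole forward pass is traceable as one graph. Not the actual diff, just a minimal sketch with hypothetical names (`ComponentWrapper`, `patch_model`):

```python
import torch
import torch.nn as nn

class ComponentWrapper(nn.Module):
    """Runs a component inline on a submodule's output, replacing a forward hook."""

    def __init__(self, target: nn.Module, component: nn.Module):
        super().__init__()
        self.target = target
        self.component = component

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.component(self.target(x))

def patch_model(model: nn.Module, module_name: str, component: nn.Module) -> None:
    # Monkey-patch: swap the named submodule for the wrapped version, keeping the
    # forward pass hook-free so torch.compile sees it without graph breaks.
    parent_name, _, child_name = module_name.rpartition(".")
    parent = model.get_submodule(parent_name) if parent_name else model
    setattr(parent, child_name, ComponentWrapper(getattr(parent, child_name), component))

# Hypothetical usage:
# patch_model(model, "layers.0.mlp", my_component)
# compiled_model = torch.compile(model, mode="reduce-overhead")
```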
