[ModelSuite] Add model ops coverage validation test #178

PaliC · 2025-10-02T07:45:06Z

This PR adds another unit test to the model loading / config system in the last PR. Specifically, here we ensure that the ops specified in the config are run in the model itself. This is important as updates to torch could change how backwards passes could work. Furthermore, if we are expecting folks to write kernels for a set of ops and then run the model, we should guarentee those ops are used.

Future work with Model Suite

#181

Stack created with Sapling. Best reviewed with ReviewStack.

Here we introduce model suite (model.py). The idea here to start and codify the ideas from jiannanWang/BackendBenchExamples. Specifically this PR deals with model registration and loading. ### Model Registration This PR creates a way of adding models to the suite and automatically validates them through CI. It also loads the models as well. The way these models are added is detailed in this readme. The tl;dir is we use a format similar to kernelbench and SakanaAI/robust-kbench where we pair model code with a config. Importantly the configs contain initialization code, forward pass arguments (both in a similar format to torchbench), and a list of ops in the forward and backwards passes. These ops are fairly important as they are what we want to point out to the researcher when they are optimizing a model. There is a README.md to help folks setup proper model code / configs. We also further verify these registrations are correct through CI. Specifically we run test/test_model_ops_configs.py to ensure the configs are formatted correctly. ### Added models I added 2 models here as examples. SmokeTestModel - This is simple model that uses aten.ops.mm as we can implement a correct version of this op ToyCoreOpsModel - This is a model which explicitly calls the backwards passes which are both in torchbench + core. ### Small Things - Added a --model-filter to the CLI as it will be needed to support filtering in model suite as it chooses things to test based on the model not set of ops ### Testing New tests are added so pytest resolves things here ### Future work with Model Suite #181

This PR adds another unit test to the model loading / config system in the last PR. Specifically, here we ensure that the ops specified in the config are run in the model itself. This is important as updates to torch could change how backwards passes could work. Furthermore, if we are expecting folks to write kernels for a set of ops and then run the model, we should guarentee those ops are used. ### Future work with Model Suite #181

This was referenced Oct 2, 2025

[Model Suite] Add model correctness testing #179

Closed

[ModelSuite] Add model loading infrastructure #177

Closed

[ModelSuite] Refactor TorchBench for ModelSuite inheritance #180

Open

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Oct 2, 2025

PaliC added 2 commits October 2, 2025 08:03

PaliC force-pushed the pr178 branch from 64e72d6 to 07fca91 Compare October 2, 2025 08:04

PaliC closed this Oct 2, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ModelSuite] Add model ops coverage validation test #178

[ModelSuite] Add model ops coverage validation test #178

Uh oh!

PaliC commented Oct 2, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

[ModelSuite] Add model ops coverage validation test #178

[ModelSuite] Add model ops coverage validation test #178

Uh oh!

Conversation

PaliC commented Oct 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Future work with Model Suite

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

PaliC commented Oct 2, 2025 •

edited

Loading