Feature: Thunks as an alternative to artifacts #120

nicholasjng · 2024-03-19T17:22:19Z

This branch adds thunks as an alternative to artifacts. For background, see https://en.wikipedia.org/wiki/Thunk.

The idea is basically the same as React's useMemo. Like artifacts, thunks are lazy-loadable. They can be customized as a class if state is required (with the generic nnbench.types.Thunk type), and they are usable as a drop-in replacement for a regular value, e.g. a torch.nn.Module, that is computed at benchmark runtime.

The strategy here is to bootstrap each value in the Thunk.__call__() method, relying on caching as much as possible to avoid recomputing values.
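As a rough illustration of the strategy above, here is a minimal sketch of a thunk that computes its value on the first `__call__()` and reuses it afterwards. The `Thunk` class below is a hypothetical stand-in written for this example, not the actual `nnbench.types.Thunk`, and `load_model` is a made-up placeholder for an expensive load.

```python
from typing import Callable, Generic, Optional, TypeVar

T = TypeVar("T")


class Thunk(Generic[T]):
    """Hypothetical stand-in for illustration; not nnbench's real class."""

    def __init__(self, fn: Callable[[], T]) -> None:
        self._fn = fn
        self._computed = False
        self._value: Optional[T] = None
        self.calls = 0  # number of real computations, for illustration only

    def __call__(self) -> T:
        # Compute lazily on first access, then serve the cached value.
        if not self._computed:
            self._value = self._fn()
            self._computed = True
            self.calls += 1
        return self._value  # type: ignore[return-value]


def load_model() -> dict:
    # Placeholder for an expensive operation such as loading model weights.
    return {"weights": [0.1, 0.2, 0.3]}


model = Thunk(load_model)
first = model()   # triggers the load
second = model()  # served from the per-instance cache
```

Nothing is loaded when the thunk is constructed; the work only happens when the value is first requested.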

Another important addition is the BenchmarkRunner.typecheck flag, which can be turned off to skip typechecks and avoid shortcomings / bugs in our current typechecking logic.

Still to do:

- Add a cache for computed thunk memos.
- Check memory usage when models are loaded lazily instead of all at once.
- Re-implement the NER example with thunks (almost done).
- See if the UX/DX is better: clean code, runtimes, maintainability, ...

Compressing parameters in a transform is better than doing it directly in the benchmark runner, which takes responsibility away from the transform that we just introduced. Refactors `nnbench.io.transform` into `nnbench.transforms`, the latter being its own submodule. This is useful when adding new builtin transforms, so that they do not all have to go into a single file.

Adds two conditional branches to disable typechecks in the _check() method. This is nice to have when prototyping new features, where inputs to benchmarks might not be exactly of the requested types.
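To make the effect of such a flag concrete, here is a simplified sketch. The `Runner` class and its single guard are assumptions made for this example; they do not reproduce `BenchmarkRunner` or the actual branches in `_check()`.

```python
import inspect
from typing import Any, Callable


class Runner:
    """Hypothetical, simplified runner; not nnbench's BenchmarkRunner."""

    def __init__(self, typecheck: bool = True) -> None:
        self.typecheck = typecheck

    def _check(self, fn: Callable[..., Any], params: dict[str, Any]) -> None:
        if not self.typecheck:
            return  # checks disabled, accept any input
        sig = inspect.signature(fn)
        for name, value in params.items():
            annotation = sig.parameters[name].annotation
            if annotation is inspect.Parameter.empty:
                continue  # unannotated parameter, nothing to check
            # Only plain classes are checked in this sketch.
            if isinstance(annotation, type) and not isinstance(value, annotation):
                raise TypeError(
                    f"expected {annotation.__name__} for {name!r}, "
                    f"got {type(value).__name__}"
                )

    def run(self, fn: Callable[..., Any], params: dict[str, Any]) -> Any:
        self._check(fn, params)
        return fn(**params)


def double(x: int) -> float:
    return 2 * x


# With checks off, a float is accepted for the int-annotated parameter.
relaxed = Runner(typecheck=False).run(double, {"x": 1.5})
```

With `typecheck=True`, the same call raises a `TypeError` in this sketch, which is the behavior one may want to bypass while prototyping.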

Also stops the practice of binding partial parametrizations directly to benchmarks. As a result, we can manipulate benchmark function parameters if need be (for example by lazy-loading thunk parameters). Changes the interface construction slightly to inject the partial parametrization as defaults over the `inspect.Parameter` default values.
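One way to picture "partial parametrization as defaults" is a resolution order: values passed at run time win, then the partial parametrization, then the function's own `inspect.Parameter` defaults. The helper `resolve_params` and the `accuracy` benchmark below are hypothetical names for this sketch, not part of nnbench.

```python
import inspect
from typing import Any, Callable


def resolve_params(
    fn: Callable[..., Any],
    partial_params: dict[str, Any],
    run_params: dict[str, Any],
) -> dict[str, Any]:
    """Resolve each parameter without binding anything to fn itself."""
    resolved: dict[str, Any] = {}
    for name, param in inspect.signature(fn).parameters.items():
        if name in run_params:
            resolved[name] = run_params[name]       # explicit run-time value
        elif name in partial_params:
            resolved[name] = partial_params[name]   # partial parametrization
        elif param.default is not inspect.Parameter.empty:
            resolved[name] = param.default          # function's own default
        else:
            raise TypeError(f"missing value for parameter {name!r}")
    return resolved


def accuracy(model: str, data: list, threshold: float = 0.5) -> float:
    return threshold


resolved = resolve_params(
    accuracy,
    partial_params={"threshold": 0.8},
    run_params={"model": "m", "data": [1, 2]},
)
```

Because the function is left unbound, the resolved dictionary can still be modified (for instance by evaluating thunks) before the benchmark is finally called with `accuracy(**resolved)`.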

Slightly changes parameter construction and adds a dethunking step right before the benchmark loop. This means that the thunk values are accessed at the latest possible time, which is just before benchmark execution. Moves the context construction ahead of the empty collection check, so that we give back a constructed context even when no benchmarks are found. Adds two C++-style thunk helpers, `is_thunk` for deciding if a value is a thunk, and `is_thunk_type` for deciding if a type annotation is a thunk type. The whole thunk facility is designed to work both with the `nnbench.types.Thunk` type and with general anonymous functions.
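The helpers and the dethunking step described above could look roughly like the following. Only the names `is_thunk` and `is_thunk_type` come from the description; their bodies, the `Thunk` stand-in, the `dethunk` function, and the rule "a lambda counts as a thunk" are assumptions made for this sketch.

```python
import collections.abc
import inspect
import typing
from typing import Any, Callable, Generic, TypeVar

T = TypeVar("T")


class Thunk(Generic[T]):
    """Hypothetical stand-in for nnbench.types.Thunk."""

    def __init__(self, fn: Callable[[], T]) -> None:
        self._fn = fn

    def __call__(self) -> T:
        return self._fn()


def is_thunk(value: Any) -> bool:
    # Assumption: Thunk instances and anonymous functions count as thunks.
    if isinstance(value, Thunk):
        return True
    return inspect.isfunction(value) and value.__name__ == "<lambda>"


def is_thunk_type(annotation: Any) -> bool:
    # Matches Thunk, Thunk[T], and zero-argument Callable[[], T] annotations.
    if annotation is Thunk or typing.get_origin(annotation) is Thunk:
        return True
    if typing.get_origin(annotation) is collections.abc.Callable:
        args = typing.get_args(annotation)
        return len(args) == 2 and args[0] == []
    return False


def dethunk(params: dict[str, Any]) -> dict[str, Any]:
    # Evaluate thunks as late as possible, right before the benchmark runs.
    return {k: v() if is_thunk(v) else v for k, v in params.items()}


params = {"model": Thunk(lambda: "loaded-model"), "n": lambda: 3, "k": 5}
ready = dethunk(params)
```

Plain values such as `k` pass through unchanged, so thunked and regular parameters can be mixed freely in one parametrization.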

In the current setup, properly typed memos and callables pass the type checker. Factors out the types into their own submodule, to be refactored later into their biggest constituents.

Showcases memo subclassing, parametrization, and trivializes the run() command again. As a downside, only per-class benchmarks and aggregates can be run in a single run, not side-by-side (that would require `params` injection).

Partial parametrizations are not bound eagerly to the benchmark functions anymore, which makes it simpler to inject memos and de-memoize variables just in time for execution. What is left is validating that, when models are benchmarked one after another with garbage collection in between, each model is actually reaped after its benchmark is done.
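One possible way to approach that validation is to hold only weak references to each model and check that they are dead after collection. This is a sketch of the idea with a dummy `Model` class and a placeholder "benchmark"; it is not how nnbench does it, and the names are made up for the example.

```python
import gc
import weakref
from typing import Callable


class Model:
    """Dummy model; a real one would hold large weight tensors."""

    def __init__(self, name: str) -> None:
        self.name = name


def run_and_track(factories: list[Callable[[], Model]]) -> list[weakref.ref]:
    refs = []
    for make in factories:
        model = make()                   # lazy load, just before use
        refs.append(weakref.ref(model))  # does not keep the model alive
        _ = model.name                   # placeholder for the benchmark itself
        del model                        # drop the only strong reference
        gc.collect()                     # collect before the next model loads
    return refs


refs = run_and_track([lambda: Model("a"), lambda: Model("b")])
# A dead weak reference returns None when called.
all_reaped = all(ref() is None for ref in refs)
```

If any reference is still alive after the loop, something (a cache, a record, a closure) is still holding on to that model.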

nicholasjng added the refactor label Mar 19, 2024

nicholasjng self-assigned this Mar 19, 2024

nicholasjng added 7 commits March 21, 2024 11:27

Add typecheck flag to benchmark runner to disable typechecks

c298b39

Adds two conditional branches to disable typechecks in the _check() method. This is nice to have when prototyping new features and inputs to benchmarks might not exactly be of the requested types.

Change thunk -> memo, add type check bypass for memos

34cdbaa

In the current setup, properly typed memos and callables pass the type checker. Factors out the types into their own submodule, to be refactored later into their biggest constituents.

Migrate artifact benchmarking code to memo syntax

cb8fa26

Showcases memo subclassing, parametrization, and trivializes the run() command again. As a downside, only per-class benchmarks and aggregates can be run in a single run, not side-by-side (that would require `params` injection).

nicholasjng force-pushed the modelthunks branch from c789d72 to 98c2909 Compare March 21, 2024 15:03

nicholasjng merged commit 65fc45b into main Mar 21, 2024
5 checks passed

This was referenced Mar 21, 2024

Add checksum verification to artifacts and loaders #110

Closed

Redesign artifact concept #114

Closed

Set up best practice for easy record serialization + parametrization + reproducibility #105

Closed

Maciej818 mentioned this pull request Mar 26, 2024

Parameter representations instead of parameters in benchmark records #122

Closed

nicholasjng deleted the modelthunks branch November 21, 2024 18:33
