Add `BenchmarkFamily` iterable #199

nicholasjng · 2025-01-15T15:06:05Z

Designed to consume the arguments of parametrize/product, and yield benchmarks lazily.

Params needs to support lazy iterables as well, to not fall into the eager materialization + memory blowup trap.

Testing should contain these scenarios at minimum:

Confirm that BenchmarkFamilys are being picked up by nnbench.collect(), similarly to existing collect checks.
Add a test that would fail with an eager iterable (e.g. list[Benchmark like it is now), asserting memory consumption stays low e.g. with memray.

Proposal for 2):

Add a benchmark matmul(a: np.ndarray, b: np.ndarray) with two arrays $a, b \in \mathbb{R}^{N \times N}$, parametrize over one of them (five np.random.randns should suffice), assert that memory consumption stays near 3 * N^2 * sizeof(np.float) (a, b, and the intermediary for the result).

Designed to consume the arguments of `parametrize`/`product`, and yield benchmarks lazily. Params needs to support lazy iterables as well, to not fall into the eager materialization + memory blowup trap.

nicholasjng · 2025-01-16T11:53:13Z

Sweet. Results are horrible:

================================================================================= MEMRAY REPORT ==================================================================================
Allocation results for tests/integration/test_benchmark_family_memory_consumption.py::test_foobar at the high watermark

         📦 Total memory allocated: 145.0MiB
         📏 Total allocations: 96
         📊 Histogram of allocation sizes: | █▄  |
         🥇 Biggest allocating functions:
                - matmul:/Users/nicholasjunge/Workspaces/python/nnbench/tests/integration/test_benchmark_family_memory_consumption.py:24 -> 68.1MiB
                - <genexpr>:/Users/nicholasjunge/Workspaces/python/nnbench/tests/integration/test_benchmark_family_memory_consumption.py:22 -> 64.0MiB
                - _call_with_frames_removed:<frozen importlib._bootstrap>:241 -> 267.5KiB

At an allowance of 25MiB, this puts us at 6x the theoretically necessary memory consumption.

This is an easy way to assert that stale parameters are dealloc'ed once benchmarks are over.

This means lazy evaluation of inputs and construction of benchmarks. In the case of parametrization, we get optimal memory usage. For products, we load all the iterators into memory, since the cartesian product of iterables cannot be evaluated without eager consumption of all inputs.

Since we don't return lists anymore, we cannot check length or use __getitem__ calls. Also, name checks are suspended for the time being, since the user should specify their own names without us interfering / berating them for it.

nicholasjng · 2025-01-20T14:24:08Z

Merging, docs updates to follow in a separate PR.

Add BenchmarkFamily iterable

ea73512

Designed to consume the arguments of `parametrize`/`product`, and yield benchmarks lazily. Params needs to support lazy iterables as well, to not fall into the eager materialization + memory blowup trap.

nicholasjng force-pushed the benchmark-iterators branch from 9036452 to c52fef7 Compare January 16, 2025 11:46

nicholasjng added 3 commits January 20, 2025 15:16

Add integration test for memory consumption on a matmul

bdda650

This is an easy way to assert that stale parameters are dealloc'ed once benchmarks are over.

nicholasjng force-pushed the benchmark-iterators branch from c52fef7 to 267129e Compare January 20, 2025 14:21

nicholasjng merged commit 0b6f9e6 into main Jan 20, 2025
14 checks passed

nicholasjng deleted the benchmark-iterators branch January 20, 2025 14:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `BenchmarkFamily` iterable #199

Add `BenchmarkFamily` iterable #199

nicholasjng commented Jan 15, 2025 •

edited

Loading

nicholasjng commented Jan 16, 2025

nicholasjng commented Jan 20, 2025

Add BenchmarkFamily iterable #199

Add BenchmarkFamily iterable #199

Conversation

nicholasjng commented Jan 15, 2025 • edited Loading

nicholasjng commented Jan 16, 2025

nicholasjng commented Jan 20, 2025

Add `BenchmarkFamily` iterable #199

Add `BenchmarkFamily` iterable #199

nicholasjng commented Jan 15, 2025 •

edited

Loading