
Support user-defined benchmark suites. #109

Merged (127 commits) on Dec 8, 2021

Conversation

@ericsnowcurrently (Member) commented on Oct 4, 2021

(I realize this is a big PR. Sorry! The changes to .py files are a smaller part of it. If necessary I can split it up a little, but it might not be worth it. 🙂 Just let me know.)

Currently, pyperformance is two things coupled together: a tool to run a Python benchmark suite and a curated suite of Python benchmarks. This PR splits those apart, with the existing suite used as the default. This allows users to run their own set of benchmarks, perhaps specific to their Python implementation or their PyPI library, e.g. https://github.com/ericsnowcurrently/pyston-macrobenchmarks/tree/pyperformance.

Key changes:

  • introduce a new filesystem structure for suites and individual benchmarks
  • add a --manifest CLI option to specify a custom suite (see the example invocation after this list)
  • convert the default suite to the new format and move it to pyperformance/_benchmarks (now a data-only directory)
  • run some benchmarks in separate venvs when necessary
  • do not fail the whole run if the dependencies for one benchmark cannot be installed
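
For example, running benchmarks could look like this. The manifest path and output file below are made up; --manifest is the option added by this PR:

    # Run the default (curated) suite, as before.
    python3 -m pyperformance run -o results.json

    # Run a user-defined suite described by a manifest file.
    python3 -m pyperformance run --manifest ./my_suite/MANIFEST -o results.json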

Most notably, this change should not affect benchmark results.

Specifying a Benchmark Suite

A benchmark suite is defined through a manifest file. For example, see the file for the default suite in pyperformance/_benchmarks/MANIFEST. A manifest file has a [benchmarks] section (and zero or more [group NAME] sections). See pyperformance/benchmarks/_manifest.py.

The [benchmarks] section is a TSV (tab-separated values) table with 4 columns (a sketch of a manifest follows the list):

  • name - the name of the benchmark
  • version - (optional) the version of that benchmark to use
  • origin - (optional) the URI where the benchmark is found (e.g. PyPI, GitHub, local filesystem)
  • metafile - the location of the benchmark's metadata file (or <local> to look for a bm_<name>/ directory next to the manifest file)
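
A minimal sketch of what a manifest could look like, given the columns above. The benchmark names, path, and group are made up, the optional columns are omitted, and the columns must be separated by tab characters; see pyperformance/_benchmarks/MANIFEST for a real example:

    [benchmarks]

    name	metafile
    deltablue	<local>
    my_bench	./my_bench/pyproject.toml

    [group fast]
    deltablue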

The New Structure of a Benchmark

Each benchmark is in its own directory and looks like this:

<benchmark>/
    pyproject.toml
    bm_<name>.toml    # alternate benchmarks based on this one
    requirements.txt  # only if there are external dependencies; pins their versions
    run_benchmark.py  # essentially the same benchmark script as before
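
For reference, a minimal sketch of what run_benchmark.py can look like. It uses pyperf, which pyperformance benchmark scripts are already built on; the benchmark name and the workload are placeholders:

    import pyperf

    def workload():
        # Placeholder workload; a real benchmark exercises the code under test.
        return sum(i * i for i in range(10_000))

    if __name__ == "__main__":
        runner = pyperf.Runner()
        runner.metadata["description"] = "Placeholder benchmark."
        runner.bench_func("my_bench", workload)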

pyproject.toml follows the normal PEP 621 format for Python projects. These are the important fields in the [project] section (a sketch follows the list):

  • "name" - (may be inferred from the suite manifest)
  • "version" - (may be inferred from the suite manifest or base metadata)
  • "dependencies" - (only if the benchmark has external dependencies; the top-level list of dependencies)
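
A sketch of the [project] section with those fields; all values are illustrative:

    [project]
    name = "pyperformance_bm_my_bench"
    version = "1.0.0"
    dependencies = [
        "pyperf",
        "requests",
    ]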

We add a [tool.pyperformance] section, as supported by PEP 518. These are the important fields (a sketch follows the list):

  • "name" - (inferred from the [project] section)
  • "metabase" - if any, the pyproject.toml to inherit from (".." expands to "../base.toml"; possibly inferred)
  • "extra_opts" - if any, additional CLI args to use when running the benchmark script
  • "runscript" - defaults to "./run_benchmark.py"
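
And a sketch of the [tool.pyperformance] section, again with illustrative values (extra_opts is assumed to take a list of argument strings):

    [tool.pyperformance]
    name = "my_bench"
    extra_opts = ["--special-mode"]
    runscript = "./run_benchmark.py"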

See pyperformance/benchmark/_metadata.py.

@ericsnowcurrently (Member, Author)

@pablogsal, if possible, could you give this a quick look? I don't necessarily need a full review of the code; more than anything I want to be sure the overall approach is acceptable. I'd be glad to hop on a call if that would help. Thanks!

@ericsnowcurrently removed the request for review from vstinner on December 8, 2021, 01:35
@ericsnowcurrently (Member, Author)

FYI, after a quick chat with @pablogsal, I plan on merging this tomorrow.
