
Add performance benchmarks #748

Merged: 25 commits into stac-utils:main on Jan 11, 2023

Conversation

@duckontheweb (Contributor) commented on Feb 10, 2022

Related Issue(s):

Description:

Uses the airspeed velocity (asv) library to run performance benchmarks. This configuration generally follows the strategy used by xarray (as documented here), with some notable exceptions:

  • Increase the number of rounds to 4 (from the default of 2) and the number of repetitions to between 10 and 50 (from the default of 5). This seemed to eliminate a lot of the false positives with regard to performance changes. These values are set in the Bench base class and can be changed for a particular benchmark (see the sketch after this list).
  • Use the --interleave-rounds option (this also seems to help eliminate false positives).
  • Run benchmarks on all supported Python versions (we may want to change this later as the benchmark suite grows and takes longer to run).
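
As a sketch of how those overrides can be expressed, asv lets benchmark classes set `rounds` and `repeat` as class attributes, so a shared base class can carry them. The class names and the exact `repeat` tuple below are illustrative assumptions, not necessarily the code in this PR:

```python
import pystac


class Bench:
    """Shared asv settings for all benchmarks in the suite (illustrative)."""

    # Run 4 rounds of each benchmark instead of asv's default of 2.
    rounds = 4
    # Repeat each benchmark between 10 and 50 times; asv accepts a
    # (min_repeat, max_repeat, max_time_seconds) tuple here.
    repeat = (10, 50, 30.0)


class CatalogBench(Bench):
    """Example benchmark inheriting the shared settings."""

    def time_create_catalog(self) -> None:
        pystac.Catalog(id="bench", description="An empty benchmark catalog")
```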

This is a work-in-progress, but I want to get feedback on the approach before fleshing out the rest.

Still to do:

  • Add timing benchmarks for extension implementations (maybe not all, but at least a few)
  • Add timing and peak memory benchmarks for reading and walking a catalog (see the sketch after this list)
  • Add timing benchmark for writing large catalogs to disk (e.g. on the order of thousands of items)
  • Come up with a strategy for measuring possible performance improvements from implementing async requests. We will probably see the most impact from this when making network requests rather than local file reads, but we don't want our benchmarks to depend on variable network latency.
  • Add documentation
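
As a rough illustration of the timing and peak-memory item above, asv discovers benchmarks by the `time_` and `peakmem_` method prefixes; a minimal sketch for walking a catalog might look like the following (the catalog path is a placeholder, and this is not the code that eventually landed):

```python
import pystac


class WalkCatalog:
    """Hypothetical benchmarks for reading and walking a catalog."""

    def setup(self) -> None:
        # Placeholder path; a real benchmark would point at a generated
        # or vendored test catalog.
        self.path = "tests/data-files/catalogs/test-case-1/catalog.json"

    def time_walk(self) -> None:
        # Time a full read-and-walk of the catalog tree.
        catalog = pystac.Catalog.from_file(self.path)
        for _root, _subcatalogs, items in catalog.walk():
            for _item in items:
                pass

    def peakmem_walk(self) -> None:
        # asv reports the peak memory reached while this method runs.
        catalog = pystac.Catalog.from_file(self.path)
        for _root, _subcatalogs, items in catalog.walk():
            list(items)
```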

PR Checklist:

  • Code is formatted (run `pre-commit run --all-files`)
  • Tests pass (run `scripts/test`)
  • Documentation has been updated to reflect changes, if applicable
  • This PR maintains or improves overall codebase code coverage.
  • Changes are added to the CHANGELOG. See the docs for information about adding to the changelog.

@duckontheweb (Contributor, Author) commented:

@TomAugspurger Would be great to get your feedback as well.

@codecov-commenter commented on Feb 10, 2022

Codecov Report

Base: 94.43% // Head: 94.43% // No change to project coverage 👍

Coverage data is based on head (5b39551) compared to base (c45dd20).
Patch has no changes to coverable lines.

Additional details and impacted files
@@           Coverage Diff           @@
##             main     #748   +/-   ##
=======================================
  Coverage   94.43%   94.43%           
=======================================
  Files          83       83           
  Lines       12056    12056           
  Branches     1143     1143           
=======================================
  Hits        11385    11385           
  Misses        492      492           
  Partials      179      179           


@duckontheweb marked this pull request as draft on February 10, 2022 at 18:53
@gadomski (Member) left a comment:

LGTM. Did you have a running list of other scenarios you'd like to benchmark? One or two that spring to mind are:

  • Mutating the entire tree (e.g. setting the root href and updating all of the links/self hrefs)
  • Summary/extent computation
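
For reference, a rough sketch of what asv benchmarks for those two scenarios could look like (the catalog path, class name, and method bodies are illustrative assumptions, not code from this PR):

```python
import pystac


class TreeMutation:
    """Hypothetical benchmarks for the two scenarios suggested above."""

    def setup(self) -> None:
        # Placeholder path to a test catalog.
        self.catalog = pystac.Catalog.from_file(
            "tests/data-files/catalogs/test-case-1/catalog.json"
        )

    def time_normalize_hrefs(self) -> None:
        # Setting a new root href rewrites the self/child hrefs of every
        # object in the tree.
        self.catalog.normalize_hrefs("/tmp/benchmark-catalog")

    def time_extent_from_items(self) -> None:
        # Recompute a spatial/temporal extent from every item in the tree.
        pystac.Extent.from_items(self.catalog.get_all_items())
```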

@duckontheweb mentioned this pull request on Feb 14, 2022
@duckontheweb (Contributor, Author) commented on Feb 15, 2022

LGTM. Did you have a running list of other scenarios you'd like to benchmark? One or two that spring to mind are:

  • Mutating the entire tree (e.g. setting the root href and updating all of the links/self hrefs)
  • Summary/extent computation

Thanks @gadomski, I have a bit of a list started in #729. I like both of these suggestions, I'll add them to the PR.

@gadomski self-assigned this on Jan 9, 2023
Commit messages pushed to the branch around this time:

  • I'm of the opinion that we _shouldn't_ run benchmarks on GitHub runners, so I'm removing this workflow.
  • This lets simple commands like `asv dev` work out of the box.
  • I'm not really sure how useful this is, but it was asked for, so at least we have something.
  • This doesn't run benchmarks, but just checks to make sure they build.
@gadomski (Member) commented on Jan 9, 2023

I picked this PR up and, in the interest of not letting perfect be the enemy of good, implemented most of @duckontheweb's suggestions:

  • I added benchmarks for reading, walking, and writing large catalogs, including peak memory usage for walking a large catalog
  • I added a single extension benchmark, for the projection extension (a rough sketch of its shape is included after this list). I'm not quite sure what we want to be benchmarking there, so I kept it minimal.
  • I refactored the location of things to make usage a bit simpler (e.g. I want asv dev to work out-of-the-box)
  • I removed CI benchmarking. To paraphrase @TomAugspurger from https://tomaugspurger.github.io/posts/performance-regressions, "This rules out running the benchmarks on ... CI ... even if we could finish it in time, we couldn't really trust the results." It's my personal opinion that benchmarking on CI is so finicky and untrustworthy that it's not worth the complexity. In lieu of a dedicated benchmark machine, IMO it's better to run benchmarks locally when the situation calls for it. At least we have a framework for benchmarking now, which we could always expand in the future to a dedicated machine if it's needed.
  • Docs.
  • Simple check in CI to make sure benchmarks run w/o failure -- doesn't do any reporting.
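
For context, the projection benchmark is roughly of the following shape; this is a simplified sketch of the idea rather than the exact code in this PR:

```python
from datetime import datetime, timezone

import pystac
from pystac.extensions.projection import ProjectionExtension


class ProjectionExtensionBench:
    """Simplified sketch of a projection-extension benchmark."""

    def setup(self) -> None:
        self.item = pystac.Item(
            id="benchmark-item",
            geometry={
                "type": "Polygon",
                "coordinates": [[[0, 0], [0, 1], [1, 1], [1, 0], [0, 0]]],
            },
            bbox=[0, 0, 1, 1],
            datetime=datetime(2023, 1, 9, tzinfo=timezone.utc),
            properties={},
        )

    def time_add_projection(self) -> None:
        # Add the extension schema URI to the item and set a projection field.
        projection = ProjectionExtension.ext(self.item, add_if_missing=True)
        projection.epsg = 4326
```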

I didn't implement:

  • Any sort of async strategy. I think that's best tackled as part of the async improvements themselves (Add async I/O methods [WIP] #749) -- at least, I don't think they should hold up getting some sort of benchmarking into the repo.
  • Extensive extension benchmarking. I'm not sure where the pain points are right now, and I don't want broken benchmarks to get in the way of solutions to Handling STAC extension version upgrades #448.

cc @duckontheweb would love your review if you have the bandwidth, I can't request b/c it's your PR.

IMO this PR should be squash-merged since it's one coherent set of changes w/ some fixup commits -- I can't rebase b/c it's on @duckontheweb's fork.

@gadomski marked this pull request as ready for review on January 9, 2023 at 23:32
@gadomski requested a review from pjhartzell and removed the review request for lossyrob on January 9, 2023 at 23:32
@gadomski changed the title from "Add performance benchmarks" to "[RFC] Add performance benchmarks" on Jan 9, 2023
@pjhartzell (Collaborator) left a comment:

Looking good. This is a nice addition.

Review comments on docs/contributing.rst and benchmarks/import_pystac.py (outdated, resolved)
@pjhartzell (Collaborator) left a comment:

Looks good.

@gadomski merged commit 2aaa162 into stac-utils:main on Jan 11, 2023
@gadomski added this to the 1.7 milestone on Jan 18, 2023
Successfully merging this pull request may close these issues:

  • Establish performance testing framework and benchmarks