
Add performance test #275

Merged (18 commits) Dec 28, 2024

Conversation

@jagerber48 (Contributor) commented Dec 16, 2024

  • Closes Benchmarking tests #274
  • Executed pre-commit run --all-files with no errors
  • The change is fully covered by automated unit tests
  • Documented in docs/ as appropriate
  • Added an entry to the CHANGES file

Add a performance benchmark test. This test is especially important to ensure #262 doesn't introduce a performance regression.

codecov bot commented Dec 16, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 96.61%. Comparing base (969324d) to head (6927711).
Report is 1 commit behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master     #275      +/-   ##
==========================================
+ Coverage   96.50%   96.61%   +0.10%     
==========================================
  Files          16       17       +1     
  Lines        1919     1947      +28     
==========================================
+ Hits         1852     1881      +29     
+ Misses         67       66       -1     
Flag Coverage Δ
macos-latest-3.10 95.06% <100.00%> (+0.12%) ⬆️
macos-latest-3.11 95.01% <100.00%> (+0.07%) ⬆️
macos-latest-3.12 95.01% <100.00%> (+0.07%) ⬆️
macos-latest-3.8 95.01% <100.00%> (+0.07%) ⬆️
macos-latest-3.9 95.01% <100.00%> (+0.07%) ⬆️
no-numpy 75.14% <100.00%> (+0.36%) ⬆️
ubuntu-latest-3.10 95.01% <100.00%> (+0.07%) ⬆️
ubuntu-latest-3.11 95.01% <100.00%> (+0.07%) ⬆️
ubuntu-latest-3.12 95.01% <100.00%> (+0.07%) ⬆️
ubuntu-latest-3.8 95.01% <100.00%> (+0.07%) ⬆️
ubuntu-latest-3.9 95.01% <100.00%> (+0.07%) ⬆️
windows-latest-3.10 95.01% <100.00%> (+0.07%) ⬆️
windows-latest-3.11 95.01% <100.00%> (+0.07%) ⬆️
windows-latest-3.12 95.01% <100.00%> (+0.07%) ⬆️
windows-latest-3.8 95.01% <100.00%> (+0.07%) ⬆️
windows-latest-3.9 95.01% <100.00%> (+0.07%) ⬆️

Flags with carried forward coverage won't be shown.

☔ View full report in Codecov by Sentry.

@jagerber48 (Contributor, Author):

For the raw benchmark threshold I set that str(sum(ufloat(1, 0.1) for _ in range(100000))) should execute within 0.75 * 4 seconds. The 0.75 s baseline came from this code executing in ~0.75 s on my laptop.

However, this threshold was already too stringent for the GitHub Actions runners. My question: how should we set up this benchmark? Should I just increase the factor to 8? Do something else? I'm not sure of a good way to test for a performance regression, since absolute timing is so system-dependent.

@andrewgsavage (Contributor):

There are libraries built specifically for this purpose, rather than writing your own tests like that. pint uses codspeed.

@jagerber48 (Contributor, Author):

Perfect, that's exactly the sort of thing I was looking for. I will try to integrate codspeed.

@jagerber48 (Contributor, Author):

I made a complexity-only test which runs the benchmark for n = [10, 100, ...] and checks that log10(t/t0) equals log10(n/n0) to within 10%. Looking at t vs n on a log plot, the data is very linear, so I expect this test to be relatively robust. It should definitely catch regressions where t becomes proportional to n^2 instead of n. The worry is that it will occasionally fail erroneously when there is no real regression.
[Figure: x axis is n, y axis is t in seconds; solid lines are linear interpolations between points (no fit or anything).]
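The check described above amounts to estimating an empirical complexity exponent from two timings. A small sketch of that arithmetic (the function name is mine, not from the PR):

```python
import math

def scaling_exponent(n0, t0, n, t):
    """Empirical exponent p in t ~ n**p, from a baseline timing
    (n0, t0) and a second timing (n, t):
    p = log10(t / t0) / log10(n / n0)."""
    return math.log10(t / t0) / math.log10(n / n0)

# The PR's assertion
#   0.9 * log10(n / n0) < log10(t / t0) < 1.1 * log10(n / n0)
# is equivalent (for n > n0) to 0.9 < scaling_exponent(...) < 1.1.

# Synthetic illustration: a 10x size increase costing 10x the time is
# linear (p = 1); costing 100x the time is quadratic (p = 2).
print(scaling_exponent(10, 1.0, 100, 10.0))   # → 1.0
print(scaling_exponent(10, 1.0, 100, 100.0))  # → 2.0
```

Because the exponent is a ratio of two timings on the same machine, it cancels out much of the absolute-speed dependence that makes the raw threshold fragile, though timing noise can still push it outside the 10% band.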

@jagerber48 (Contributor, Author):

I'm hitting a variety of issues with establishing the connection to codspeed.io, but I'll keep working at it.

@newville (Member):

@jagerber48 @andrewgsavage Thanks for this. I approved the codspeed integration request.

I also agree with the observation in #274 that the lazy evaluation from @lebigot is very impressive and nicely avoids calculating uncertainties on intermediate quantities that are never needed in isolation. Clever! And, yes, I can see that this would be easy to break when refactoring. So, thanks, and sorry I can't be more help at this time.

@jagerber48 (Contributor, Author) commented Dec 17, 2024

@newville thank you for approving the integration request. However, I see that further configuration is needed: I got a message that the configuration can only be done by an organization owner (which I am not).

This is the configuration that I think needs to be done
[Screenshot: the configuration page described above]

I access it from the repositories page on my codspeed.io account, then I click to import a new repository (+) and configure the lmfit organization. It looks like @newville and @wshanks are owners within the uncertainties team in the lmfit org.

I've never used codspeed, so I don't know whether further "owner-only" configuration will be necessary after this.

@jagerber48
Copy link
Contributor Author

Ok, it looks like I am past the issue from my last comment. I'm not sure whether that is due to someone else doing something, or whether it just started working. Either way, the ball is in my court now to set up the CI to push some runs of the benchmarks up to CodSpeed.
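For reference, hooking a test suite up to CodSpeed from GitHub Actions typically looks something like the sketch below. This is not the workflow from this PR; the action version, job name, install line, and secret name are all assumptions:

```yaml
# Hypothetical workflow sketch, not the PR's actual CI configuration.
name: benchmarks
on: [push, pull_request]

jobs:
  benchmarks:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: "3.12"
      - run: pip install . pytest-codspeed
      - uses: CodSpeedHQ/action@v3
        with:
          token: ${{ secrets.CODSPEED_TOKEN }}
          run: pytest tests/ --codspeed
```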

codspeed-hq bot commented Dec 18, 2024

CodSpeed Performance Report

Congrats! CodSpeed is installed 🎉

🆕 5 new benchmarks were detected.

You will start to see performance impacts in the reports once the benchmarks are run from your default branch.

Detected benchmarks

  • test_repeated_summation_speed[100000] (2.9 s)
  • test_repeated_summation_speed[10000] (275.4 ms)
  • test_repeated_summation_speed[1000] (27.1 ms)
  • test_repeated_summation_speed[100] (2.8 ms)
  • test_repeated_summation_speed[10] (402.3 µs)

assert 0.9 * log10(n / n0) < log10(t / t0) < 1.1 * log10(n / n0)


@pytest.mark.benchmark
Review comment (Contributor):

you can probably parameterise this test with n_list and have it benchmark each num

Reply (Contributor, Author):

Nesting the benchmark and parametrize marks works in my local environment, but it seems to make the GitHub Action hang forever while running the test.

Reply (Contributor, Author):

Ok, got it. The problem was that I was mixing my internal timeit-based benchmarking with what CodSpeed was trying to do. The tool is very nice.

I'll clean up the tests a little now that I know what's going on, then get this ready for review.

@jagerber48 (Contributor, Author):

This is ready for review. @andrewgsavage you seem like the natural person for this one.

return result


def test_repeated_summation_complexity():
Review comment (Contributor):

I've seen a test similar to this that kept randomly failing, so I don't know how reliable this test will be. I think test_repeated_summation_speed will catch any increases in time, so these tests are testing the same thing. I'm not opposed to this test; we can leave it in for now, and if we find it unreliable we can remove it.

I'll add a link to this PR with the graph you plotted, as it helps make sense of this test. It'll be good to see this plotted for your other PR too.

@jagerber48 jagerber48 merged commit 61f688f into lmfit:master Dec 28, 2024
22 checks passed
@jagerber48 jagerber48 deleted the feature/benchmark_test branch December 28, 2024 02:12
Successfully merging this pull request may close these issues.

Benchmarking tests
3 participants