Add warnings about too few or too many samples #210

mdboom · 2024-12-17T19:08:09Z

Related to python/pyperformance#372. Once this is merged, we can reduce the number of iterations in pyperformance.

vstinner · 2024-12-18T08:19:07Z

pyperf/_bench.py

+
+        https://en.wikipedia.org/wiki/Sample_size_determination#Estimation_of_a_mean
+        """
+        # Get the means of the values per run


Why not compute the mean only once, across all values of all runs?

Because for some benchmarks, cache effects are visible within the same process. For example, pylint takes about 30% longer during the first iteration than during the two subsequent iterations. One could argue that's a bad benchmark, but it's common enough that we should control for it. There's some more discussion here: faster-cpython/bench_runner#318 (comment)

That said, it's definitely worth putting a comment about that here.
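
To illustrate the point about per-run means, here is a minimal sketch of the sample-size estimate from the Wikipedia section linked in the docstring, n = 4 Z² σ² / W². It is not the code in this PR: the function name `estimate_required_runs`, its parameters, and the sample timings are all hypothetical, and it assumes a 95% confidence level (Z = 1.96) and a target interval width of 1% of the mean.

```python
import math
import statistics


def estimate_required_runs(runs, z=1.96, width=0.01):
    """Estimate how many process runs are needed for a stable mean.

    Hypothetical helper, not part of pyperf's API. `runs` is a list of
    lists of timing values, one inner list per process.
    """
    # Average within each run first, so that within-process effects such
    # as a slow first iteration do not inflate the spread between samples.
    run_means = [statistics.mean(values) for values in runs if values]
    if len(run_means) < 2:
        return None  # not enough runs to estimate a spread

    mean = statistics.mean(run_means)
    # Relative standard deviation, so `width` is a fraction of the mean.
    sigma = statistics.stdev(run_means) / mean

    # n = 4 * Z^2 * sigma^2 / W^2, where W is the full interval width.
    return math.ceil(4 * z ** 2 * sigma ** 2 / width ** 2)


# Made-up timings: every run has a slow first value, but the runs agree.
runs = [[1.30, 1.00, 1.00], [1.31, 1.00, 1.01], [1.30, 1.01, 1.00]]
per_run = estimate_required_runs(runs)

# Pooling every value treats the warm-up cost as noise between samples.
pooled = estimate_required_runs([[v] for run in runs for v in run])
```

With these made-up numbers, the pooled estimate comes out far larger than the per-run estimate, because the slow first iteration dominates the pooled standard deviation.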

vstinner · 2024-12-18T08:22:10Z

pyperf/_cli.py

+        lines.append(
+            "Consider passing processes=%d to the Runner constructor to save time." %
+            required_nsamples
+        )


This warning may be a little bit annoying. Maybe only show it in the "pyperf check" command? https://pyperf.readthedocs.io/en/latest/cli.html#check-cmd

Yeah, that's a good idea. We can run check in our own infra, which is good enough for me.

vstinner · 2024-12-18T08:22:54Z

pyperf/_bench.py

@@ -424,6 +424,39 @@ def median_abs_dev(self):
            raise ValueError("MAD must be >= 0")
        return value

+    def required_nsamples(self):


If you want to add a public function, please document it at: https://pyperf.readthedocs.io/en/latest/api.html#benchmark-class

Good point. I think we do want it to be public (for the same reason the other statistics methods are public).

vstinner · 2024-12-19T10:33:50Z

pyperf/tests/test_perf_cli.py

@@ -635,6 +628,14 @@ def test_slowest(self):

    def test_check_stable(self):
        stdout = self.run_command('check', TELCO)
+        self.assertTrue(


I suggest using assertIn() instead.
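
A small sketch of why `assertIn()` tends to be preferable: on failure it reports both the expected substring and the text that was searched, while `assertTrue()` only reports that the expression was false. The strings below are placeholders, not real `pyperf check` output, and `failure_message` is a hypothetical helper written for this example.

```python
import unittest


def failure_message(check):
    """Run an assertion callable and return its failure message, if any."""
    try:
        check()
    except AssertionError as exc:
        return str(exc)
    return None


case = unittest.TestCase()
stdout = "placeholder output"   # hypothetical command output
needle = "expected warning"     # hypothetical substring to look for

# assertTrue only sees the boolean result of the `in` expression.
msg_true = failure_message(lambda: case.assertTrue(needle in stdout))

# assertIn sees both operands and includes them in the message.
msg_in = failure_message(lambda: case.assertIn(needle, stdout))
```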

Co-authored-by: Victor Stinner <vstinner@python.org>

vstinner

The code change LGTM. I didn't check the maths behind required_nprocesses().

Add warnings about too few or too many samples

766ecdc

mdboom requested a review from vstinner December 17, 2024 19:08

mdboom added 2 commits December 17, 2024 14:20

Fix typo

0e3dbc1

Fix tests

7c47af1

vstinner reviewed Dec 18, 2024

mdboom added 2 commits December 18, 2024 14:03

Address comments in the PR

a685a14

Don't compute required_nprocesses unless necessary

56dfad1

mdboom requested a review from vstinner December 18, 2024 19:08

vstinner reviewed Dec 19, 2024

Update pyperf/_bench.py

980003e

Co-authored-by: Victor Stinner <vstinner@python.org>

vstinner approved these changes Dec 19, 2024

Use assertIn

a02bff9

mdboom merged commit bbc8e9f into psf:main Dec 19, 2024
11 checks passed
