Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

testing: b.Loop gives bogus results in some situations #72971

Closed
aclements opened this issue Mar 20, 2025 · 8 comments
Closed

testing: b.Loop gives bogus results in some situations #72971

aclements opened this issue Mar 20, 2025 · 8 comments
Assignees
Labels
BugReport Issues describing a possible bug in the Go implementation. FixPending Issues that have a fix which has not yet been reviewed or submitted.
Milestone

Comments

@aclements
Copy link
Member

aclements commented Mar 20, 2025

Go version

go version go1.24.0 linux/amd64

What did you do?

There are at least two user errors that testing.B.Loop could detect, but that currently lead to bogus benchmark results instead:

  • Returning from a benchmark function before b.Loop() has returned false.
  • Stopping the timer in the benchmark loop without starting it again.

What did you see happen?

Here are two examples:

package main

import "testing"

func BenchmarkBreak(b *testing.B) {
	for i := 0; b.Loop(); i++ {
		if i == 2 {
			break
		}
	}
}

func BenchmarkStop(b *testing.B) {
	for i := 0; b.Loop(); i++ {
		b.StopTimer()
	}
}

With Go 1.24 (and tip as of this writing), these produce bogus results:

BenchmarkBreak-8   	     100	         5.950 ns/op
BenchmarkStop-8    	458912020	         0.0000006 ns/op

With the fix for #72922, BenchmarkStop actually loops forever, until the whole test binary times out.

What did you expect to see?

I expected to see useful errors, and certainly not a timed out benchmark.

@aclements aclements added this to the Go1.25 milestone Mar 20, 2025
@aclements aclements self-assigned this Mar 20, 2025
@aclements
Copy link
Member Author

cc @JunyangShao

I have some CLs to fix this, which I'll send shortly.

@gabyhelp
Copy link

@gabyhelp gabyhelp added the BugReport Issues describing a possible bug in the Go implementation. label Mar 20, 2025
@aclements
Copy link
Member Author

Good gaby. This is indeed a partial duplicate of #72933.

@gopherbot
Copy link
Contributor

Change https://go.dev/cl/659657 mentions this issue: testing: detect a stopped timer in B.Loop

@gopherbot
Copy link
Contributor

Change https://go.dev/cl/659655 mentions this issue: testing: separate b.Loop counter from b.N

@gopherbot
Copy link
Contributor

Change https://go.dev/cl/659656 mentions this issue: testing: detect early return from B.Loop

@aclements
Copy link
Member Author

@gopherbot, please open a backport to Go 1.24.

I'm on the fence about whether this should be backported. It makes b.Loop, which is new in Go 1.24, more robust to user mistakes, but those same mistakes could have been made with the old b.N style benchmarks, we just couldn't catch them. In support of backporting, it is a brand new feature and we're going to post a blog post about it soon so it'll probably get a spike in usage.

@gopherbot
Copy link
Contributor

Backport issue(s) opened: #72974 (for 1.24).

Remember to create the cherry-pick CL(s) as soon as the patch is submitted to master, according to https://go.dev/wiki/MinorReleases.

gopherbot pushed a commit that referenced this issue Mar 21, 2025
Currently, b.Loop uses b.N as the iteration count target. However,
since it updates the target as it goes, the behavior is quite
different from a b.N-style benchmark. To avoid user confusion, this CL
gives b.Loop a separate, unexported iteration count target. It ensures
b.N is 0 within the b.Loop loop to help catch misuses, and commits the
final iteration count to b.N only once the loop is done (as the
documentation states "After Loop returns false, b.N contains the total
number of iterations that ran, so the benchmark may use b.N to compute
other average metrics.")

Since there are now two variables used by b.Loop, we put them in an
unnamed struct. Also, we rename b.loopN to b.loop.i because this
variable tracks the current iteration index (conventionally "i"), not
the target (conventionally "n").

Unfortunately, a simple renaming causes B.Loop to be too large for the
inliner. Thus, we make one simplification to B.Loop to keep it under
the threshold. We're about to lean into that simplification anyway in
a follow-up CL, so this is just temporary.

Prep for #72933 and #72971.

Change-Id: Ide1c4f1b9ca37f300f3beb0e60ba6202331b183e
Reviewed-on: https://go-review.googlesource.com/c/go/+/659655
LUCI-TryBot-Result: Go LUCI <golang-scoped@luci-project-accounts.iam.gserviceaccount.com>
Reviewed-by: Junyang Shao <shaojunyang@google.com>
Auto-Submit: Austin Clements <austin@google.com>
@dmitshur dmitshur added the FixPending Issues that have a fix which has not yet been reviewed or submitted. label Mar 21, 2025
gopherbot pushed a commit that referenced this issue Mar 24, 2025
Currently, if a benchmark function returns prior to B.Loop() returning
false, we'll report a bogus result. While there was no way to detect
this with b.N-style benchmarks, one way b.Loop()-style benchmarks are
more robust is that we *can* detect it.

This CL adds a flag to B that tracks if B.Loop() has finished and
checks it after the benchmark completes. If there was an early exit
(not caused by another error), it reports a B.Error.

Fixes #72933.
Updates #72971.

Change-Id: I731c1350e6df938c0ffa08fcedc11dc147e78854
Reviewed-on: https://go-review.googlesource.com/c/go/+/659656
LUCI-TryBot-Result: Go LUCI <golang-scoped@luci-project-accounts.iam.gserviceaccount.com>
Auto-Submit: Austin Clements <austin@google.com>
Reviewed-by: Michael Pratt <mpratt@google.com>
Reviewed-by: Junyang Shao <shaojunyang@google.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
BugReport Issues describing a possible bug in the Go implementation. FixPending Issues that have a fix which has not yet been reviewed or submitted.
Projects
None yet
Development

No branches or pull requests

4 participants