proposal: spec: redefine range loop variables in each iteration #20733

bisgardo · 2017-06-19T20:08:27Z

As an alternative to #20725 (which was originally about go vet), let the variables of range loops be implicitly redefined in each iteration like in Dart's for loops. That is,

for k, v := range vals {
  // ...
}

should be equivalent to

for k, v := range vals {
  k := k
  v := v
  // ...
}

This will make it "safe" to take the loop variable's addresses as well as capturing the loop variables in nested functions (see #16520).

The proposal could be expanded to vanilla for loops, although that would make it diverge semantically from other languages.

The text was updated successfully, but these errors were encountered:

bradfitz · 2017-06-19T20:10:42Z

We considered this prior to Go 1, but considered it too quickly. We'll definitely reconsider it for Go 2. I actually think there's a duplicate tracking bug for this.

davecheney · 2017-06-19T22:03:30Z

I think this would be an important improvement to a situation that catches every go developer at least once in their efforts to learn Go.

…

On Tue, 20 Jun 2017, 06:10 Brad Fitzpatrick ***@***.***> wrote: We considered this prior to Go 1, but considered it too quickly. We'll definitely reconsider it for Go 2. I actually think there's a duplicate tracking bug for this. — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#20733 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AAAcA80GqcY1F4zKSo-optHwPwFOb1Jpks5sFtXGgaJpZM4N-uiT> .

dsnet · 2017-06-19T22:23:01Z

And I would argue even catches seasoned Go programmers occasionally. See https://golang.org/cl/40937. It took me a while staring at that logic trying to figure out why the code wasn't doing what it naturally read as.

cznic · 2017-06-20T06:05:54Z

Adopting this proposal can make performance of perfectly valid code worse, sometimes dramatically. For example, when escape analysis cannot determine it's safe to keep the {k,v} value on stack while it's address is taken. Then we get O(n) heap allocations instead of the current O(1).

davecheney · 2017-06-20T06:10:33Z

@cznic can you give some examples of this. I would expect that most of these cases would actually be bugs because taking the address of the same variable over and over again was the problem in the first place.

cznic · 2017-06-20T08:57:27Z

@davecheney Consider #20725 (comment) and imagine escape analysis is not able to prove foo only reads via the passed pointer.

davecheney · 2017-06-20T09:24:56Z

@cznic

Why would you want to take the address of a copy of the current range value? If i'm not mistaken, any modification to it would be lost.

cznic · 2017-06-20T09:43:23Z

@davecheney

Why would you want to take the address of a copy of the current range value?

To avoid copying a big value when passing it as an argument.
To call a pointer receiver method on the value.

If i'm not mistaken, any modification to it would be lost.

Taking the address of a value does not imply intent to modify the pointee.

davecheney · 2017-06-20T10:09:51Z

To avoid copying a big value when passing it as an argument.

But the value is already copied as part of the range iteration. Having a separate value escape on every iteration is sub optimal, but hopefully you'll agree it's reasonable to trade a potential performance problem for a very common correctness problem.

To call a pointer receiver method on the value.

True, that is asking a lot of escape analysis and I really don't know the ways that escape analysis and the implicit taking of address interact. But I will note that your example explicitly took the address of the copy of the current index value, so if the intention was to pass a pointer to the value down to a function, you'd want to make sure you were passing a pointer to the original slice element, not a copy. In the code that I have read that is more commonly written as

for i := range vals {
     foo(&vals[i])
}

Taking the address of a value does not imply intent to modify the pointee.

Respectfully, generally it does, as in your method example above.

cznic · 2017-06-20T10:29:40Z

foo(&vals[i])

Iff vals is a slice/array. There are also maps and channels.

Respectfully, generally it does, as in your method example above.

Even the innocent looking fmt.Println(value) always takes the address of 'value'. As does every function taking an interface (not only interface{}, any interface) argument. Every assignment to an interface variable takes the address of the RHS (if not already a pointer), etc.

Edit: Added '(if not already a pointer)'.

davecheney · 2017-06-20T10:43:49Z

Iff vals is a slice/array. There are also maps and channels.

If it's an array, it's already copied during the evaluation of the range statement. If it's a channel, it's a copy of the value received. Again i'm falling back on arguments of correctness over performance.

Even the innocent looking fmt.Println(value) always takes the address of 'value'.

I believe a copy of value is passed to fmt.Println. That copy is almost always heap allocated due to a limitation introduced in the Go 1.5 garbage collector.

cznic · 2017-06-20T10:52:15Z

If it's an array, it's already copied during the evaluation of the range statement. If it's a channel, it's a copy of the value received.

Right. But with this proposal there will be, in certain situations, an additional allocation per iteration.

I believe a copy of value is passed to fmt.Println. That copy is almost always heap allocated due to a limitation introduced in the Go 1.5 garbage collector.

You're right. The interface runtime struct has the address, but that's not different with this proposal. I misspoke, sorry.

bcmills · 2017-06-21T18:15:35Z

@cznic

with this proposal there will be, in certain situations, an additional allocation per iteration.

Sure, but if it actually shows up on a profile there should be an easy workaround:

var bigValue someType
for _, v := range bigValues {
        bigValue = v  // Compiler should be smart enough to elide this copy.
        foo(&bigValue)  // Only one copy of bigValue escapes.
}

It is possible to write correct, efficient code with either definition of range loops; the question is mainly which should be the "default" or "typical" case. I believe that, in real usage, the proposed change would fix far more issues — and remove more workarounds — than it would introduce.

cznic · 2017-06-21T18:33:34Z

@bcmills

It is possible to write correct, efficient code with either definition of range loops;

True, but still the proposal has the potential to cripple existing correct and efficient code. Also, your comment does not consider v.foo(). We've agreed earlier with @davecheney that for escape analysis of pointer receivers it is sometimes, if not often, impossible to prove they are safe-for-stack. And, again, "Then we get O(n) heap allocations instead of the current O(1)."

bcmills · 2017-06-21T18:51:19Z

@cznic

still the proposal has the potential to cripple existing correct and efficient code.

For this proposal (and Go 2 proposals in general), if the proposal is accepted we should either compile existing Go 1 programs as-is or provide a tool that converts them preserving existing semantics. That is: no "existing" Go 1 code should be affected either way.

for escape analysis of pointer receivers it is sometimes, if not often, impossible to prove they are safe-for-stack

Escape analysis of pointer methods is mostly straightforward; you only really need to worry about arguments to interface methods. And note that iterating over a container of n values is already O(n), so doing O(n) memory-management work changes only the constant factors (not the asymptotic behavior).

cznic · 2017-06-21T19:01:54Z

@bcmills

That is: no "existing" Go 1 code should be affected either way.

I don't think this can be achieved without solving the halting problem. But let's assume it can be done. Even then we would have to put the particular compiler optimization that you've mentioned earlier, directly in the specification of the language. That can be done, no doubt. Isn't it better to avoid such leaks of abstractions? I think so.

bcmills · 2017-06-21T19:14:06Z

@cznic

we would have to put the particular compiler optimization that you've mentioned earlier, directly in the specification of the language.

Why? Generally, the only time an optimization needs to go into a language spec is if it affects the asymptotic behavior of the program, the canonical example being tail-call-elimination in functional languages (which changes memory usage from O(1) to O(N) with the depth of recursion). As I noted above, per-iteration memory-management overhead affects only constant factors: running time for loops is already O(N) with the number of iterations, and peak memory usage is O(1) either way (because the value can be garbage-collected at each iteration).

bisgardo · 2017-06-22T08:50:27Z

@cznic

the proposal has the potential to cripple existing correct and efficient code

If by "cripple" you mean introduce bugs, this will only happen in code that depends on a variable having been overwritten after its capture. I find it quite hard to imaging cases where this would be a requirement for correctness (would appreciate examples). But even if it is, it's easy to make a tool (see #20725; original formulation) that would flag all relevant code for scrutinization as part of a migration process.

As to the performance argument, I believe that if a good escape analysis can't figure out where a pointer is going, then neither can programmers (or it does in fact escape the loop). Thus, if such code is correct, it's probably by accident and not for performance reasons.

martisch · 2017-06-22T18:55:04Z

If by "cripple" you mean introduce bugs, this will only happen in code that depends on a variable having been overwritten after its capture.

I thought maybe something like the below just with := can be constructed where this happens too:

	x := []int{1, 2, 3, 4, 5}
	var i int
	for x[i], i = range x {
		//...
		i = 2
	}

Edit: on closer look i guess currently := allows only simple variable names so this is not an issue.

leonklingele · 2017-10-03T15:41:10Z

As this does not require changes to the for-range syntax, it is still valid Go 1 code.
What if one tries to run some Go 2 code using such a for-range loop with a Go 1 compiler? I can already see people complaining "this code works fine in Go 2 but misbehaves for whatever reason in Go 1".
This proposal also should add a way to break compilation with Go 1 (e.g. a // +build !go1 annotation) which no one will ever use.

bcmills · 2017-11-29T16:10:01Z

Ran across this interesting example today (https://play.golang.org/p/DUstLropfJ):

…

func (k *key) f(v string) {
	fmt.Printf("%#v.f(%#v)\n", k, v)
}

func main() {
	…
	for k, v := range m {
		defer k.f(v)
	}
}

The interaction between loop variables and implicit pointer receivers makes it possible to accidentally close over a loop variable without taking its address or writing a function literal.

Merovius · 2018-01-13T08:17:38Z

For posterity, today someone posted on golang-nuts about stumbling into this with (*testing.T).Parallel:
https://groups.google.com/d/msg/golang-nuts/SAZ6wCSLXU0/F-TtRwDCAAAJ
As a further datapoint to fix this in Go2.

With regards to the performance-concerns: I'd expect the difference, in general, to be optimized out. Even if there are still edge-cases: Given the subtlety and prevalence of this problem, I'd strongly favor correctness over performance. If your code is too slow, you can always benchmark and hand-optimize. If your code is subtly incorrect, that's a lot harder to remedy.

davecb · 2018-05-09T12:37:36Z

On 08/05/18 06:13 PM, Bryan C. Mills wrote:

So I see the cost/benefit tradeoff of a breaking change as:
Costs
Update code generators.
Reprogram users' muscle memory.
go fix existing code, which may or may not be necessary anyway (depending on what else goes into Go 2).

Benefits
Prompt users to omit redundant workarounds.
(This proposal.) Make the common / default cases more concise.

It's up to the proposal committee to weigh those costs and benefits; the balance is not obvious to me.
The point of this proposal is to lay out some benefits on the “breaking change” side that might not be obvious otherwise.

From previous experience with managed change (which dates back to Multics and dinosaurs roaming the earth (:-)) one of the largest problems of trying to maintain compatibility is that it can lead to persons depending on a bug, and thereby preventing it from ever being fixed.

I would argue that when this kind of deranged dependency occurs, that in order one

introduces the new capability,

provides a means of migration from old to new,

```
marks the old as deprecated, and
```
```
withdraws the old
```

In the special case of linked-library changes, the caller may have lost the source code they need to change, so we instead made the last step

require positive action to continue to use the old (in our case that was a linker option)

Go's case is not as hard: if you can do steps 1-4, then I strongly recommend you do so.

--dave

gopherbot · 2019-02-27T17:29:40Z

Change https://golang.org/cl/164119 mentions this issue: cmd/link: do not close over the loop variable in testDWARF

Fixes #30429 Updates #16520 Updates #20733 Change-Id: Iae41f06c09aaaed500936f5496d90cefbe8293e4 Reviewed-on: https://go-review.googlesource.com/c/164119 Run-TryBot: Bryan C. Mills <bcmills@google.com> TryBot-Result: Gobot Gobot <gobot@golang.org> Reviewed-by: Cherry Zhang <cherryyz@google.com> Reviewed-by: Ian Lance Taylor <iant@golang.org>

danielchatfield · 2020-07-27T15:00:12Z

I'm very supportive of this change. For code that really needs to be optimised then the allocation can be avoided by declaring the variables outside the loop like this:

var (
	i int
	v string
)
for i, v = range []string{"foo", "bar"} {
	println(i, v)
}

candlerb · 2021-01-02T22:08:07Z

Last year there was high profile example of this class of bug:
https://community.letsencrypt.org/t/2020-02-29-caa-rechecking-bug/114591
https://www.theregister.com/2020/03/03/lets_encrypt_cert_revocation/
Detailed analysis: https://bugzilla.mozilla.org/show_bug.cgi?id=1619047
Fix: https://github.com/letsencrypt/boulder/pull/4690/files#diff-d02067a9f9a2bed1110fd4e98641c2effcf5d1d5f18461e35d6ac1535f6e2c21L1411-R1414

earthboundkid · 2022-03-16T19:55:47Z

I think this can be done now by keeping it behind a go.mod version flag. When you run go mod edit version -go=1.XX it should scan for existing closure captures and convert them to safe equivalents as part of the upgrade. I think it's doable. E.g. it would change this:

for i := 0; i < n; i++ {
  use(&i)
}

to this as part of the upgrade:

{ 
  i := 0
  for ; i < n; i++ {
    use(&i)
  }
}

thepudds · 2022-03-16T20:15:59Z

Hi @carlmjohnson, FWIW, the Go language changes design document uses this issue here as an example:

I think the only feasible safe approach is to not permit language redefinitions.

We are stuck with our current semantics. This doesn't mean we can't improve them. For example, for issue 20733, the range issue, we could change range loops so that taking the address of a range parameter, or referring to it from a function literal, is forbidden. This would not be a redefinition; it would be a removal. That approach might eliminate the bugs without the potential of breaking code unexpectedly.

That document also describes using the go directive in go.mod to control the version of the language used by each module (with some additional discussion & refinement in #28221).

So I suspect it is at least possible to remove this as a feature. One minor comment is I suspect go mod edit would be not used to change source code; it would more likely be go fix from what I understand.

mpx · 2022-10-04T23:43:24Z

FYI, there is a discussion covering how to redefine "for" loop variable semantics on #56010.

This commit fixes a bug that was recently introduced when registering the costfuzz and unoptimized-query-oracle tests. This bug was due to the well-known range loop variable issue described here: golang/go#20733. This commit also adds some additional logging and debug information to help with future issues with these test cases. Fixes cockroachdb#90010 Release note: None

90221: roachtest: fix costfuzz and unoptimized-query-oracle setups r=rytaft a=rytaft This commit fixes a bug that was recently introduced when registering the costfuzz and unoptimized-query-oracle tests. This bug was due to the well-known range loop variable issue described here: golang/go#20733. This commit also adds some additional logging and debug information to help with future issues with these test cases. Fixes #90010 Release note: None Co-authored-by: Rebecca Taft <becca@cockroachlabs.com>

rsc · 2023-05-09T16:51:51Z

Closing as duplicate of #60078.

gopherbot added this to the Proposal milestone Jun 19, 2017

gopherbot added the Proposal label Jun 19, 2017

bradfitz added v2 An incompatible library change LanguageChange Suggested changes to the Go language labels Jun 19, 2017

ianlancetaylor added the NeedsDecision Feedback is required from experts, contributors, and/or the community before a change can be made. label Feb 27, 2018

This was referenced Mar 6, 2018

proposal: spec: improve for-loop ergonomics #24282

Open

proposal: spec: add const literals for reference types like structs, maps, and arrays #21130

Open

thepudds mentioned this issue Jun 6, 2019

proposal: start using semantic versions for Go releases #32450

Closed

gopherbot removed the NeedsDecision Feedback is required from experts, contributors, and/or the community before a change can be made. label Aug 16, 2019

gopherbot added the NeedsDecision Feedback is required from experts, contributors, and/or the community before a change can be made. label Sep 3, 2019

This comment has been minimized.

Sign in to view

bcmills mentioned this issue Jan 21, 2020

staticcheck: detect incorrectly used pointers to loop variables dominikh/go-tools#163

Open

ianlancetaylor mentioned this issue Feb 5, 2020

proposal: Go 2: fix range loopclosure bug #37061

Closed

ianzhang366 mentioned this issue May 5, 2020

possible synchronizer refactor solution stolostron/multicloud-operators-subscription#182

Merged

kalexmills mentioned this issue Nov 22, 2020

escape analysis for reference types github-vet/bots#16

Closed

tc-hib mentioned this issue Dec 5, 2020

Fix test TestServer alexandrevicenzi/go-sse#27

Closed

joelrebel mentioned this issue Oct 18, 2021

Redfish useraccounts bmc-toolbox/bmclib#242

Merged

zpavlinovic added the Analysis Issues related to static analysis (vet, x/tools/go/analysis) label Mar 21, 2022

earthboundkid mentioned this issue Jul 25, 2022

proposal: Go 2: add a new iterator syntax, package, interfaces #54047

Closed

rytaft mentioned this issue Oct 19, 2022

roachtest: fix costfuzz and unoptimized-query-oracle setups cockroachdb/cockroach#90221

Merged

bobheadxi mentioned this issue Jan 5, 2023

pool: fix loop variables in tests sourcegraph/conc#14

Merged

nykma mentioned this issue Feb 17, 2023

Add the slack platform support NextDotID/proof_server#71

Merged

rsc closed this as completed May 9, 2023

Xanonymous-GitHub mentioned this issue Dec 7, 2023

Fix owner reference not normally exported from VirtualFileSystem.selectUser TheRiseOfDavid/assignment_VFS#1

Open

enrichman mentioned this issue Jan 30, 2024

Add status field for global roles rancher/rancher#44246

Merged

golang locked and limited conversation to collaborators May 22, 2024

gopherbot added the FrozenDueToAge label May 22, 2024

proposal: spec: redefine range loop variables in each iteration #20733

proposal: spec: redefine range loop variables in each iteration #20733

Comments

bisgardo commented Jun 19, 2017

bradfitz commented Jun 19, 2017

Uh oh!

davecheney commented Jun 19, 2017 via email

Uh oh!

dsnet commented Jun 19, 2017

Uh oh!

cznic commented Jun 20, 2017

Uh oh!

davecheney commented Jun 20, 2017

Uh oh!

cznic commented Jun 20, 2017

Uh oh!

davecheney commented Jun 20, 2017

Uh oh!

cznic commented Jun 20, 2017

Uh oh!

davecheney commented Jun 20, 2017

Uh oh!

cznic commented Jun 20, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

davecheney commented Jun 20, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cznic commented Jun 20, 2017

Uh oh!

bcmills commented Jun 21, 2017

Uh oh!

cznic commented Jun 21, 2017

Uh oh!

bcmills commented Jun 21, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cznic commented Jun 21, 2017

Uh oh!

bcmills commented Jun 21, 2017

Uh oh!

bisgardo commented Jun 22, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

martisch commented Jun 22, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

leonklingele commented Oct 3, 2017

Uh oh!

bcmills commented Nov 29, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Merovius commented Jan 13, 2018

Uh oh!

davecb commented May 9, 2018

Uh oh!

gopherbot commented Feb 27, 2019

Uh oh!

This comment has been minimized.

danielchatfield commented Jul 27, 2020

Uh oh!

candlerb commented Jan 2, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

earthboundkid commented Mar 16, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

thepudds commented Mar 16, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mpx commented Oct 4, 2022

Uh oh!

rsc commented May 9, 2023

Uh oh!

cznic commented Jun 20, 2017 •

edited

Loading

davecheney commented Jun 20, 2017 •

edited

Loading

bcmills commented Jun 21, 2017 •

edited

Loading

bisgardo commented Jun 22, 2017 •

edited

Loading

martisch commented Jun 22, 2017 •

edited

Loading

bcmills commented Nov 29, 2017 •

edited

Loading

candlerb commented Jan 2, 2021 •

edited

Loading

earthboundkid commented Mar 16, 2022 •

edited

Loading

thepudds commented Mar 16, 2022 •

edited

Loading