[SUGGESTION] `for` loop is ambiguous with lambda syntax #386

msadeqhe · 2023-04-20T13:22:31Z

Preface

Cpp2 has the following control structures:

if CONDITION { ... }
else if CONDITION { ... }
...
else { ... }

while CONDITION next STEP { ... }

do { ... }
while CONDITION next STEP;

for ITEMS do: (ITEM) = { ... }

While I always thought that for loop gets a lambda, and because of that I started to think that Cpp2 should have user-defined language constructs. But NO, hopefully the statement block of for loop is not a lambda.

All control structures of Cpp2 have statement blocks, therefore we don't need to capture anything inside the statement block of for loop.

How to reproduce?

It's an example:

#include <iostream>
#include <vector>
#include <string>

main: () = {
    items: std::vector<int> = (1, 2, 3);
    r: = 10;

    for items do: (item) = {
        r = 0;
        std::cout << item << "\n";
    }
}

A typical programmer would think that : (item) = { ... } is a lambda because of its similarity with lambda syntax, but hopefully it's not. Cpp2 generates the following code:

auto main() -> int{
    std::vector<int> items {1, 2, 3}; 
    auto r {10}; 

    for ( auto const& cpp2_range = items;  auto const& item : cpp2_range ) {
        r = 0;
        std::cout << item << "\n";
    }
}

Suggestion Detail

I didn't know I should create a bug report or a suggestion for ambiguous problem of for loop. By the way I created a suggestion.

I suggest to change the syntax of for loop from:

for ITEMS do: (ITEM) = {
    ...
}

to something consistent with other control structures such as one of the following syntax:

for ITEM keyword ITEMS {
    ...
}

for ITEMS keyword ITEM {

}

In which:

keyword for the first option can be in, of, iter or etc.
keyword for the second option can be next, each, loop or etc.

Any other syntax which is similar to other control structures, is good.

Why do I suggest this change?

Because:

The statement block of for loop syntax : (item) = { ... } is ambiguous with lambda syntax. A typical programmer expects to capture variables inside it.
The statement block of for loop syntax : (item) = { ... } is inconsistent with other control structures which their statement block are simply { ... }.

Your Questions

Will your feature suggestion eliminate X% of security vulnerabilities of a given kind in current C++ code?

No.

Will your feature suggestion automate or eliminate X% of current C++ guidance literature?

Yes absolutely. It would make control structures to be consistent with each other in addition to simplification of guidance:

{ ... } will be for the statement block of control structures, therefore variables don't have to be captured.
But : (args) = { ... } will be for lambdas, therefore variables have to be captured. for loop breaks this rule.
Expressions in control structures (e.g. the expression after next keyword in while loop) won't be evaluate at first evidence, and they can be evaluated multiple times.
But expressions in other places (e.g. for variable initialization or function arguments) will be evaluated one time at first evidence.

Considered alternatives

The other alternative solution is to make the statement block of for loop to be actually a lambda, which is weird and inconsistent with other control structures.

The text was updated successfully, but these errors were encountered:

JohelEGP · 2023-04-20T13:43:53Z

The syntax is supposed to make the language more regular, as a for loop's body is parametrized on the element being iterated.

I did find it a bit weird, but only because of the association with function expressions and the knowledge they're lowered to Cpp1 lambda expressions.

Otherwise, I think it's a win. The function syntax is regular. At namespace scope and type scope, they declare functions. At statement scope, they're currently banned. As expressions, they work and permit captures. And finally, they work as the for loop's parametrized statements' body (no scope introduced, and I suppose captures not permitted).

msadeqhe · 2023-04-20T13:47:51Z

Otherwise, I think it's a win. The function syntax is regular. At namespace scope and type scope, they declare functions. At statement scope, they're currently banned. As expressions, they work and permit captures. And finally, they work as the for loop's parametrized statements' body (no scope introduced, and I suppose captures not permitted).

IMO that's why it complicates the language. It just feels like another static keyword in Cpp1 which has many meanings in different places.

JohelEGP · 2023-04-20T13:51:05Z

Consider the upcoming template for. In Cpp2, that's a noop. cppfront would just have to permit <T> before :(item), and emit template before for in that case.

EDIT: It's https://wg21.link/P1306, for....

msadeqhe · 2023-04-20T13:57:26Z

That's nice, but { ... } doesn't cause trouble for for... in generic programming:

func: <T: type...> (args: T...) = {
    for... arg: T in args {
        //: statements...
    }
}

or this one:

func: <T: type...> (args: T...) = {
    for... args next arg: T {
        //: statements...
    }
}

or any other syntax without : (arg) = { ... }.

JohelEGP · 2023-04-20T14:10:37Z

Sorry, that was a bad example. "Expansion statements", proposed for Cpp1, operates on values. So it'd probably be a matter of permitting for... in Cpp2, too.

hsutter · 2023-04-20T17:29:42Z

Thanks for the feedback. The status quo is by design, because the body of a range-for loop (not a do or while loop) is already conceptually a function invoked once per element in the range.

Note that capture is already not allowed in all functions. In particular, named functions already don't allow capture. Perhaps it helps if you view the body of a for loop as being a function named do?

msadeqhe · 2023-04-20T20:18:36Z

Although it's not possible to have nested functions in Cpp2, but yes, it helps if I view it as a function named do.

hsutter · 2023-04-20T21:00:18Z

BTW, Cpp2 will have local functions. They're pretty much there, I just ban them as a 'temporary alpha limitation' only because I haven't been motivated to prioritize the task of emitting them as a lambda and making sure they can capture. It shouldn't be hard, but I've wanted to drive other things like meta functions first. Also, they have a pretty easy workaround (as in Cpp1 today) which is in the error message you get if you try to write one:

(temporary alpha limitation) local functions like 'f: (/*params*/) = {/*body*/}' are not currently supported - write a local variable initialized with an unnamed function like 'f := :(/*params*/) = {/*body*/};' instead (add '=' and ';')

That said, I have said the same about other features and then ended up prioritizing and implementing them anyway already because they came up in issues... squeaky wheels do get grease, and programmers asking for a feature and showing concrete use cases is new information that justifies revisiting priority. So if you want cppfront to prioritize implementing nested functions, get friends to open issues and show use cases. 😁

filipsajdak · 2023-04-20T21:21:40Z

Or... propose implementation in PR. It might save some Herb time (hopefully). I can help in reviewing the code.

msadeqhe · 2023-04-21T06:46:35Z

Thank you. I think I have to write a conclusion for future readers if they come up to this issue.

I'm reading again this comment from @JohelEGP:

Otherwise, I think it's a win. The function syntax is regular. At namespace scope and type scope, they declare functions. At statement scope, they're currently banned. As expressions, they work and permit captures. And finally, they work as the for loop's parametrized statements' body (no scope introduced, and I suppose captures not permitted).

And I put it next to this informative comment from @hsutter:

BTW, Cpp2 will have local functions. They're pretty much there, I just ban them as a 'temporary alpha limitation' only because I haven't been motivated to prioritize the task of emitting them as a lambda and making sure they can capture. It shouldn't be hard, but I've wanted to drive other things like meta functions first. ...

So I should give up to separate functions and lambdas for learning Cpp2, because they are implementation detail. To simplify the learning progress I should keep in mind that there is only functions (either named or unnamed). Thus I should change my comment from:

Yes absolutely. It would make control structures to be consistent with each other in addition to simplification of guidance:

{ ... } will be for the statement block of control structures, therefore variables don't have to be captured.

But : (args) = { ... } will be for lambdas, therefore variables have to be captured. for loop breaks this rule.

Expressions in control structures (e.g. the expression after next keyword in while loop) won't be evaluate at first evidence, and they can be evaluated multiple times.

But expressions in other places (e.g. for variable initialization or function arguments) will be evaluated one time at first evidence.

... to this one:

Conclusion

Cpp2 have functions and local functions (either named or unnamed). { ... } is a statement block. : (args) = { ... } is a function. The following rules apply to them:

Expressions (e.g. for variable initialization, function arguments and ...) will be evaluated one time at first evidence in their place, because they have to be evaluated before they can be passed to a function. Also to access local variables which are from outer function scope, we have to capture them inside statement blocks (e.g. either { ... } or : (args) = { ... }):

...
variable: = 0;
forrr(items, : (item) = {
    ...
    // We have to capture `variable`.
    x: = variable$ + 10;
    ...
});


count: = 0;
// `count < 10` will be evaluated only one time at first evidence in their place.
// `count++` will be evaluated only one time at first evidence in their place.
// So this doesn't do the same thing that `while` does.
whileee(count < 10, count++, : () = {
    ...
});
...

But in control structures, expressions (e.g. the expression after next keyword in while loop) won't be evaluate at first evidence, and they can be evaluated multiple times (they are somehow macros), because of that, to access local variables from outer function scope, we don't have to capture them inside statement blocks (e.g. either { ... } or : (args) = { ... }):

...
variable: = 0;
for items do: (item) = {
    ...
    // We don't have to capture `variable`.
    x: = variable + 10;
    ...
}

count: = 0;
// `count < 10` will be evaluated multiple times.
// `count++` won't be evaluated at first time but it will be evaluated later multiple times.
while count < 10 next count++ {
    ...
}
...

In this way Cpp2 language is consistent with simple rules. Thanks for your guidance from those comments.

msadeqhe · 2023-04-21T06:52:10Z

Or... propose implementation in PR. It might save some Herb time (hopefully). I can help in reviewing the code.

@filipsajdak, You're right, it's the best way to suggest features in Cpp2. But I hope I could... I don't have any experience or study in transpilers, compilers and etc... I'm going to read the source code to try my luck to understand it.

filipsajdak · 2023-04-21T12:16:25Z

@msadeqhe I did not have much experience with it as well. I can help.

JohelEGP · 2023-04-21T13:51:50Z

Preface lost in an accidentally deleted comment. Recovered:

So, I'd say, in the happy path:
The Cpp2 range-based for statement
label? 'for' expression next-clause? 'do' unnamed-declaration
is equivalent to (after lowering to Cpp1)
{
  label?
  for ( parameter-declaration : expression ) {
    { compound-statement }
    next-clause?
  }
}
where parameter-declaration and compound-statement
are the immediate productions that make up the unnamed-declaration.

That said, I'd prefer a description that doesn't rely on Cpp1. I'd like to think that you don't have to learn Cpp1 to learn Cpp2, before learning Cpp1, and such. Much like we say about teaching C++ and C.

JohelEGP · 2023-04-21T14:00:42Z

That said, I'd prefer a description that doesn't rely on Cpp1.

Granted, Cpp2 is grounded on Cpp1.

For this particular case, I meant not having to learn the Cpp1's range-based for, which is rewritten in terms of for, which is rewritten in terms of while, which is rewritten in terms of if and goto.

It's not as easy to come up with a pure Cpp2 description without goto. You'd have to verbally repeat what goto encapsulates.

hsutter · 2023-04-21T18:26:58Z

Thanks, now I see your point: Because the range-for loop already effectively treats its body as a function body executed once per loop iteration, I model it syntactically as such. But you are pointing out that with any other such function that appears at local scope (an unnamed expression function today, and a named local function in the future) it would have to capture to refer to locals, whereas the for loop body has direct access to locals.

You have hit the nail on the head with your pointing out the capture difference. You are touching directly on an intended unification that has driven the current design, and I think you're pointing out that I should tweak the for syntax to be consistent.

First, I'll summarize the general design. Then, let's talk about for...

From the (unpublished) Cpp2 design doc: Unifying functions and blocks

Here's a piece from my Cpp2 design doc that aims to unify functions and blocks, and where I've currently only implemented the first two parts... this is mostly identical to what I wrote a few years ago, tweaking a phrase or two here to touch it up as I write this:

A declaration within a scope, or in ( ) that precedes a scope, has the lifetime of that scope. For example, in all of the following cases x’s lifetime ends at the following } (implicit in the single-statement implicit-block cases):
f:(x: int = init) = { ... }		// x is a parameter to the function
f:(x: int = init) = statement;		// same, { } is implicit

 :(x: int = init) = { ... }		// x is a parameter to the lambda
 :(x: int = init) = statement;		// same, { } is implicit

  (x: int = init)   { ... }		// x is a “let parameter” to the block
  (x: int = init)   statement;		// same, { } is implicit

and for visual completeness I'll now add the simplest familiar case:

                    { ... }		// x is a “let parameter” to the block
                    statement;		// same, { } is implicit

(Recall that in Cpp2 : always and only means "declaring a new thing," and therefore also always has an = immediately or eventually to set the value of that new thing.)

The idea is to treat functions and blocks/statements uniformly, as syntactic and semantic subsets of each other:

A named function has all the parts: A name, a : (and therefore =) because we're declaring a new entity and setting its value, a parameter list, and a block (possibly an implicit block in the convenience syntax for single-statement bodies).
An unnamed function drops only the name: It's still a declared new entity so it still has : (and =), still has a parameter list, still has a block.
(not currently implemented) A parameterized block drops only the name and : (and therefore =). A parameterized block is not a separate entity (there's no : or =), it's part of its enclosing entity, and therefore it doesn't need to capture.
Finally, if you drop also the parameter list, you have an ordinary block.

In this model, the last (currently unimplemented) option above allow a block parameter list, which does the same work as "let" variables in other language, but without a "let" keyword. This would subsume all the Cpp1 loop/branch scope variables (and more generally because you could declare multiple parmeters easly which you can't currently do with the Cpp1 loop/branch scope variables)... quoting the next part from the design doc (note these for examples don't show declaring the loop variable, more on this below):

Note Because this works naturally for flow control statements, those no longer need a special syntax for if/switch/etc. scoped variables. For example:

(x:int = f()) if x>1		// same as C++17: if (int x = f(); x>1)
(x:int = f()) switch x 		// same as C++17: switch (int x = f(); x)
(x:int = f()) for x>1 next --x	// same as K&R C: for (int x = f(); x>1; --x)
(x:int = f()) for range do	// same as C++20: for (int x = f(); _ : range)
(x:int = f(), i:=0) for range do next ++i
                                // same as C++20: { int i=0; for (int x = f(); _ : range) { ++i;
(x:int = f()) while x>1		// not yet allowed for ‘while’ in C++20
(x:int = f()) do { } while x>1	// not yet allowed for ‘do’ in C++20
(x:int = f()) try			// not yet allowed for ‘try’ in C++20
				// note x is available inside the catch too

Last year, eagle-eyed readers will have noticed that the cppfront code had stubbed-in support for these "let" parameters, and I subsequently removed it "for now" because I wasn't quite ready to pursue the idea. In particular I worried about readability without something like "let" to introduce it, but I also wasn't yet willing to invent such a single-use keyword/concept without a compelling reason.

Maybe it's time to bring this back.

Implications for `for`

Coming back around now to for: Your point that capture is not needed is really evidence that the for loop body is in the third case... it really is a block with a parameter list, not really declaring a new function with a parameter list. If so, then that's an argument that for should not have a : or = (other than that the syntax is fine and consistent as above), i.e., to change the for loop syntax from this:

// right now
for items do: (item) = {
    x := local + 10;
    // ...
}

to this:

// alternative
for items do (item) {
    x := local + 10;
    // ...
}

(I think the do probably still has value. In both versions it adds nothing grammatically, only visually, but visually I think it's important because it makes it more readable.)

Then if I also supported parameters on blocks, which would allow this:

// a block with parameters
(value := local) { // an 'in' (read-only) parameter initialized from a local
    x := value + 10;
    // ...
}

then the for loop would be visually and semantically just like that and easy to teach and explain as "the for loop body is just an ordinary parameterized block" where the loop supplies the argument/initializer for each iteration. I'd expect that consistency to be easy to explain and teach... e.g., "local functions can capture because they're in (but not part of) the local scope, and blocks naturally don't capture because they're part of the local scope."

Thoughts?

hsutter · 2023-04-21T18:37:39Z

That said, I'd prefer a description that doesn't rely on Cpp1. I'd like to think that you don't have to learn Cpp1 to learn Cpp2, before learning Cpp1, and such. Much like we say about teaching C++ and C.

👍 Does the above accomplish that?

msadeqhe · 2023-04-21T19:08:28Z

Thanks. The new syntax looks great.

msadeqhe · 2023-04-21T19:10:02Z

Sorry. I accidently touched close with comment on my phone 😬.

hsutter · 2023-04-21T19:17:39Z

Also, over on #382, @JohelEGP mentioned this paper proposing do expressions.

Which makes me think, hmm, could using do alleviate my concern about introducing a special "let" keyword for parameterized blocks... i.e., what if:

// alternative 'for' syntax
for items do (item) {
    use( item, local_var );
    // ...
}

// a block with parameters: what if it also starts with 'do'?
do (item := f()) { // an 'in' (read-only) parameter (initializer is required)
    use( item, local_var );
    // ...
}

JohelEGP · 2023-04-21T19:21:37Z

Parametrized statements fit the for loop's body much better semantically.

filipsajdak · 2023-04-21T19:51:26Z

Thank you for explaining the disappearing let parameters from the code (I am one of the "eagle-eyed readers" :) ).

I was looking for a solution to solve issues from the following code:

it := blocks.begin(); while it != blocks.end() next it++ {
 // ...
}

I was annoyed that it outlived the loop where it was used.

From what I understand the above lines after let blocks will look like the following:

(it := blocks.begin()) while it != blocks.end() next it++ {
  // ...
}

Or with the do version:

do (it := blocks.begin()) while it != blocks.end() next it++ {
  // ...  //Is the similarity with do-while a problem here?
}

Looks maybe a little noisy. Maybe a version with an explicit block will be better:

(it := blocks.begin()) {
  while it != blocks.end() next it++ {
    // ...
  }
}

And with the do version:

do (it := blocks.begin()) {
  while it != blocks.end() next it++ { // is the similarity with do-while is a problem here?
    // ...
  }
}

I will try to port more code examples to see how it looks like and share some thought. I am very happy that there will be an easy way to limit the lifetime of the variables used in loops

hsutter · 2023-04-21T19:59:40Z

Maybe a version with an explicit block will be better:
(it := blocks.begin()) {
  while it != blocks.end() next it++ {
    // ...
  }
}

Exactly that example was something that made me hesitate, because it's not much different from the already-legal:

{
  it := blocks.begin();
  while it != blocks.end() next it++ {
    // ...
  }
}

and that made me hesitate about the value of the feature.

SebastianTroy · 2023-04-21T20:03:17Z

So would "next" become a keyword for standard for loops then? On 21 April 2023 19:27:21 Herb Sutter ***@***.***> wrote: Thanks, now I see your point: Because the range-for loop already effectively treats its body as a function body executed once per loop iteration, I model it syntactically as such. But you are pointing out that with any other such function that appears at local scope (an unnamed expression function today, and a named local function in the future) it would have to capture to refer to locals, whereas the for loop body has direct access to locals. You have hit the nail on the head with your pointing out the capture difference. You are touching directly on an intended unification that has driven the current design, and I think you're pointing out that I should tweak the for syntax to be consistent. First, I'll summarize the general design. Then, let's talk about for... From the (unpublished) Cpp2 design doc: Unifying functions and blocks Here's a piece from my Cpp2 design doc that aims to unify functions and blocks, and where I've currently only implemented the first two parts: A declaration within a scope, or in ( ) that precedes a scope, has the lifetime of that scope. For example, in all of the following cases x’s lifetime ends at the following } (implicit in the last case): f:(x: int = init) = { ... } // x is a parameter to the function f:(x: int = init) = statement; // same, { } is implicit :(x: int = init) = { ... } // x is a parameter to the lambda :(x: int = init) = statement; // same, { } is implicit (x: int = init) { ... } // x is a “let parameter” to the block (x: int = init) statement; // same, { } is implicit (Recall that in Cpp2 : always and only means "declaring a new thing," and therefore also always has an = immediately or eventually to set the value of that new thing.) The idea is to treat functions and blocks/statements uniformly, as syntactic and semantic subsets of each other: * A named function has all the parts: A name, a : (and therefore =) because we're declaring a new entity and setting its value, a parameter list, and a block (possibly an implicit block in the convenient syntax for single-statement bodies). * An unnamed function drops only the name: It's still a declared new entity so it still have : (and =), still has a parameter list, still has a block. * (not currently implemented) A block drops only the name and : (and therefore =): We have a parameterized block. It is not a separate entity (there's no : or =), it's part of its enclosing entity, and therefore it doesn't need to capture. * Finally, if you drop also the parameter list, you have an ordinary block. In this model, the last (currently unimplemented) option above allow a block parameter list, which does the same work as "let" variables in other language, but without a "let" keyword. This would subsume all the Cpp1 loop/branch scope variables (and more generally because you could declare multiple parmeters easly which you can't currently do with the Cpp1 loop/branch scope variables)... quoting the next part from the design doc (note these for examples don't show declaring the loop variable, more on this below): Note Because this works naturally for flow control statements, those no longer need a special syntax for if/switch/etc. scoped variables. For example: (x:int = f()) if x>1 // same as C++17: if (int x = f(); x>1) (x:int = f()) switch x // same as C++17: switch (int x = f(); x) (x:int = f()) for x>1 next --x // same as K&R C: for (int x = f(); x>1; --x) (x:int = f()) for range do // same as C++20: for (int x = f(); _ : range) (x:int = f(), i:=0) for range do next ++i // same as C++20: { int i=0; for (int x = f(); _ : range) { ++i; (x:int = f()) while x>1 // not yet allowed for ‘while’ in C++20 (x:int = f()) do { } while x>1 // not yet allowed for ‘do’ in C++20 (x:int = f()) try // not yet allowed for ‘try’ in C++20 // note x is available inside the catch too Last year, eagle-eyed readers will have noticed that the cppfront code had stubbed-in support for these "let" parameters, and I subsequently removed it "for now" because I wasn't quite ready to pursue the idea. In particular I worried about readability without something like "let" to introduce it, but I also wasn't yet willing to invent such a single-use keyword/concept without a compelling reason. Maybe it's time to bring this back. Implications for for Coming back around now to for: Your point that capture is not needed is really evidence that the for loop body is in the third case... it really is a block with a parameter list, not really declaring a new function with a parameter list. If so, then that's an argument that for should not have a : or = (other than that the syntax is fine and consistent as above), i.e., to change the for loop syntax from this: // right now for items do: (item) = { x := local + 10; // ... } to this: // alternative for items do (item) { x := local + 10; // ... } (I think the do probably still has value. In both versions it adds nothing grammatically, only visually, but visually I think it's important because it makes it more readable.) Then if I also supported parameters on blocks, which would allow this: // a block with parameters (value := local) { // an 'in' (read-only) parameter initialized from a local x := value + 10; // ... } then the for loop would be visually and semantically just like that and easy to teach and explain as "both are parameterized blocks." I'd expect that consistency to be easy to explain and teach... e.g., "local functions can capture because they're in (but not part of) the local scope, and blocks naturally don't capture because they're part of the local scope." Thoughts? — Reply to this email directly, view it on GitHub<#386 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AALUZQOMMTXD2Z63SYQEDILXCLGP7ANCNFSM6AAAAAAXFQABLQ>. You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>

filipsajdak · 2023-04-21T20:33:23Z

Will this let parameters require/allows setting argument passing type? That might be something not present in the following case:

{
  it := blocks.begin();
  while it != blocks.end() next it++ {
    // ...
  }
}

hsutter · 2023-04-21T20:38:09Z

Will this let parameters require/allows setting argument passing type?

Yes, that's the idea.

So would "next" become a keyword for standard for loops then?

Do you mean something like this? This works now:

main: (args) = {
    local := 10;
    for args next local++ do: (arg) = {
        x := local + 10;
        std::cout << x << "\n";
    }
}

Invoked with test.exe a b this prints:

20
21
22

msadeqhe · 2023-04-21T21:59:33Z

Also, over on #382, @JohelEGP mentioned this paper proposing do expressions.

Which makes me think, hmm, could using do alleviate my concern about introducing a special "let" keyword for parameterized blocks... i.e., what if:
// alternative 'for' syntax
for items do (item) {
    use( item, local_var );
    // ...
}

// a block with parameters: what if it also starts with 'do'?
do (item := f()) { // an 'in' (read-only) parameter (initializer is required)
    use( item, local_var );
    // ...
}

If we consider it with if/while/... then it's for variable initialization inside the statement block. I think it's different from do (params) { ... } in for loop which passes arguments to the statement block. Therefore choosing let keyword seems more natural for its purpose. Both let (inits) and do (params) can coexist (be used together) in one control structure:

let (item: = f()) {
    user (item, local_var);
    // ...
}

let (item: = f()) if item.is_valid() {
    // ...
}

let (local: = 10) for args next local++ do (arg) {
    // ...
}

let (local: = 10) {
    // ...
} while local > 0;

let (i: = 0) while i < 10 next i++ {
    // ...
}

Interestingly let-while-next is similar to for (init; condition; step) { ... } in Cpp1. Also let-while is complementary to do-while. Maybe let (inits) is a little more readable and it can be used in reflections in comparison to { inits; ... }. In a nutshell:

do (params) { ... } for passing arguments to parameterized blocks.
let (inits) LANGUAGE-CONSTRUCT { ... } for variable initializations in control structures and parameterized blocks.

Both of them are parameterized statement blocks, and they can be used together, e.g. let (inits) for args do (arg) { ... }.

msadeqhe · 2023-04-22T07:35:14Z

Maybe a version with an explicit block will be better:
(it := blocks.begin()) {
  while it != blocks.end() next it++ {
    // ...
  }
}
Exactly that example was something that made me hesitate, because it's not much different from the already-legal:
{
  it := blocks.begin();
  while it != blocks.end() next it++ {
    // ...
  }
}
and that made me hesitate about the value of the feature.

That's right, and let is going to be just a syntax sugar (perhaps more readable which is opinion-based, BTW it's a separation of semantic which means it is used to construct while loop):

let (it := blocks.begin()) while it != blocks.end() next it++ {
    // ...
}

hsutter · 2023-04-23T16:52:32Z

Thanks everyone for all the comments, and especially for pointing out the key things that prompted me to finally bring back and implement the function/block unification I've always wanted, including block/statement scope parameter lists, and removing : and = from the for loop syntax.

Now that I've implemented block/statement scope parameter lists and tried using them in a ~dozen test examples, I do find them to be as semantically useful as I had hoped, which is shown in this new test case where it feels nice to be able to directly declare intended side effects of a statement/loop/block of code. For now I'm limiting it to in and inout parameters, but some notes:

move is that one that would make the most sense, to express a statement/block that consumes its move parameter's initializer, but I haven't got around to making that work yet;
out is of questionable value... it would express a statement/block that can construct the parameter's initializer, but (a) the variable (local or parameter) that it would initialize is already in scope and therefore fully covered by the existing initialization guarantee logic, so I'm not sure of the benefit of supporting this extra indirection if it merely amounts to renaming (which could be a negative thing, if it's really just code obfuscation), and (b) I suspect that would be a bunch more work in sema and I don't want to be distracted with that right now
forward would express a statement/block that forwards the initializer, but to make sense that initializer would have to be some function call expression, not a local variable (which we can already see the definition of, so forwarding isn't needed, similarly to the reason why I don't think the out flavor is probably all that useful)

Re migration: The for loop change is well justified and is just removing two characters, but it's also the first significant syntax breaking change. Because I showed that syntax in a talk/documentation, I've added the first compiler migration diagnostic so that people who try the old syntax should see a helpful message that the syntax has changed and what to write instead. Also, I need to update the example(s) on Godbolt [update: fortunately it uses while and so isn't broken]. There are costs to breaking changes even when there are no production users yet... 😁

Feedback always welcome, especially bugs (e.g., does something not work) and usability (e.g., does the migration message fire in most cases). Thanks again.

msadeqhe · 2023-04-24T05:18:28Z

Thanks. I like how the following is similar to for (init; condition; step) ...:

(i: = 0) while i < 10 next i++ {
    std::cout << i << "\n";
}

EDIT

Thanks @ntrel about the issue with my example. This is working now:

(copy i: = 0) while i < 10 next i++ {
    std::cout << i << "\n";
}

ntrel · 2023-04-24T13:18:34Z

(i: = 0) while i < 10 next i++ {

That needs inout i for mutability, though it doesn't work yet: #393.

(inout i := 0) while i < 10 next i++ {

I think it would read a bit nicer if inout was spelled mut.

…that syntax to `for` (remove `:` and `=`), closes hsutter#386

msadeqhe added the suggestion label Apr 20, 2023

hsutter closed this as completed Apr 20, 2023

hsutter reopened this Apr 21, 2023

msadeqhe closed this as completed Apr 21, 2023

msadeqhe reopened this Apr 21, 2023

hsutter mentioned this issue Apr 21, 2023

[SUGGESTION] User-defined Language Constructs #382

Closed

This comment was marked as outdated.

Sign in to view

This comment was marked as off-topic.

Sign in to view

hsutter closed this as completed in 928186e Apr 23, 2023

msadeqhe mentioned this issue Apr 23, 2023

[SUGGESTION] Statement-expressions, result vs return #391

Closed

JohelEGP mentioned this issue Apr 24, 2023

[BUG] inout parameterized statement generates wrong Cpp1 #393

Closed

JohelEGP referenced this issue May 7, 2023

Work around a C++20 compiler's range-for scoped variable bug/ICE

2c64707

JohelEGP mentioned this issue May 23, 2023

[SUGGESTION] Typed Expressions; generalized constructors, UDLs, unnamed variables and functions #463

Closed

JohelEGP mentioned this issue Jul 14, 2023

[BUG] _braced-init-list_ argument in Cpp2 #542

Open

JohelEGP referenced this issue Sep 26, 2023

Minor enum and union tidying

26a4cbf

JohelEGP mentioned this issue Nov 12, 2023

Const by default #815

Open

msadeqhe mentioned this issue Nov 14, 2023

[BUG] == and = in declarations are confusing. #824

Open

zaucy pushed a commit to zaucy/cppfront that referenced this issue Dec 5, 2023

Add statement (including block) scope in/inout parameters, and apply …

9b5c36c

…that syntax to `for` (remove `:` and `=`), closes hsutter#386

dutkalex mentioned this issue May 13, 2024

[QUESTION] cpp2 for loops #1063

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SUGGESTION] `for` loop is ambiguous with lambda syntax #386

[SUGGESTION] `for` loop is ambiguous with lambda syntax #386

msadeqhe commented Apr 20, 2023 •

edited

Loading

JohelEGP commented Apr 20, 2023

msadeqhe commented Apr 20, 2023 •

edited

Loading

JohelEGP commented Apr 20, 2023 •

edited

Loading

msadeqhe commented Apr 20, 2023 •

edited

Loading

JohelEGP commented Apr 20, 2023

hsutter commented Apr 20, 2023

msadeqhe commented Apr 20, 2023

hsutter commented Apr 20, 2023 •

edited

Loading

filipsajdak commented Apr 20, 2023

msadeqhe commented Apr 21, 2023

msadeqhe commented Apr 21, 2023

filipsajdak commented Apr 21, 2023

JohelEGP commented Apr 21, 2023 •

edited

Loading

JohelEGP commented Apr 21, 2023

hsutter commented Apr 21, 2023 •

edited

Loading

hsutter commented Apr 21, 2023

msadeqhe commented Apr 21, 2023

msadeqhe commented Apr 21, 2023

hsutter commented Apr 21, 2023 •

edited

Loading

JohelEGP commented Apr 21, 2023

filipsajdak commented Apr 21, 2023

hsutter commented Apr 21, 2023 •

edited

Loading

SebastianTroy commented Apr 21, 2023 via email

filipsajdak commented Apr 21, 2023

hsutter commented Apr 21, 2023 •

edited

Loading

msadeqhe commented Apr 21, 2023 •

edited

Loading

This comment was marked as outdated.

This comment was marked as off-topic.

This comment was marked as off-topic.

msadeqhe commented Apr 22, 2023 •

edited

Loading

This comment was marked as off-topic.

hsutter commented Apr 23, 2023 •

edited

Loading

msadeqhe commented Apr 24, 2023 •

edited

Loading

ntrel commented Apr 24, 2023 •

edited

Loading

[SUGGESTION] for loop is ambiguous with lambda syntax #386

[SUGGESTION] for loop is ambiguous with lambda syntax #386

Comments

msadeqhe commented Apr 20, 2023 • edited Loading

Preface

How to reproduce?

Suggestion Detail

Why do I suggest this change?

Your Questions

Considered alternatives

JohelEGP commented Apr 20, 2023

msadeqhe commented Apr 20, 2023 • edited Loading

JohelEGP commented Apr 20, 2023 • edited Loading

msadeqhe commented Apr 20, 2023 • edited Loading

JohelEGP commented Apr 20, 2023

hsutter commented Apr 20, 2023

msadeqhe commented Apr 20, 2023

hsutter commented Apr 20, 2023 • edited Loading

filipsajdak commented Apr 20, 2023

msadeqhe commented Apr 21, 2023

Conclusion

msadeqhe commented Apr 21, 2023

filipsajdak commented Apr 21, 2023

JohelEGP commented Apr 21, 2023 • edited Loading

JohelEGP commented Apr 21, 2023

hsutter commented Apr 21, 2023 • edited Loading

From the (unpublished) Cpp2 design doc: Unifying functions and blocks

Implications for for

hsutter commented Apr 21, 2023

msadeqhe commented Apr 21, 2023

msadeqhe commented Apr 21, 2023

hsutter commented Apr 21, 2023 • edited Loading

JohelEGP commented Apr 21, 2023

filipsajdak commented Apr 21, 2023

hsutter commented Apr 21, 2023 • edited Loading

SebastianTroy commented Apr 21, 2023 via email

filipsajdak commented Apr 21, 2023

hsutter commented Apr 21, 2023 • edited Loading

msadeqhe commented Apr 21, 2023 • edited Loading

This comment was marked as outdated.

This comment was marked as off-topic.

This comment was marked as off-topic.

msadeqhe commented Apr 22, 2023 • edited Loading

This comment was marked as off-topic.

hsutter commented Apr 23, 2023 • edited Loading

msadeqhe commented Apr 24, 2023 • edited Loading

EDIT

ntrel commented Apr 24, 2023 • edited Loading

[SUGGESTION] `for` loop is ambiguous with lambda syntax #386

[SUGGESTION] `for` loop is ambiguous with lambda syntax #386

msadeqhe commented Apr 20, 2023 •

edited

Loading

msadeqhe commented Apr 20, 2023 •

edited

Loading

JohelEGP commented Apr 20, 2023 •

edited

Loading

msadeqhe commented Apr 20, 2023 •

edited

Loading

hsutter commented Apr 20, 2023 •

edited

Loading

JohelEGP commented Apr 21, 2023 •

edited

Loading

hsutter commented Apr 21, 2023 •

edited

Loading

Implications for `for`

hsutter commented Apr 21, 2023 •

edited

Loading

hsutter commented Apr 21, 2023 •

edited

Loading

hsutter commented Apr 21, 2023 •

edited

Loading

msadeqhe commented Apr 21, 2023 •

edited

Loading

msadeqhe commented Apr 22, 2023 •

edited

Loading

hsutter commented Apr 23, 2023 •

edited

Loading

msadeqhe commented Apr 24, 2023 •

edited

Loading

ntrel commented Apr 24, 2023 •

edited

Loading