Skip to content

Latest commit

 

History

History
297 lines (210 loc) · 10.2 KB

20221018-change-fun-type-syntax.md

File metadata and controls

297 lines (210 loc) · 10.2 KB
status flip authors sponsor updated
implemented
43
Naomi Liu (naomi.liu@dapperlabs.com)
Naomi Liu (naomi.liu@dapperlabs.com)
2022-10-27

FLIP 43: Change the syntax for function types

Objective

Modify/add syntax for representing function types.

Motivation

Currently, functions are declared and referred to with differing syntax. For instance,

// function declaration
fun foo(_ x: Int): String {...}
// function expression bound to a name
let bar = fun (x: Int): String {...}

// a variable declaration with a function value
// note the lack of the `fun` keyword when referring to its type
let baz: ((Int): String) = foo

This convention for representing functions gets even more confusing when higher-order functions are brought into the mix:

// cadence type: ((Int): ((Int): ((Int): Int)))
// pretty type: Int -> Int -> Int -> Int

fun add3(_ x: Int): ((Int): ((Int): Int)) {
    return fun (y: Int): ((Int): Int) {
        return fun (z: Int): Int {
            return x + y + z
        }
    }
}

// cadence type: (([Int], ((Int): String)): [String])
// pretty type: [Int] -> (Int -> String) -> [String]
fun mapIntString(list: [Int], f: ((Int): String)): [String] {...}

Another extreme example is the type (({Int: String}): {Int: Int}). To a reader who isn't familiar with Cadence, it's not immediately obvious that this type means "a function taking a dictionary of Ints to Strings, returning a dictionary of Ints to Ints".

User Benefit

The existing syntax is inconsistent and unfamiliar, ultimately harming readability of Cadence code. Users would benefit from type signatures with clearer intent.

Design Proposal

Rename function types to require the fun keyword.

Given that fun is already a reserved keyword for declaring functions, requiring the keyword in the type signature of a function is more consistent with the existing syntax. For instance,

```cadence
// (AnyStruct -> Void) -> AnyStruct -> Void
fun apply(f: ((AnyStruct): Void), x: AnyStruct) {...}

// after
fun apply(f: fun (AnyStruct): Void, x: AnyStruct) {...}
```

This could be implemented as a non-breaking change by still allowing the old syntax of omitting fun, but preferring the keyword in new contracts and transactions. The stringification of function types should be updated to use the new syntax.

For type signatures ascribed to variables, we currently rely on the colon : token in order to parse the type as a function. This requires the type to be wrapped up in parentheses to avoid parsing ambiguity. Reusing a previous example:

let bar = fun (x: Int): String {...}

// a variable declaration with a function value
let baz: ((Int): String) = foo

If we continued to elide the fun keyword in function types, parentheses could still be eliminated by making : a right-associative operator with a higher precedence than =.

Currently, a function type signature is defined by the following grammar:

functionType
    : '('
         '(' ( typeAnnotation ( ',' typeAnnotation )* )? ')'
         ':' typeAnnotation
      ')'

(* example: ((Int, @AnyResource): @AnyResource) *)

Where the surrounding parentheses are required. By instead treating : as an operator, the rule is simplified to:

functionType
    : '(' ( typeAnnotation ( ',' typeAnnotation )* )? ')'
      ':' typeAnnotation

At the same time however, how do we express a function that returns Void (()) with this syntax? We currently allow authors to omit a return type in function declarations, which causes the type to default to Void. Typing the function still requires one to explicitly name Void:

fun noop() {} // return type is inferred to be `Void`

let noop_: ((): Void) = noop // return type is required because we use `:` as a marker token

Using the fun keyword would simplify parsing rules and also allow for an author to omit the return type when writing signatures for procedures, as we already permit when declaring them.

let noop_: fun() = noop // unambiguous, since `fun` is our marker now instead of `:`

fun apply2(f: fun (Int), g: fun (Int), x: Int) {...}

let baz: fun (Int): String = bar

The adjusted grammar would only affect functionType:

functionType:
    'fun' '(' ( typeAnnotation ( ',' typeAnnotation )* )? ')'
         ( ':' typeAnnotation )?

Drawbacks

To the best of my knowledge, this change should not introduce any ambiguities into the language grammar. There is currently an ambiguous case for function expressions that return restricted types:

interface Foo {...}

let f = fun (): AnyStruct{Foo}

In this case, it is unclear to the parser whether f is a function that returns a restricted AnyStruct{Foo} without a body, or if it returns an unrestricted AnyStruct and whose body is the single expression Foo. We currently step around this issue by requiring restricted types' curly brackets to follow immediately after the parent type without any whitespace:

fun (): AnyStruct{Foo} // return type is a restricted type, no body
fun (): AnyStruct {Foo} // return type is AnyStruct, body is Foo

This ambiguity does not extend to types however, and is outside the scope of this FLIP. Using the proposed syntax, we could still write

let f: fun (): AnyStruct{Foo} = fun (): AnyStruct{Foo} {...}

Alternatives Considered

An alternative to using fun to denote function types is to introduce another keyword, such as => or ->. This would be more familiar to users coming from languages such as JavaScript, Swift, and Kotlin, while remaining as a non-breaking change due to these operators being unreserved currently. For example,

// before
fun apply(f: ((AnyStruct): Void), x: AnyStruct): Void {...}
let _apply: (((AnyStruct): Void, AnyStruct): Void) = apply

// proposed syntax with `fun`
fun apply(f: fun (AnyStruct): Void, x: AnyStruct): Void {...}
let _apply: fun (fun (AnyStruct): Void, AnyStruct): Void = apply

// alternative syntax with '->'
fun apply(f: AnyStruct -> Void, x: AnyStruct): Void {...}
let _apply: (AnyStruct -> Void, AnyStruct) -> Void = apply

The main benefit of this alternative syntax would be the unambiguous parsing and lower visual noise compared to using fun. This would still introduce another inconsistency between how functions are declared and how they're typed, but it's more intuitive at a glance to unfamiliar readers. Introducing function arrows in this way also opens the door to an alternative syntax for denoting closures that doesn't require an annotated return type or return statement, which could be useful for simple one-liner functions.

Introducing a separate operator for return types still does not eliminate the previously-mentioned ambiguity in parsing function expressions, however:

// still ambiguous, but looks cleaner because ':' in declarations
// now only indicates a type assignment to a variable
let f: fun () -> AnyStruct{Foo} = fun () -> AnyStruct{Foo}

Performance Implications

Function types written with either syntax will create identical parse trees, so the performance impact is expected to be limited to another conditional branch in the parser.

Engineering Impact

Given that first-class functions are a very small part of onchain contracts and the standard library in its current form, no significant impacts should occur. Any required changes are expected to be unit-testable and relatively encapsulated.

Best Practices

If we decide to still allow the old syntactic forms, then the new syntax should be reocommended for any new code.

Tutorials and Examples

Tutorials and documentation should be updated to use the new syntax.

Compatibility

This can be implemented as a non-breaking change, requiring no intervention from developers to update existing code.

User Impact

Function types will be printed using the new syntax in the REPL, error messages, and language server. If we still continue to allow the old syntax, then no migrations or inverventions are needed and this change can be rolled out independently of Stable Cadence.

Related Issues

If we use a syntax other than fun for denoting function types, such as an arrow ->, the new keyword could be used to implement a shorthand for declaring closures.

Prior Art

Most languages with first-class functions use similar language for referring to those functions. For instance,

// scala
val f: Int => Int = {n => n * 2}
-- haskell
let f :: Int -> Int
    f = \n -> n * 2
// swift
// note that using the arrow allows for the `func` keyword
// to be omitted from function types
let f: (Int) -> Int = func(_ n: Int) -> Int {
    return n * 2
}
// typescript
let f: (n: number) => number = n => n * 2;
// go
var f func(int) int = func(int) int {
    return 2
}

In the case of omitting a return type in the signatures of procedures, the main distinction between languages that provide this feature and languages that don't, is whether they denote function types with an infix arrow (->) or a prefixed keyword (fun, func, fn, etc). Some languages allow the return type to be omitted in a function's declaration, but still require it to be annotation in its corresponding type signature.

For instance:

// scala
def noop() = {}
// with return type:
def noop(): Unit = {}
// type written out
val noop_: () => Unit = noop
-- haskell
noop :: () -> ()
noop _ = ()
// rust
fn noop() {}
// with return type
fn noop() -> () {}
// type written out
let noop_: fn() = noop
// the return type can be explicitly supplied
let noop_: fn() -> () = noop
// swift
func noop() {}
// with return type:
func noop() -> Void {}
// type written out
let noop_: () -> Void = noop
// go
func noop() {}
// the only way to denote procedures in golang is to omit a return type
var noop_ func() = noop

Questions and Discussion Topics

Is the existing syntax fine, i.e. do people actually care?

Is the issue large enough to warrant adding another keyword to the language?

Will adding the fun keyword lead to intractable parse ambiguities or break existing contracts?

Will changing the syntax to use an operator improve readability, or just introduce another source of inconsistency to the language?