Fix #4867 Union type inference is too eager to widen unions #7829

landerlo · 2019-12-20T23:10:50Z

This enhancement to union inference follows @smarter's suggested approach: preserve the union if there is an explicit union ascription; otherwise widen it with OrType.join.

Uses the very naive but effective approach of lexically checking the untyped tree for Or operators or TypedSplices when the Typer is widening the unions.

The fix only touches that widening point to introduce the lexical check. If the Or operator is found on the type tree of the term, or the DefDef type tree is a TypedSplice, this is taken as a hint of an explicit union ascription, either a direct union ascription or a union ascription in a function signature. This heuristic works well for the case in @Baccata's code and for other union type inference issues in my own exploration.

IMO the non-rigorous approach is justified as this is a very targeted check that simply defers the widening of the union type. In the worst case, the union would be joined at a later time. I couldn't foresee any negative impacts by not joining the OrType as this check is only performed when the typer has detected the OrType and calls widenUnion. In case some ascriptions are missed by this check this would only mean the join is performed eagerly keeping the current behaviour.

We get a more expressive union type inference with no cost, as the untyped tree check won't impact performance or require code changes that could interact with the typer in unexpected ways.
The current checks, namely whether there is an Or operator in the tpt of a ValDef and whether the tpt of a DefDef is a TypedSplice, cover the explicit union ascriptions of terms and function signatures.
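The lexical check described above can be sketched on a toy tree model. This is only an illustration: `Tree`, `Ident`, `InfixOp`, `TypedSplice`, `ValDef`, `DefDef` and both helper functions below are simplified, hypothetical stand-ins and not the compiler's actual untyped tree API. The sketch also assumes the check looks only at the top of the type tree.

```scala
object LexicalCheckSketch {
  // Toy stand-ins for untyped trees; NOT the real dotty classes.
  sealed trait Tree
  case class Ident(name: String) extends Tree
  case class InfixOp(left: Tree, op: String, right: Tree) extends Tree
  case class TypedSplice(description: String) extends Tree
  case object EmptyTree extends Tree // no type ascription was written

  sealed trait Def { def tpt: Tree }
  case class ValDef(name: String, tpt: Tree) extends Def
  case class DefDef(name: String, tpt: Tree) extends Def

  // Hint of an explicit union: an `|` at the top of a ValDef's tpt,
  // or a DefDef whose tpt is a TypedSplice (the heuristic in the PR text).
  def hasExplicitUnionHint(d: Def): Boolean = d match {
    case ValDef(_, InfixOp(_, "|", _)) => true
    case DefDef(_, _: TypedSplice)     => true
    case _                             => false
  }

  // Widening is deferred only when the hint is present.
  def shouldWidenUnion(d: Def): Boolean = !hasExplicitUnionHint(d)
}
```

In this model a definition written as `val x: B | C` carries the hint, while a definition without an ascription does not, so the existing join still applies to it.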

dottybot

Hello, and thank you for opening this PR! 🎉

All contributors have signed the CLA, thank you! ❤️

Have an awesome day! ☀️

sjrd · 2019-12-21T04:25:02Z

Thanks for your PR. I don't think this is the right approach, though. The fact that the following still doesn't work is a clear sign to me:

val x: B | C = ...
val z = x // We lose the Union ascription as it's not explicit
val error: B | C = z // error as z has lost union ascription

We should actually tag every OrType with a flag that tells whether it was written down or created by the compiler. Then, only widen the ones that were initially created by the compiler. With that strategy, when inferring the type of z, it would receive the type of x which is an OrType with the flag "written by user" on, and hence it would not be widened. It would even keep that flag (since it receives the very same OrType) which means that this can be changed multiple times.

landerlo · 2019-12-21T09:13:51Z

I agree that that would be a more desirable end state, but to flag the provenance would require more changes and we would need to deal with the situation of a union growing to too many branches in the disjunction, something that @odersky always tried to avoid. Forcing the union to always be explicit through the ascription deals with that issue in a simple way. If we allow an originally explicit union to flow and grow even when not explicitly written by the user we would need to find other strategies, e.g. having a cut-off size where the union is widened if the disjunction goes over N branches. Again, this seems to require some more discussion.
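The cut-off strategy mentioned above could look roughly like the following. This is a hypothetical sketch on a toy type model: `Tpe`, `Cls`, `Or`, `AnyTpe`, `branches` and `widenIfTooLarge` are invented for illustration, and widening to `AnyTpe` stands in for whatever the real join would compute.

```scala
object CutOffSketch {
  // Toy type model; NOT the compiler's type hierarchy.
  sealed trait Tpe
  case class Cls(name: String) extends Tpe
  case object AnyTpe extends Tpe
  case class Or(left: Tpe, right: Tpe) extends Tpe

  // Number of leaves in the disjunction.
  def branches(t: Tpe): Int = t match {
    case Or(l, r) => branches(l) + branches(r)
    case _        => 1
  }

  // Keep the union while it has at most maxBranches leaves;
  // beyond that, widen it (AnyTpe is an assumed stand-in for the join).
  def widenIfTooLarge(t: Tpe, maxBranches: Int): Tpe =
    if (branches(t) > maxBranches) AnyTpe else t
}
```

With a cut-off of 3, `A | B | C` would be kept while `A | B | C | D` would be widened.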

I'm happy to keep exploring that route and I would love for us to have that discussion, as I am a great believer in the potential of union types.

But for the moment I think that merging is a reasonable incremental step, as it gives an instant incremental benefit and the reverting cost is zero as there are no code changes other than that targeted pattern match and widening deferring if.

This simple change unlocks interesting usages of union types; we've seen people run into this issue. Those extended usages can inform further evolution into a more principled approach, like tracking the provenance and making that ascribed union origin flow into other unions that haven't been explicitly written, in case the dotty team wants to follow that route.

sjrd · 2019-12-21T09:22:45Z

> I agree that that would be a more desirable end state, but to flag the provenance would require more changes and we would need to deal with the situation of a union growing to too many branches in the disjunction, something that @odersky always tried to avoid. Forcing the union to always be explicit through the ascription deals with that issue in a simple way. If we allow an originally explicit union to flow and grow even when not explicitly written by the user we would need to find other strategies, e.g. having a cut-off size where the union is widened if the disjunction goes over N branches. Again, this seems to require some more discussion.

An originally written union could indeed flow without further explicit types. However it would never be able to grow without explicitly writing the bigger union. Therefore those concerns are also addressed at the core with what I suggest.

Note also that what I suggest has been discussed with @odersky and @smarter, and both agreed that it was a likely solution to all our issues with inference of union types.

> But for the moment I think that merging is a reasonable incremental step, as it gives an instant incremental benefit and the reverting cost is zero as there are no code changes other than that targeted pattern match and widening deferring if.

I disagree, but it's not my call to make so you don't need to convince me. Someone from the core dotty team will have to make that call.

landerlo · 2019-12-21T11:14:15Z

Thank you for the feedback.

> An originally written union could indeed flow without further explicit types. However it would never be able to grow without explicitly writing the bigger union

I still don't see how the flowing of the original union won't grow into bigger unions not explicitly written if we don't widen based on the flag saying there was an explicitly written union at the origin of the flow.

If the term with a union type goes through expressions that grow the union surely the union might grow into unions not explicitly written. e.g.:

val x: A | B | C = ???
val y: D | E | F = ???

val z = if (true) x else y

Of course this can be controlled if we introduce additional widening cases in addition to checking the flag, but I am unaware of that logic having already been discussed in that level of detail. I must have overlooked it.

sjrd · 2019-12-21T13:11:30Z

Let's note | a union type that was written, and |! a union type that was created by the compiler.

The result of the if expression that you wrote would then be (A | B | C) |! (D | E | F). When deciding the inferred type of z, the compiler would notice that the top-level union was created by the compiler and would therefore widen it. The union would not grow.
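The `|` versus `|!` distinction can be modelled with a provenance flag on a toy union type. This is a minimal sketch under stated assumptions: `Tpe`, `Cls`, `Or`, `AnyTpe`, `join`, `widenUnion` and `ifType` are hypothetical names, not the compiler's API, and `join` is assumed to always produce `AnyTpe` for simplicity.

```scala
object ProvenanceSketch {
  // Toy type model; NOT the compiler's type hierarchy.
  sealed trait Tpe
  case class Cls(name: String) extends Tpe
  case object AnyTpe extends Tpe
  // written = true models `|` (written by the user);
  // written = false models `|!` (created by the compiler).
  case class Or(left: Tpe, right: Tpe, written: Boolean) extends Tpe

  // Stand-in for the real join; assumed to always yield AnyTpe here.
  def join(t: Or): Tpe = AnyTpe

  // Widen only a top-level union that the compiler created.
  def widenUnion(t: Tpe): Tpe = t match {
    case or @ Or(_, _, false) => join(or)
    case other                => other
  }

  // Type of `if (c) a else b`: a compiler-created union of both branches.
  def ifType(thenTpe: Tpe, elseTpe: Tpe): Tpe =
    Or(thenTpe, elseTpe, written = false)
}
```

In this model, a written `A | B | C` passes through `widenUnion` unchanged, so it can flow into an unascribed definition, while `(A | B | C) |! (D | E | F)` is widened because its top-level union has `written = false`.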

landerlo · 2019-12-21T20:17:57Z

I see your point: widening only when the union grows.
This would result in the following new semantics:

val x: A | B | C = ???
val y: D         = ???

val z = x                    // Infers A | B | C
val zz = if (true) x else y  // We lose inference and get Any

The loss of inference for zz is still not completely satisfactory, but I agree this would be an improvement over my current fix.
I was wary of modifications to the core classes, but I'll check what the implications of adding the flag to OrType would be.

LPTK · 2019-12-30T18:01:18Z

If we go down the route suggested by @sjrd, wouldn't it make sense to generalize it to all other widening behaviors in Scala?

For instance, making the following work:

scala> val x: 1 = 1
val x: 1 = 1

scala> val y = x
val y: Int = 1

scala> val z: 1 = y
1 |val z: 1 = y
  |           ^
  |           Found:    (y : Int)
  |           Required: (1 : Int)

If we do not do that in general, then I feel like it's weird to do it only for unions, and I would actually find the behavior originally proposed in this PR less confusing. Indeed, it would mean there is only one automatic widening behavior, instead of two.

landerlo · 2020-01-03T21:44:28Z

I agree it'd be desirable to make it as general as possible, and for inference to flow through assignments. I wanted to keep the scope minimal, as the simple change in the PR unblocks some interesting union usages I've been experimenting with. Keeping the provenance as @sjrd suggests would require more refactoring and a change to OrType to add the explicit ascription flag. I'm not sure about the widening of the literal types you're bringing up. Maybe @smarter could advise on the preferred approach.

smarter · 2020-01-19T01:07:11Z

Thanks a lot for your PR, I love to see more people experimenting with type inference! :) I broadly agree with @sjrd and @LPTK that what we ultimately want is a more general mechanism which does its best to not widen a user-written type, even if that type ends up in some other place through type inference. (We currently only widen unions and singletons, but if we can make widening better then maybe we could actually go further and also widen the children of a sealed trait or enum which would be really nice I think).

Looking at the cases where inference is improved by your PR, it seems they can be minimized to something like:

class X
class Y
val x: X | Y = identity(if (true) new X else new Y)

but since the logic uses untyped trees it won't help with more complex result types:

class X
class Y

type Id[T] = T
val x: Id[X | Y] = identity(if (true) new X else new Y) // error

On the other hand, if the result type of a val is a union, this PR will keep all unions inferred in the rhs of that val even if they're unrelated to the result type. This can have consequences on implicit search, for example:

class Inv[T]
class Base
class X extends Base
class Y extends Base

implicit def invBase: Inv[Base] = new Inv[Base]

def getInv[T](x: T)(implicit inv: Inv[T]): Int = 1

// ok on master and this PR
val a: Int = getInv(if (true) new X else new Y)
// ok on master, "no implicit argument of type Inv[X | Y]" with this PR
val b: Int | Any = getInv(if (true) new X else new Y)

Anyway, I think there must be an underlying bug in master here, because even though we widen singleton types in a similar way, the following currently works fine:

val x: 1 = identity(1)
type Id[T] = T
val x: Id[1] = identity(1)

I'd like to take a closer look at widening and type inference in the near future, but before I can get to that, I need to work on some deep bugs in our constraint solving mechanism, so I can't promise any quick fixes.

landerlo · 2020-01-23T22:32:42Z

Thank you for the review and feedback. I agree that more generality is desirable, and I'll try to put some more thought into it. The driving motivation for allowing the cases currently supported in this PR is my experimentation with some alternative encodings with union types; examples are in https://github.com/landerlo/fscala19-algapp/ if anybody is interested.


Improves the inference of union types by preserving the union when there is a type ascription with a union. If the term has not been explicitly ascribed with a union, the existing semantics of joining the OrType is maintained. To determine whether an explicit union ascription exists, a lexical check on the untyped tree is performed.


OlivierBlanvillain · 2020-03-26T17:42:04Z

This PR has been inactive for 2 months so I'll close it for now. @landerlo feel free to reopen in case of further development!

smarter · 2020-04-03T13:02:22Z

Good news: all the examples in this PR should work in the latest nightly thanks to #8635!

simlei · 2022-05-14T12:52:56Z

Is this issue still on your radar, although it's closed?

smarter · 2022-05-14T13:04:37Z

Which issue? As I noted in my previous comment the examples in this PR should now work. But there are other issues open related to union widening in this repository.

dottybot reviewed Dec 20, 2019


landerlo mentioned this pull request Dec 20, 2019

Type parameter inference is too eager to widen union types #4867

Closed

landerlo force-pushed the fix-#4867 branch 3 times, most recently from 0f1b479 to c7cf9e9 Compare December 28, 2019 22:15

landerlo force-pushed the fix-#4867 branch from c7cf9e9 to 218656c Compare January 1, 2020 19:16

landerlo force-pushed the fix-#4867 branch 3 times, most recently from 2708769 to a4b7f1e Compare January 16, 2020 20:56

liufengyun and others added 11 commits February 5, 2020 11:55

Fix scala#8203: handle intersection type in parent registration

3c9f377

An enum value may have the type `A & B`, in such cases we need to register for both `A` and `B`.

Fix scala#8333: Check for duplicate symbols in exports

4ae6522

Check for duplicate symbols when creating export forwarders.

Fix scala#8355: REPL tests : fix for two tests failing on Windows

dc8a4d8

small code change to address review

b6737a8

Merge pull request scala#8356 from michelou/dotty-multiline

f51bf1b

Fix scala#8355: REPL tests : fix for two tests failing on Windows

Merge pull request scala#8332 from dotty-staging/fix-#7597

93f65c6

Fix scala#7597: Refine checks whether a deferred term member is implemented

Merge pull request scala#8341 from dotty-staging/fix-#8333

811dc19

Fix scala#8333: Check for duplicate symbols in exports

doc(multi-staging): fix typos

d7ee473

doc(multi-staging): more typos

946604b

Disable Mill libraries

c66a33e

GH actions don't digest them well fttb

robstoll and others added 25 commits February 26, 2020 06:50

doc(creator apply): typos

91bdeef

doc(export): add link to note

9964ec8

Fix lampepfl/dotty-knowledge#30

92a3952

When a community test fails, re-run it 2 more times to avoid the whole suite failure.

Merge pull request scala#8382 from Uko/community-test-rerun

764a23e

Fix lampepfl/dotty-knowledge#30: Re-run failing community tests

Fix lampepfl/dotty-knowledge#17

d7e0212

Added a scripted sbt test to check if a @main annotation is detected by sbt

Merge pull request scala#8384 from Uko/test-sbt-method-annotation

6ac98d4

Fix lampepfl/dotty-knowledge#17: add a scripted sbt test for @main annotation

Fix scala#8362: Fail compilation if a compile time error can't be inl…

0803aff

…ined because of a non-literal string parameter

Avoid overcompilation involving inline or annotation trees

b952d41

We used to pretty-print trees in the API info we send to sbt, but the pretty-printed output seems to be unstable leading to overcompilation, so just use the raw trees instead (this should also be faster).

Merge pull request scala#8359 from dotty-staging/overcompilation-print

08f876e

Avoid overcompilation involving inline or annotation trees

Merge pull request scala#8381 from robstoll/patch-36

1955f75

doc(export): add link to note

Merge pull request scala#8387 from Uko/compiletime-error-with-s-inter…

84ea41a

…polator Fix scala#8362: Fail compilation if a compile time error can't be inlined because of a non-literal string parameter

Merge pull request scala#8374 from robstoll/patch-33

f8e65de

doc(macros): fix expansion and some improvements

Merge pull request scala#8380 from robstoll/patch-35

248009c

doc(creator apply): typos

Merge pull request scala#8379 from robstoll/patch-34

c6f46b5

doc(trait param): state which rule is violated

Merge pull request scala#8363 from robstoll/patch-28

8f2aabe

doc(multi-staging): fix typos

Merge pull request scala#8372 from robstoll/patch-31

11bd978

doc(untupling): fix list at the end, add newline

Merge pull request scala#8373 from robstoll/patch-32

d943616

doc(context-functions): typo `are` instead of `is`

revert ascii-doc change

12773ec

Merge pull request scala#8367 from robstoll/patch-30

5134e98

doc(operators): typo and add import to examples

Merge pull request scala#8366 from robstoll/patch-29

63343d2

doc(dep-fun-type): typos

Fix exhaustivity issue required to publish bootstrapped

1e2506b

Fix-scala#4867 Update union tests to improve inference when ascribed

16288bc

Splits union.scala test from negative to negative and positive tests, exhibiting the preservation of the union when the union has been explicitly ascribed as per fix scala#4867.

Fix-scala#4867 provide positive test for fix

735bed5

Fix unsafe cast

bf00a1e

OlivierBlanvillain closed this Mar 26, 2020
