WIP: redesign of tuples and tuple types #10380

JeffBezanson · 2015-03-02T22:16:16Z

I have only just barely begun to implement this, but I'm filing this early to keep everybody apprised of the new design. This is my current pick for highest-priority breaking change to system internals for 0.4.

Summary of the approach:

The representation currently used for heap-allocated tuples (a length followed by some pointers) will instead be used for a new type called SimpleVector. This way many internal things can keep using the same representation (at least for a while).
Tuples will be immutable DataTypes, and DataType will change very little
Tuples and tuple types are now totally different, distinguished by jl_is_tuple and jl_is_tuple_type. jl_typeof and jl_is_type are now much simpler.
Internal names starting with jl_tuple are generally changed to jl_sv (for SimpleVector)
The horribly-named jl_null is changed to jl_emptytuple and jl_emptysv
Tuples will be accessed with jl_fieldref
The Vararg type is replaced with a boolean flag in DataType
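
For readers following along in a released Julia, the value/type split described above can be checked roughly as follows. This is a sketch assuming a Julia 1.x session, not this branch: the names come from released Julia (where tuple types are written Tuple{...}), and the C-level functions mentioned above are not exercised here.

```julia
# Assumption: Julia 1.x, not the jb/tupleoverhaul branch discussed in this thread.
t = (1, 2.0)            # a tuple value
T = typeof(t)           # the corresponding tuple type

@assert T == Tuple{Int, Float64}
@assert t isa Tuple                 # the value is a tuple
@assert !(T isa Tuple)              # the type is not itself a tuple
@assert T isa DataType              # tuple types are ordinary DataTypes

# Type parameters are stored in a SimpleVector, not in a tuple.
@assert T.parameters isa Core.SimpleVector
sv = Core.svec(Int, Float64)        # build a SimpleVector directly
@assert length(sv) == 2 && sv[1] === Int

# Tuple elements are read positionally, like fields.
@assert getfield(t, 2) === 2.0
```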

Syntax changes:

Construct tuple types with {}
deprecate (Type,Type) to {Type,Type}, eventually have it return a tuple of types instead
deprecate (Type...) to {Type...}, eventually have it splat instead

Planned follow-on changes:

make NTuple a concrete type, for efficiently handling large homogeneous tuples
reduce use of SimpleVector as much as possible

Addressed issues:
#8470 - use {} for tuple types
#4869 - splatting in tuple construction
#7941 - Towards array nirvana
#3440 - optimization tracker

Related issues:
#7568 - WIP: implementation of fixed-size arrays
#2299 - SIMD types

timholy · 2015-03-02T22:25:37Z

I can't find anything to complain about in that list. Quite the contrary, in fact.

ivarne · 2015-03-02T23:13:07Z

I bet we can find a better usage for {} than tuple types. I feel like we're out of ASCII characters for new syntax all the time, and tuple types are so obscure that most users have managed well without them. Tuple{} looks pretty nice too, and has a similar feeling to the other parametric types.

I will also express my negative feelings for pushing the 0.4 release further into the future. We're past due already, and stalled releases give a negative impression about our progress.

(If this gets into 0.4, we'll also need to keep {} as a deprecation for Any[], if we are to follow our deprecation policy)

PS: otherwise this seems like a really great change!

Jutho · 2015-03-03T06:58:02Z

+1 for every single suggestion... Does this also mean that a NTuple{N,ConcreteType} field inside a type definition will be 'inlined', so that we can easily build ImmutableVector and redo CartesianIndex ?

nalimilan · 2015-03-03T08:27:24Z

@ivarne What candidates do you have in mind for an alternative use of {}? I think that's an interesting data point for the discussion.

timholy · 2015-03-03T08:53:12Z

Hmm, @ivar's point about one cycle's worth of deprecation for {} does seem important.

ivarne · 2015-03-03T09:05:02Z

@nalimilan See #10386.

(@timholy I'm @ivarne, not @ivar. (Sorry to my name brother in Vancouver))

timholy · 2015-03-03T09:21:35Z

Sorry to both.

JeffBezanson · 2015-03-03T17:07:17Z

I'm willing to use Tuple{}. In a few cases this could be awkward though, like ccall(f, T, Tuple{A, B}, x, y). ccall could maybe continue to use a tuple of types instead of an actual tuple type.
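
To illustrate the ccall point: in released Julia (an assumption about the reader's setup, 1.x), ccall does take its argument types as a tuple of types rather than a tuple type. The sketch below calls the C library's strlen, assuming that symbol is resolvable in the running process, which is typical but platform-dependent.

```julia
# Assumption: Julia 1.x with the C runtime's `strlen` visible to the process.
# The third argument is a tuple of types, (Cstring,), not Tuple{Cstring}.
n = ccall(:strlen, Csize_t, (Cstring,), "hello")

@assert n == 5
@assert (Cstring,) isa Tuple        # a tuple whose element is a type
@assert Tuple{Cstring} isa Type     # the tuple type is a different object
```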

@Jutho yes tuples of bits types will be inlined.
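
A rough way to observe the inlining in released Julia (assuming 1.x; Index3 is a hypothetical type made up for this sketch, not something from Base or this PR):

```julia
# Assumption: Julia 1.x. `Index3` is a hypothetical wrapper for illustration only.
struct Index3
    I::NTuple{3,Int}
end

# A tuple of bits types is itself a bits type and has no per-element pointers.
@assert isbitstype(NTuple{3,Int})
@assert sizeof(NTuple{3,Int}) == 3 * sizeof(Int)

# Stored as a field, the tuple is laid out inline in the enclosing struct.
@assert isbitstype(Index3)
@assert sizeof(Index3) == 3 * sizeof(Int)

# A tuple with a non-bits element does not qualify.
@assert !isbitstype(Tuple{Int,String})
```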

As for release timing, my current thinking is to get this change in, but punt most of the rest of what's currently in the 0.4 milestone. Any very disruptive changes that we know we need, we should do as soon as possible. Many of the other 0.4 issues are not that disruptive, e.g. if they can be deprecated easily.

StefanKarpinski · 2015-03-03T17:22:13Z

What about Tuple{(A,B)}? That way Tuple always has a single parameter which is the tuple of types.

JeffBezanson · 2015-03-03T17:59:50Z

That's an interesting idea. Off-line I actually worked through ~3 possible designs for this, and I considered some things like that. One unusual feature of such a design is that the length of a tuple is not actually stored anywhere; you just have a cycle of objects for each possible length. Of course you could store the length anyway, but you can't get to it just by looking at the parameter(s) of a tuple type, which is a bit surprising. It also entails allocating more objects per tuple type, and there are a lot of tuple types.

StefanKarpinski · 2015-03-03T19:00:19Z

I was just thinking that allowing varargs parametric types was kind of a complication to the type system. Would you be able to declare your own varargs parametric types? Or would it be something that is special to Tuple?

ivarne · 2015-03-03T19:12:19Z

Aren't types kind of varargs already (in that you can have incompletely specified types like Array{Int})?

JeffBezanson · 2015-03-03T19:20:00Z

I wasn't planning to add general varargs type parameters; this would be specific to tuples.

Varargs parameters could mean two things: allowing a type to be instantiated with any number of parameters, or allowing ... to be present (which I guess would be a kind of pattern for matching trailing arguments). The second feature is a bit odd without covariance.

We don't have this already, because the number of parameters is always limited. No type actually accepts any number of parameters (except tuples).
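
As a concrete reference for the two meanings of "varargs" above, here is how they look in released Julia (assuming 1.x, where the trailing pattern is spelled Vararg{T} inside Tuple{...}; that spelling is from released Julia, not from this proposal):

```julia
# Assumption: Julia 1.x syntax.
# Trailing-argument pattern: an Int followed by any number of Float64s.
VT = Tuple{Int, Vararg{Float64}}
@assert (1, 2.0, 3.0) isa VT
@assert (1,) isa VT
@assert !((1, 2) isa VT)

# Tuple types are covariant, which is what makes the pattern useful;
# ordinary parametric types are invariant.
@assert Tuple{Int} <: Tuple{Any}
@assert !(Vector{Int} <: Vector{Any})

# Ordinary types accept only a fixed number of parameters.
rejected = try
    Array{Int,1,2}      # one parameter too many
    false
catch
    true
end
@assert rejected
```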

tkelman · 2015-03-08T17:48:15Z

Could jl_sv be made at least a few characters more descriptive? jl_svec maybe?

JeffBezanson · 2015-03-08T17:56:06Z

Ok.

joschu · 2015-03-13T20:38:16Z

I don't like the {} syntax, because it's at odds with mathematical notation, where curly braces denote sets, and most other programming languages, where curly braces denote unordered collections. (E.g., in Python, where {} constructs a set or dict, and Lua and Matlab, where {} constructs an unordered map.)

IainNZ · 2015-03-13T20:56:12Z

@joschu we already use curly brackets to denote tuples of types in a lot of places though, e.g.

type MyType{A,B}
  foo::A
  bar::B
end

MyType(5.0, 2)  # MyType{Float64,Int}

So it doesn't seem too ambiguous

StefanKarpinski · 2015-03-13T21:06:32Z

It kind of makes sense:

(a,b) is just the argument part of f(a,b) without the function
{A,B} is just the parameter part of T{A,B} without the type.

joschu · 2015-03-13T21:09:30Z

Ah! I misread the original proposal as saying that all tuples would be constructed using that notation.
Using {} for tuple types completely makes sense. Cheers.

JeffBezanson · 2015-03-24T22:19:57Z

Making good progress here.

One minor thing I ran into: it's a bit odd that usually T{} == T, but Tuple{} != Tuple == Tuple{Any,...}.
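
The oddity can be reproduced in released Julia (assuming 1.x, where the Tuple{Any,...} written above is spelled Tuple{Vararg{Any}}):

```julia
# Assumption: Julia 1.x spelling of the vararg tuple type.
@assert Tuple{} != Tuple                 # Tuple{} is the type of () only
@assert Tuple == Tuple{Vararg{Any}}      # bare Tuple matches any length

@assert () isa Tuple{}
@assert !((1,) isa Tuple{})
@assert (1,) isa Tuple
```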

And a major thing: we have no way to destructure tuple types. We used to be able to use all tuple functions on tuple types, which was pretty convenient, but we've never had variadic DataTypes before so there's no infrastructure for that. This mostly affects convert for tuple types. I could use a staged function but I'd rather avoid that.

The traditional approach would be a recursive representation, e.g. the type of (1,2,3.0) is Product{Int,Product{Int,Product{Float64,{}}}}. That's pretty disruptive. We could accomplish something similar just with a function that takes Tuple{A,B,C} to Tuple{B,C}. I'm not sure how to do that any more elegantly than a staged function or a builtin.
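
For reference, a function of that shape can be sketched in released Julia without a staged function. This assumes Julia 1.1 or later (for fieldtypes); tuple_type_rest is a hypothetical name for this sketch, and it only handles non-empty, fixed-length tuple types. It is not what this PR implements.

```julia
# Assumption: Julia >= 1.1. `tuple_type_rest` is a hypothetical helper name.
# Takes Tuple{A,B,C} to Tuple{B,C}; not meant for Tuple{} or Vararg tuple types.
tuple_type_rest(::Type{T}) where {T<:Tuple} =
    Tuple{Base.tail(fieldtypes(T))...}

@assert tuple_type_rest(Tuple{Int,Int,Float64}) == Tuple{Int,Float64}
@assert tuple_type_rest(Tuple{Int}) == Tuple{}
```

Note that this round-trips through a tuple of types at the value level, which is the detour the discussion here is trying to avoid.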

StefanKarpinski · 2015-03-24T22:50:55Z

Making the type parameter the tuple of types solves both of these issues, no? Tuple{} == Tuple != Tuple{(Any...)}. You can also do destructuring on the parameter itself, so the only inconvenient step is pulling it out of the type. Might it be possible to hack the representation so that the actual tuple object need not be stored?

JeffBezanson · 2015-03-24T22:59:47Z

A good way to do that would be to have a parameters(T) function (instead of T.parameters) that gives you a tuple of the parameters.

I hesitate there because of the large number of silly tuple types we'd need to generate: typeof(parameters(typeof((1,2)))) == Tuple{Type{Int},Type{Int}}. We'd need that instead of Tuple{DataType,DataType} for the full effect. It seems like instead of moving around between the type and value level, you should just be able to manipulate the type directly. The type we started with, Tuple{Int,Int}, already has all the needed information.
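
The distinction between the two candidate types can be seen in released Julia, using fieldtypes as a stand-in for the parameters(T) function suggested above. That substitution is an assumption made for this sketch (Julia 1.1 or later), not the API proposed here.

```julia
# Assumption: Julia >= 1.1; `fieldtypes` stands in for the proposed parameters(T).
T = typeof((1, 2))
params = fieldtypes(T)              # a tuple of types

@assert params == (Int, Int)

# typeof only records that each element is some DataType...
@assert typeof(params) == Tuple{DataType, DataType}

# ...whereas the precise description is a tuple of singleton types.
@assert params isa Tuple{Type{Int}, Type{Int}}
@assert Tuple{Type{Int}, Type{Int}} <: Tuple{DataType, DataType}
```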

JeffBezanson · 2015-04-09T16:05:52Z

I thought my last commit did not work. Funny thing is, it actually did work, but just took so long to start up that I gave up waiting... how amusing... :)

ViralBShah · 2015-04-09T17:18:24Z

👍

JeffBezanson · 2015-04-09T19:07:39Z

If anybody wants to kick the tires, this is now usable with sys0.ji. Don't try to build sys.ji; just interrupt it.

mdcfrancis · 2015-05-13T00:48:46Z

Makes sense, thanks for the clarification.

…0875) - add a linearly-searched part of the type cache for more difficult types - assign UIDs earlier so they can't change after cache insertion

ivarne mentioned this pull request Mar 3, 2015

What should we use unprefixed {} brackets for? #10386

Closed

JeffBezanson mentioned this pull request Mar 3, 2015

type inference and tuples #10352

Closed

JeffBezanson force-pushed the jb/tupleoverhaul branch from 246c078 to a01f07e Compare March 21, 2015 04:21

JeffBezanson mentioned this pull request Mar 31, 2015

RFC: controlling dispatch with varargs of defined length #10691

Closed

mauro3 mentioned this pull request Apr 8, 2015

vararg tuple types not working when vararg not at last place #10770

Closed

This was referenced May 17, 2015

SegFault caused by tupocolypse #11313

Closed

Parameterized type with super long tuple #11321

Closed

long tuple problems #11320

Closed

timholy mentioned this pull request May 19, 2015

StackOverflowError display 900-dimensional array #11340

Closed

JeffBezanson added a commit that referenced this pull request May 25, 2015

fix #11278, restore check that field types are Types, lost in #10380

d777bcc

This was referenced May 25, 2015

Building a Dict with 5-tuple keys is slow on 0.3 and 0.4 #11100

Closed

2x performance regression when comparing types #11425

Closed

Stopgap for Vararg widening #11480

Closed

timholy mentioned this pull request Jun 2, 2015

Dispatch on tuples with a type #11535

Closed

yuyichao mentioned this pull request Jun 6, 2015

Indexing performance #11595

Closed

mbauman pushed a commit to mbauman/julia that referenced this pull request Jun 6, 2015

more tupocolypse (JuliaLang#10380) updates

1087d0c

mbauman pushed a commit to mbauman/julia that referenced this pull request Jun 6, 2015

try to fix the intermittent failures since JuliaLang#10380

758b893

I believe the bug was due to the need to hash typename->primary specially, based on how other type functions treat it.

mbauman pushed a commit to mbauman/julia that referenced this pull request Jun 6, 2015

fix some bitrot in inline_incompletematch (from JuliaLang#7075) now t…

d738580

…hat JuliaLang#10380 is merged and it works & can be tested

mbauman pushed a commit to mbauman/julia that referenced this pull request Jun 6, 2015

fix JuliaLang#11278, restore check that field types are Types, lost in …

a43884e

…JuliaLang#10380

tkelman pushed a commit to tkelman/julia that referenced this pull request Jun 6, 2015

fix JuliaLang#11278, restore check that field types are Types, lost in …

f51470f

…JuliaLang#10380

This was referenced Jun 8, 2015

Do not cache the type before we finish constructing it. #11606

Merged

Revert "Do not cache the type before we finish constructing it." #11632

Merged

yuyichao mentioned this pull request Jun 11, 2015

julia-mode.el: Highlight functions with nested parens in definition #8722

Closed

jumutc mentioned this pull request Jun 12, 2015

Loading of XGBoost module fails dmlc/XGBoost.jl#5

Closed

dhoegh mentioned this pull request Jun 13, 2015

Regression in methodshow/print_to_string #11700

Closed

stevengj mentioned this pull request Jun 25, 2015

Sorting wrt multiple arrays JuliaCollections/SortingAlgorithms.jl#12

Closed

yuyichao mentioned this pull request Jul 15, 2015

Code generation failure on expression involving Tuples of length > 2 (regression in 0.4-dev) #12163

Closed

yuyichao mentioned this pull request Jul 24, 2015

Remove OVERLAP_SVEC_LEN #12288

Merged

yuyichao mentioned this pull request Aug 16, 2015

varargs generated functions generate inefficient code #12643

Closed

timholy mentioned this pull request Aug 21, 2015

RFC: deprecate cartesianmap #12716

Merged

mbauman mentioned this pull request Aug 24, 2015

Tuple{} vs tuple() in Generated Function Vararg type #12783

Closed

DSLituiev pushed a commit to DSLituiev/JLD.jl that referenced this pull request Oct 19, 2015

Update JLD for JuliaLang/julia#10380

62e9b24

yuyichao mentioned this pull request Jul 30, 2017

Regression calling cfunction #23020

Closed
