Poor inference of `[1, 2] +` [3.0, missing]` #28382

andyferris · 2018-08-01T00:28:08Z

I have been trying to get into using missing and fast Union types in my worflow and seem to be coming up with some issues. It's most easily highlighted by:

julia> @code_warntype [1, 2] + [3.0, missing]
Body::Array
44 1 ── %1  = (Base.getfield)(Bs, 1, true)::Array{Union{Missing, Float64},1}                                                                               │╻╷╷         iterate
   └───       goto #7 if not true                                                                                                                          │           
   2 ┄─ %3  = φ (#1 => %1, #6 => %12)::Array{Union{Missing, Float64},1}                                                                                    │           
   │    %4  = φ (#1 => 2, #6 => %13)::Int64                                                                                                                │           
45 │          invoke Base.promote_shape(_2::Array{Int64,1}, %3::Array{Union{Missing, Float64},1})                                                          │           
   │    %6  = (Base.slt_int)(1, %4)::Bool                                                                                                                  ││╻           <
   └───       goto #4 if not %6                                                                                                                            ││          
   3 ──       goto #5                                                                                                                                      ││          
   4 ── %9  = (Base.getfield)(Bs, %4, true)::Array{Union{Missing, Float64},1}                                                                              ││╻           getindex
   │    %10 = (Base.add_int)(%4, 1)::Int64                                                                                                                 ││╻           +
   └───       goto #5                                                                                                                                      ││          
   5 ┄─ %12 = φ (#4 => %9)::Array{Union{Missing, Float64},1}                                                                                               │           
   │    %13 = φ (#4 => %10)::Int64                                                                                                                         │           
   │    %14 = φ (#3 => true, #4 => false)::Bool                                                                                                            │           
   │    %15 = (Base.not_int)(%14)::Bool                                                                                                                    │           
   └───       goto #7 if not %15                                                                                                                           │           
   6 ──       goto #2                                                                                                                                      │           
47 7 ── %18 = (getfield)(Bs, 1)::Array{Union{Missing, Float64},1}                                                                                          │           
   │    %19 = (Core.tuple)(A, %18)::Tuple{Array{Int64,1},Array{Union{Missing, Float64},1}}                                                                 ││╻           broadcasted
   │    %20 = (Base.arraysize)(A, 1)::Int64                                                                                                                │││╻╷╷╷╷       instantiate
   │    %21 = (Base.slt_int)(%20, 0)::Bool                                                                                                                 ││││╻╷╷╷        combine_axes
   │    %22 = (Base.ifelse)(%21, 0, %20)::Int64                                                                                                            │││││┃│││││      broadcast_axes
   │    %23 = %new(Base.OneTo{Int64}, %22)::Base.OneTo{Int64}                                                                                              ││││││┃│││        axes
   │    %24 = (Base.arraysize)(%18, 1)::Int64                                                                                                              ││││││╻╷╷         broadcast_axes
   │    %25 = (Base.slt_int)(%24, 0)::Bool                                                                                                                 │││││││╻╷╷╷        axes
   │    %26 = (Base.ifelse)(%25, 0, %24)::Int64                                                                                                            ││││││││┃│││        map
   │    %27 = %new(Base.OneTo{Int64}, %26)::Base.OneTo{Int64}                                                                                              │││││││││┃│          Type
   │    %28 = (%26 === %22)::Bool                                                                                                                          ││││││╻╷╷╷╷       _bcs
   │    %29 = (Base.and_int)(true, %28)::Bool                                                                                                              │││││││╻           _bcs1
   └───       goto #9 if not %29                                                                                                                           ││││││││┃           _bcsm
   8 ──       goto #10                                                                                                                                     │││││││││   
   9 ── %32 = (%22 === 1)::Bool                                                                                                                            │││││││││╻           ==
   └───       goto #10                                                                                                                                     │││││││││   
   10 ┄ %34 = φ (#8 => %29, #9 => %32)::Bool                                                                                                               ││││││││    
   └───       goto #12 if not %34                                                                                                                          ││││││││    
   11 ─       goto #18                                                                                                                                     ││││││││    
   12 ─ %37 = (%22 === %26)::Bool                                                                                                                          │││││││││╻╷          ==
   │    %38 = (Base.and_int)(true, %37)::Bool                                                                                                              ││││││││││╻           &
   └───       goto #14 if not %38                                                                                                                          │││││││││   
   13 ─       goto #15                                                                                                                                     │││││││││   
   14 ─ %41 = (%26 === 1)::Bool                                                                                                                            │││││││││╻           ==
   └───       goto #15                                                                                                                                     │││││││││   
   15 ┄ %43 = φ (#13 => %38, #14 => %41)::Bool                                                                                                             ││││││││    
   └───       goto #17 if not %43                                                                                                                          ││││││││    
   16 ─       goto #18                                                                                                                                     ││││││││    
   17 ─ %46 = %new(Base.DimensionMismatch, "arrays could not be broadcast to a common size")::DimensionMismatch                                            ││││││││╻           Type
   │          (Base.Broadcast.throw)(%46)                                                                                                                  ││││││││    
   └───       $(Expr(:unreachable))                                                                                                                        ││││││││    
   18 ┄ %49 = φ (#11 => %27, #16 => %23)::Base.OneTo{Int64}                                                                                                │││││││     
   │    %50 = (Core.tuple)(%49)::Tuple{Base.OneTo{Int64}}                                                                                                  │││││││     
   └───       goto #19                                                                                                                                     │││││││     
   19 ─       goto #20                                                                                                                                     ││││││      
   20 ─       goto #21                                                                                                                                     │││││       
   21 ─ %54 = %new(Base.Broadcast.Broadcasted{Base.Broadcast.DefaultArrayStyle{1},Tuple{Base.OneTo{Int64}},typeof(+),Tuple{Array{Int64,1},Array{Union{Missing, Float64},1}}}, +, %19, %50)::Base.Broadcast.Broadcasted{Base.Broadcast.DefaultArrayStyle{1},Tuple{Base.OneTo{Int64}},typeof(+),Tuple{Array{Int64,1},Array{Union{Missing, Float64},1}}}
   └───       goto #22                                                                                                                                     ││││        
   22 ─ %56 = invoke Base.Broadcast.copy(%54::Base.Broadcast.Broadcasted{Base.Broadcast.DefaultArrayStyle{1},Tuple{Base.OneTo{Int64}},typeof(+),Tuple{Array{Int64,1},Array{Union{Missing, Float64},1}}})::Array
   └───       goto #23                                                                                                                                     │││         
   23 ─       goto #24                                                                                                                                     ││          
   24 ─       return %56

As you can see, the resultant output is (correctly but imprecisely) inferred to be Array (not even Vector?).

I find this example interesting for two reasons

Performance (resulting from imprecise inference)
Predictable interface provided by output container

Regarding performance, I find it quite usual to combine combinations of "functional" operations like +, map, etc, together with hand-written loops in the same scope. In fact, predictably good performance over loops, recursion, higher-order programming, etc is kind of one of our main selling points. In the above, performance will be pessimised if I loop over the output since neither the container type (even the array dimensionality is lost, somehow!) nor the element type is well known. I'm guessing but I'd predict that Vector{Union{Missing, Float64}} would be fastest for subsequent iteration, but it's possible that if inference found something like Vector{<:Union{Missing, Float64}} then this would also be quite fast (I'm not sure, really).

Regarding the second, more semantic issue of what the output actually becomes at run time, I'm a bit confused about what I'm meant to assume I can do with the output container. For these input types, we can get

julia> Int[] + Union{Float64, Missing}[]
0-element Array{Union{Missing, Float64},1}

julia> Int[1] + Union{Float64, Missing}[3.0]
1-element Array{Float64,1}:
 4.0

julia> Int[2] + Union{Float64, Missing}[missing]
1-element Array{Missing,1}:
 missing

julia> Int[1, 2] + Union{Float64, Missing}[3.0, missing]
2-element Array{Union{Missing, Float64},1}:
 4.0     
  missing

As a programmer, this scares me somewhat because I might get subtly different program behavior (or program correctness) depending on my input data. My concerm is that I might perform tests with mixtures of Missing and Float64 input values but be caught out in rare cases in production when all (or none) of the inputs are missing.

In this hypothetical scenario, my (tested, production) code might look something like

vec3 = vec1 + vec2
for i in keys(vec3)
    if isless(vec3[i], 0) # vec3 can't have any negative elements
        vec3[i] = missing
    end
end
return vec3

But unfortunately this would sometimes fail, depending on my input data, which may vary in size and quality (and maybe 1 in 10,000 times are all non-missing).

As a coder, I need to be able to predict the interface provided by the outputs of operation so I can write reliable code. Generally, I've noted that the output of +, map, broadcast, filter (for Array inputs) tend to be mutable Arrays, and generally am unafraid to mutate them. Naively, I would track in my head the eltypes of vec1 and vec2 and assume I can populate vec3 with anything consistent.

I'm not sure what the best solution is, but from my perspective I find any potential (performance?) gains in returning the narrowest container that fits the run-time data questionable in value compared to understanding the semantic guarantees as a code author.

The text was updated successfully, but these errors were encountered:

JeffBezanson · 2018-08-01T01:12:17Z

Inference can be improved, but there's no general solution to the second problem. Mutating an array just seems to be fundamentally incompatible with using functional constructs that pick an element type for you.

andyferris · 2018-08-01T05:08:16Z

Mutating an array just seems to be fundamentally incompatible with using functional constructs that pick an element type for you.

Yeah... I've been almost wondering if the outputs of such functions might even be better off marked immutable...

andyferris · 2018-08-01T05:11:09Z

Inference can be improved

I'm still not 100% convinced this is an inference issue per se, rather an issue of the container type chosen by broadcast being dependent on something other than the input types. Inference can only track input and output types (plus a bit of constant propagation) after all.

nalimilan · 2018-08-03T14:39:16Z

The way the element type is chosen cannot be changed before 2.0, but inference should certainly be fixed to avoid killing performance of all code using the resulting array without a function barrier.

The problem seems to happen with copyto_nonleaf!, which recursively calls itself to widen the element type when encountering values of a new type. That's the typical strategy for this situation, also used by map. But it's probably hard for inference to reason about these recursive calls, even if it knows elements can only ever be of type Int or Missing (and indeed everything else is inferred correctly in these functions).

Cc: @mbauman, @vtjnash

mbauman · 2018-08-03T15:06:08Z

I mean, the choice of container eltype all stems from how we've chosen to do comprehensions: #7258 — the interesting discussion is on the linked Google Groups topic. This is the same behavior.

nalimilan · 2018-08-04T02:40:30Z

Yes, I don't think we should discuss this behavior here. What needs fixing is just inference.

mbauman · 2018-08-04T16:49:55Z

Here's an option that initially looked promising:

diff --git a/base/broadcast.jl b/base/broadcast.jl
index 4d6629a02e..3230968463 100644
--- a/base/broadcast.jl
+++ b/base/broadcast.jl
@@ -759,7 +759,12 @@ const NonleafHandlingStyles = Union{DefaultArrayStyle,ArrayConflict}
     dest = similar(bc′, typeof(val))
     @inbounds dest[I] = val
     # Now handle the remaining values
-    return copyto_nonleaf!(dest, bc′, iter, state, 1)
+    if dest isa AbstractArray
+        # Give inference a helping hand on the element type and dimensionality (work-around for #28382)
+        return copyto_nonleaf!(dest, bc′, iter, state, 1)::AbstractArray{<:ElType, ndims(dest)}
+    else
+        return copyto_nonleaf!(dest, bc′, iter, state, 1)
+    end
 end

julia> @eval Base.Broadcast @inline function copy(bc::Broadcasted{Style}) where {Style}
           ElType = combine_eltypes(bc.f, bc.args)
           if Base.isconcretetype(ElType)
               # We can trust it and defer to the simpler `copyto!`
               return copyto!(similar(bc, ElType), bc)
           end
           # When ElType is not concrete, use narrowing. Use the first output
           # value to determine the starting output eltype; copyto_nonleaf!
           # will widen `dest` as needed to accommodate later values.
           bc′ = preprocess(nothing, bc)
           iter = eachindex(bc′)
           y = iterate(iter)
           if y === nothing
               # if empty, take the ElType at face value
               return similar(bc′, ElType)
           end
           # Initialize using the first value
           I, state = y
           @inbounds val = bc′[I]
           dest = similar(bc′, typeof(val))
           @inbounds dest[I] = val
           # Now handle the remaining values
           if dest isa AbstractArray
               # Give inference a helping hand on the element type and dimensionality (work-around for #28382)
               return copyto_nonleaf!(dest, bc′, iter, state, 1)::AbstractArray{<:ElType, ndims(dest)}
           else
               return copyto_nonleaf!(dest, bc′, iter, state, 1)
           end
       end
copy (generic function with 89 methods)

julia> @code_warntype [1, 2] + [3.0, missing]
Body::Array{T,1} where T<:Union{Missing, Float64}

Unfortunately it fails in cases like broadcast((x,y)->(x==1 ? 1.0 : x, y), [1 2 3], ["a", "b", "c"]), because copyto_nonleaf uses promote_typejoin to choose the new container eltype, whereas inference returns a more precise union:

julia> broadcast((x,y)->(x==1 ? 1.0 : x, y), [1 2 3], ["a", "b", "c"])
ERROR: TypeError: in typeassert, expected AbstractArray{#s4,2} where #s4<:Tuple{Union{Float64, Int64},String}, got Array{Tuple{Real,String},2}

nalimilan · 2018-08-07T03:02:26Z

Thanks @mbauman. I had been considering the same temporary fix. Maybe we can fix the problem you highlight by calling promote_typejoin on Union type parameters? BTW, I wonder whether you caught a bug by which the element type used when the input is empty isn't what we intended (i.e. not what is returned when values of all possible types are present in the returned array).

nalimilan · 2018-12-21T21:44:45Z

I've made an attempt at implementing this strategy at #30485.

JeffBezanson added broadcast Applying a function over a collection compiler:inference Type inference labels Aug 1, 2018

nalimilan added the missing data Base.missing and related functionality label Aug 4, 2018

This was referenced Dec 20, 2018

Broadcasting with missings is slow #30455

Closed

Make return type of broadcast inferrable with heterogeneous arrays #30485

Merged

nalimilan mentioned this issue Feb 6, 2019

Improve quantile in corner cases of collection eltype #30938

Merged

nalimilan closed this as completed in #30485 Oct 27, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Poor inference of `[1, 2] +` [3.0, missing]` #28382

Poor inference of `[1, 2] +` [3.0, missing]` #28382

andyferris commented Aug 1, 2018 •

edited

Loading

JeffBezanson commented Aug 1, 2018

andyferris commented Aug 1, 2018 •

edited

Loading

andyferris commented Aug 1, 2018 •

edited

Loading

nalimilan commented Aug 3, 2018

mbauman commented Aug 3, 2018 •

edited

Loading

nalimilan commented Aug 4, 2018

mbauman commented Aug 4, 2018

nalimilan commented Aug 7, 2018

nalimilan commented Dec 21, 2018

Poor inference of [1, 2] + [3.0, missing]` #28382

Poor inference of [1, 2] + [3.0, missing]` #28382

Comments

andyferris commented Aug 1, 2018 • edited Loading

JeffBezanson commented Aug 1, 2018

andyferris commented Aug 1, 2018 • edited Loading

andyferris commented Aug 1, 2018 • edited Loading

nalimilan commented Aug 3, 2018

mbauman commented Aug 3, 2018 • edited Loading

nalimilan commented Aug 4, 2018

mbauman commented Aug 4, 2018

nalimilan commented Aug 7, 2018

nalimilan commented Dec 21, 2018

Poor inference of `[1, 2] +` [3.0, missing]` #28382

Poor inference of `[1, 2] +` [3.0, missing]` #28382

andyferris commented Aug 1, 2018 •

edited

Loading

andyferris commented Aug 1, 2018 •

edited

Loading

andyferris commented Aug 1, 2018 •

edited

Loading

mbauman commented Aug 3, 2018 •

edited

Loading