Refactor null_safe_op to workaround codegen changes #21290
Conversation
cc @nalimilan
Force-pushed from edd906b to 4054143 (compare)
@nanosoldier Edit: Hm, maybe that's not the right way to invoke the nullable benchmarks...
base/broadcast.jl (outdated diff)
```julia
function null_safe_eltype_op(op, xs...)
    Base.@_pure_meta
    null_safe_op(op, typeeltypestuple(xs...))
end
```
`eltype(a)` is not pure, so it's likely a bad idea to mark this entire stack of calls as `@pure` (it seems the same applies to `eltypestuple` also).
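To illustrate the point, here is a minimal sketch (hypothetical; the `Bag` type and `cached_eltype` helper are not from this PR) of why `eltype` is not safe to treat as pure: `@pure` lets inference constant-fold a result without tracking backedges, so a later redefinition of `eltype` can leave a stale, folded answer behind.

```julia
# Hypothetical example (not from the PR): why a call chain that reaches
# `eltype` should not be blanket-annotated with @pure.
struct Bag end                       # made-up container type
Base.eltype(::Type{Bag}) = Any       # initial eltype definition

# @pure tells inference the result depends only on the argument values,
# so it may constant-fold this call and never re-check the method table.
Base.@pure cached_eltype(T::Type) = eltype(T)

g() = cached_eltype(Bag)
g()                                  # may already be folded to `Any`

Base.eltype(::Type{Bag}) = Int       # later redefinition
g()                                  # can still report the stale `Any`
```

Whether the stale value actually shows up depends on what inference decides to fold, but the point is that `@pure` removes the guarantee that the result tracks later method definitions.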
I'm cautiously optimistic that simply removing `Base.@_pure_meta` is OK, and inference can infer everything anyway. So far I haven't noticed any inference regressions and `make test` still passes. Could you/someone else run nanosoldier to make sure nothing regresses?
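One way to sanity-check that locally (a hypothetical snippet, not part of the PR, assuming 0.6-era `Base.Test` and `Nullable`) is to ask `@inferred` whether a trivial `Nullable` broadcast still gets a concrete return type once the purity annotation is gone:

```julia
# Hypothetical local check, assuming Julia 0.6 where Nullable lives in Base.
using Base.Test            # provides @inferred

h() = broadcast(+, Nullable(1), Nullable(2))

# Throws if the return type is not concretely inferred, which would
# indicate an inference regression from dropping @_pure_meta.
@inferred h()
```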
Why is `eltype(a)` not pure? Is anything more complex than `f() = 1` pure? Or is that also not pure?
@nanosoldier
Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @jrevels
None of the regressions/improvements make any sense 😕. But a lot of the regressions are pretty substantial. (The improvements are probably noise.)
Interestingly, the nullable benchmarks seem to be unaffected.
The dot calls are probably not benchmarked. We should add benchmarks for them.
If you mean
It should be, but Nanosoldier won't catch it if you edit your submission after submitting it. Let's try with a fresh submission: @nanosoldier
Your benchmark job has completed - no performance regressions were detected. A full report can be found here. cc @jrevels
Could you give an example of the codegen regression --- e.g. a particular
@JeffBezanson Sure, see #21305. I'm currently bisecting to see if I can find the commit that changed this.
@TotalVerb Hopefully this PR is not needed any more --- can you double check?
@JeffBezanson #21310 is a nice change but I don't think it's the particular problem here. The file from #21305 still has long LLVM:

```julia
eltypestuple() = Tuple{}
eltypestuple(x, xs...) = Tuple{eltype(x), eltypes(xs...).types...}
null_safe_op(::Any, ::Type, ::Type...) = false
Base.@pure null_safe_eltype_op(f, xs...) =
    null_safe_op(f, eltypestuple(xs...).types...)
@inline function broadcast_c(f, a...)
    if null_safe_eltype_op(f, a...)
        0
    else
        1
    end
end
f() = broadcast_c(1:10, 1:10, 1:10)
code_llvm(STDOUT, f, Tuple{})
```

The return value is correctly inferred as a constant, but the issue is that there is no way it seems for inference to figure out that
The

```
Variables:
  #self#::#f

Body:
  begin
      $(Expr(:inbounds, false))
      # meta: location /tmp/regression.jl broadcast_c 9
      (Core._apply)(Main.null_safe_op, (Core.tuple)($(Expr(:new, UnitRange{Int64}, 1, :((Base.select_value)((Base.sle_int)(1, 10)::Bool, 10, (Base.sub_int)(1, 1)::Int64)::Int64))))::Tuple{UnitRange{Int64}}, (Core.getfield)((Core._apply)(Core.apply_type, (Core.tuple)(Main.Tuple, Int64)::Tuple{DataType,DataType}, (Core.getfield)((Main.eltypes)($(Expr(:new, UnitRange{Int64}, 1, :((Base.select_value)((Base.sle_int)(1, 10)::Bool, 10, (Base.sub_int)(1, 1)::Int64)::Int64))))::ANY, :types)::ANY)::ANY, :types)::ANY)::Bool
      goto 6 # line 10:
      6:
      # meta: pop location
      $(Expr(:inbounds, :pop))
      return 1
  end::Int64
```
In that code,
My bad! You're right, this problem has been fixed.
There have been codegen / inference changes since #16961 was merged that result in substantial (~40x) regressions in trivial `Nullable` broadcasts. See https://discourse.julialang.org/t/performance-of-dot-operator-on-nullables/3071 for a report of the regressions.

This PR works around those changes by combining all parameters for `null_safe_op` into a single `Type{<:Tuple}` argument, which avoids the splat of `(::Type{<:Tuple}).parameters` that had previously confounded codegen (and would not be eliminated even when the function is declared `@pure`). Since this is a performance-only change it should go into 0.6.
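As a rough illustration of the approach described above (a sketch only; the method signatures and the `+`/`Int` example are made up and do not match the actual diff), the idea is to hand `null_safe_op` one `Tuple` type instead of splatting `.types` into a vararg call, so no `Core._apply` is emitted:

```julia
# Sketch of the workaround, not the exact code from the PR.

# Old shape (simplified): splatting the tuple type's parameters into a
# vararg `null_safe_op(f, ::Type, ::Type...)` call, which produced the
# Core._apply seen in the inferred code above.
# null_safe_eltype_op(f, xs...) = null_safe_op(f, eltypestuple(xs...).types...)

# New shape (simplified): pass the whole Tuple type as one argument.
null_safe_op(f, ::Type{<:Tuple}) = false                    # conservative fallback
null_safe_op(::typeof(+), ::Type{Tuple{Int,Int}}) = true    # made-up example method

eltypestuple() = Tuple{}
eltypestuple(x, xs...) = Tuple{eltype(x), eltypestuple(xs...).parameters...}

null_safe_eltype_op(f, xs...) = null_safe_op(f, eltypestuple(xs...))

null_safe_eltype_op(+, [1, 2], [3, 4])   # true under these made-up methods
```

With the whole element-type tuple passed as a single argument, dispatch on `null_safe_op` can be resolved without splatting a heterogeneous argument list, which is what allows codegen to fold the branch in `broadcast_c` away.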