
huge performance regression in vector math #17794

Closed · stevengj opened this issue Aug 3, 2016 · 20 comments

Labels: performance (Must go faster), regression (Regression in behavior compared to a previous version)

@stevengj
Member

stevengj commented Aug 3, 2016

There has been a huge performance regression in simple vector-math operations like +, e.g.

x = rand(10^7); y = rand(10^7);
@time x + y;
@time x + y;

gives 0.495843 seconds (20.00 M allocations: 381.463 MB, 20.92% gc time) ... notice the 20M allocations, indicative of a type instability in an inner loop.

The x + y call devolves into a call to Base._elementwise(+, Float64, x, y) in arraymath.jl, which was most recently touched by #17389 (@pabloferz) and #17313 (@martinholters).

Since @nanosoldier didn't detect any performance regressions in #17313, I'm guessing #17389 is the problem here?
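
As a sanity check (a sketch, output not reproduced here), the typed code for the call above can be inspected directly:

x = rand(10^7); y = rand(10^7);
# Abstract (non-concrete) types in the loop body of this call would be
# consistent with the per-element allocations reported above.
@code_warntype Base._elementwise(+, Float64, x, y)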

@stevengj added the performance (Must go faster) and regression (Regression in behavior compared to a previous version) labels on Aug 3, 2016
@stevengj added this to the 0.5.0 milestone on Aug 3, 2016
@ufechner7

I cannot reproduce this problem when comparing Version 0.4.5 (2016-03-18 00:58 UTC)
and Version 0.5.0-rc0+0 (2016-07-26 20:22 UTC) on Linux (Ubuntu 16.04).
Which versions did you compare?

@yuyichao
Contributor

yuyichao commented Aug 3, 2016

0.5.0-rc0+174

@pabloferz
Contributor

Adding the type parameter back seems to fix it. I was trying to take some type parameters out, but I took out too many.

function Base._elementwise{T}(op, ::Type{T}, A::AbstractArray, B::AbstractArray)
    F = similar(A, T, promote_shape(A, B))
    for (iF, iA, iB) in zip(eachindex(F), eachindex(A), eachindex(B))
        @inbounds F[iF] = op(A[iA], B[iB])
    end
    return F
end
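
A quick check (a sketch, not part of the eventual PR): with the method redefined to carry the {T} parameter, re-running the timing from the original report should no longer show per-element allocations.

x = rand(10^7); y = rand(10^7);
@time x + y;  # expect a handful of allocations (the result array), not ~20M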

@JeffBezanson
Member

Yes, the Type{T} ones tend to be important, since we try to avoid specializing on every type argument.

@tkelman
Contributor

tkelman commented Aug 3, 2016

yikes, we really should have run nanosoldier there before merging. whoops.

@pabloferz
Contributor

#17798

@eschnett
Contributor

eschnett commented Aug 4, 2016

@JeffBezanson Is there documentation / a rough design document / a blog entry / a certain set of functions that one could look at to get a feeling for the rules that govern specialization?

@tkelman closed this as completed in e68bc89 on Aug 4, 2016
@tkelman
Contributor

tkelman commented Aug 4, 2016

While #17798 helped, it didn't fix all of it. https://github.com/JuliaCI/BaseBenchmarkReports/blob/f42bed6fb5e9d16970da9b58cf24755de6dc7d0f/daily_2016_8_4/report.md I think what I'm going to do is revert #17389 on the release-0.5 branch for rc1.

@tkelman reopened this on Aug 4, 2016
@pabloferz
Contributor

pabloferz commented Aug 4, 2016

I think that, for the rest of the problems, another type parameter (again) does the trick:

function promote_op{S}(f, ::Type{S})
    T = _promote_op(f, _default_type(S))
    return isleaftype(S) ? T : typejoin(S, T)
end
function promote_op{R,S}(f, ::Type{R}, ::Type{S})
    T = _promote_op(f, _default_type(R), _default_type(S))
    isleaftype(R) && return isleaftype(S) ? T : typejoin(S, T)
    return isleaftype(S) ? typejoin(R, T) : typejoin(R, S, T)
end

(currently these two do not have type parameters).
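
For reference, a rough illustration of what these are asked to compute (expected results, not verified output; isleaftype checks for a concrete type, and typejoin falls back to the narrowest common supertype when an input is abstract):

Base.promote_op(+, Float64, Int)      # typically Float64, since both inputs are concrete
Base.promote_op(+, Float64, Integer)  # Integer is abstract, so the result is widened via typejoin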

So, echoing @eschnett, a guideline here would be helpful (maybe even worth a place in the performance tips).

@KristofferC
Member

My mental model has been that function arguments are only for dispatch and have nothing to do with performance (except for ANY). That is apparently wrong, so I would also be interested in that.

@pabloferz
Contributor

As far as I can see, they also seem to matter when dispatching on Type{T}s; for the rest, it seems that your mental model (which is the one endorsed) still works.

@KristofferC
Member

Alright, good to know. Thanks.

@stevengj
Member Author

stevengj commented Aug 4, 2016

@pabloferz, my understanding is that if you write a function f(T::Type), then T is just a value that is determined at runtime (the same version of f is compiled for all values of T), whereas if you write f{T}(::Type{T}) then T is part of the type signature of the function — hence it is known at compile time and a specialized version of f is compiled for every T.
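
A minimal sketch of the two forms (hypothetical function names, Julia 0.5 syntax):

# T is an ordinary runtime value here; the same compiled version of the
# method is shared across all T, as described above.
coerce_runtime(T::Type, x) = convert(T, x)

# T is a static parameter, i.e. part of the method's type signature, so it
# is known at compile time and a specialized version is compiled per T.
coerce_static{T}(::Type{T}, x) = convert(T, x)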

@pabloferz
Contributor

pabloferz commented Aug 4, 2016

@stevengj That was my understanding too, but I forgot it for a moment while writing the changes in #17389.

Now I believe that a bad interaction between the object_id hashing, the cached typed changes, and using inference to try to find the return type (plus some missing static type parameters) is causing most of these problems. But I'm not actually sure.

@JeffBezanson
Member

@pabloferz Could we have a PR with the more complete fix?

@pabloferz
Contributor

I can work on that, but I won't be able to get to it until Tuesday.

@tkelman
Contributor

tkelman commented Aug 5, 2016

in that case we'll probably have to put rc2 out without reinstating #17389

@JeffBezanson
Member

IIUC, this performance regression no longer exists on the release branch, so this is not blocking anymore.

@JeffBezanson modified the milestones: 0.5.x, 0.5.0 on Aug 5, 2016
@tkelman
Contributor

tkelman commented Aug 12, 2016

I think this is fixed by #17929. Reopen or leave a comment if you think otherwise.

@tkelman closed this as completed on Aug 12, 2016
@stevengj
Member Author

LGTM.
