Add fast path for merge with Dicts #22737

garborg · 2017-07-10T12:01:05Z

A hair slower if length(Dict) == 1, faster generally.
Closes #22519

KristofferC · 2017-07-10T12:17:11Z

Any benchmarks to share?

garborg · 2017-07-10T13:53:43Z

Can do when I get home tonight. The quick benchmark in #22519 shows best-case improvements.
With respect to the first argument, it's marginally faster for dicts with 3 elements and over an order of magnitude faster for dicts with 1000 elements. The speedup is dampened when copy rehashes the dict argument, but then I guess you get that cleanup for free.

Methodology-wise, I saw no slowdown vs a specialized method restricted to known, shared key and value types.
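
To make the comparison concrete, here is a minimal sketch of the two strategies, assuming every argument shares the same key and value types. The names `merge_from_empty` and `merge_from_copy` are hypothetical and are not the code in this PR; they only illustrate starting from an empty Dict versus starting from a copy of the first argument.

```julia
# Hypothetical helpers for illustration only; not the PR's implementation.

# Baseline: allocate an empty Dict and insert every pair from every argument.
function merge_from_empty(d::Dict{K,V}, others::Dict{K,V}...) where {K,V}
    out = Dict{K,V}()
    for o in (d, others...)
        for (k, v) in o
            out[k] = v
        end
    end
    return out
end

# Candidate fast path: start from copy(d), then insert only the pairs
# from the remaining arguments.
function merge_from_copy(d::Dict{K,V}, others::Dict{K,V}...) where {K,V}
    out = copy(d)
    for o in others
        merge!(out, o)
    end
    return out
end

a = Dict(i => 2i for i in 1:1000)
b = Dict(1 => 0, 2000 => 1)
```

Both functions should return equal results and leave `a` untouched; any difference in speed would come from how the first argument's contents get into the output.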

nalimilan · 2017-07-10T16:55:30Z

There's already an emptymergedict function. Couldn't you add a specialized method for it?

EDIT: maybe giving it another name, since it seems it's faster to make a copy than to allocate a new dict, but you could keep the existing mechanism.

garborg · 2017-07-10T19:33:19Z

Exactly, emptymergedict seemed like a misnomer. Is what you're suggesting, more or less, to make both of them mergedict, with an empty or fill keyword argument?

nalimilan · 2017-07-11T07:59:10Z

Actually I think I'd suggest merging emptymergedict and your mergedict into a single function (maybe with a different name, since neither of them seems very explicit about what it does), without a keyword argument. Indeed it seems emptymergedict could have the same behavior as mergedict, i.e. return a new dict of the right type with the contents of the first Associative argument.

garborg · 2017-07-13T01:56:09Z

Good call. Once I went to combine helper functions, I realized all three previous paths were covered well by either Dict{K,V}(d) or associative_with_eltype.
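
As a rough illustration of the Dict{K,V}(d) path, the sketch below widens the key and value types across all arguments and then starts from a converted copy of the first Dict. The function name `merge_promoted` is hypothetical, and picking K and V with `promote_type` is an assumption made for this sketch, not a statement of what the PR's code does.

```julia
# Hypothetical helper for illustration only; not the PR's implementation.
function merge_promoted(d::Dict, others::Dict...)
    # Assumption: widen key and value types over all arguments.
    K = promote_type(keytype(d), map(keytype, others)...)
    V = promote_type(valtype(d), map(valtype, others)...)
    # Build a Dict of the widened type holding the contents of d.
    out = Dict{K,V}(d)
    for o in others
        merge!(out, o)
    end
    return out
end

r = merge_promoted(Dict(1 => 1), Dict(2 => 2.5))
```

Here `r` ends up with `Float64` values because the second argument forces the value type to widen.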

The timings confused me while benchmarking, though. I tried a few variations before I realized the noise was just a matter of whether or not BenchmarkTools inlined the merge function being tested. The functions not being inlined weren't necessarily the ones I would have guessed. Timings look as expected after forcing inlining.

I assume that's just an artifact of the benchmarking and I don't want to force merge to be inlined.

timings w/ & w/o inlining here.

tkelman · 2017-07-13T05:22:03Z

That looks pretty simple. Dunno if any existing benchmarks would cover this, if not might be worth adding some new ones to BaseBenchmarks to track it. @nanosoldier runbenchmarks(ALL, vs=":master")

nanosoldier · 2017-07-13T08:19:43Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @ararslan

garborg · 2017-07-13T11:38:15Z

I'd be happy to add a benchmark or two, but I'm still a little confused about the interaction between BenchmarkTools and inlining from that benchmarking script: i.e. I needed to force inlining in benchmarking to see the comparison I expected, and I didn't think merge was something you'd force inlining on.

Sacha0

superficially lgtm :).

tkelman · 2017-07-18T20:46:30Z

we should probably rename the existing benchmark label to needs-benchmark rather than adding a new near-duplicate one, since that's most of what it's been used for by jrevels and myself ever since it was first added

StefanKarpinski · 2017-07-18T22:31:53Z

It was unclear to me that this was what the benchmark label was for.

tkelman · 2017-07-18T22:53:16Z

#13893 (comment)

StefanKarpinski · 2017-07-18T23:49:39Z

Ah, yes, how could I have missed that one-line comment from two years ago? I've renamed the label to make its meaning clear without ancient comment spelunking.

nalimilan

Looks good to me too. I wish we could find an even more explicit name, but without using a full sentence that doesn't sound possible. :-)

KristofferC · 2017-07-31T12:01:25Z

> I needed to force inlining in benchmarking to see the comparison I expected, and I didn't think merge was something you'd force inlining on.

Here, merge just hands over to merge!, where a bunch of splatting occurs, so it is possible that inlining avoids the splatting penalty. If the benchmarks are better with forced inlining, I would keep it.
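
A minimal sketch of the shape being discussed: a thin wrapper that forwards a splatted vararg tuple to merge!, once with an inlining hint and once with inlining blocked. The wrapper names are hypothetical, and whether inlining actually removes any splatting cost depends on the Julia version and the call site, so treat this as a setup for comparison rather than a demonstrated result.

```julia
# Hypothetical wrappers for illustration only; not Base code.
# Both forward their trailing arguments to merge! via splatting.
@inline   merge_inline(d::Dict, others::Dict...)   = merge!(copy(d), others...)
@noinline merge_noinline(d::Dict, others::Dict...) = merge!(copy(d), others...)

x = Dict("a" => 1)
y = Dict("b" => 2)
z = Dict("a" => 3)
```

The two wrappers compute the same result; only the inlining annotation differs, which is the variable a benchmark would isolate.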

KristofferC · 2017-10-01T11:48:58Z

I don't see any difference between explicit inlining or not anymore. Will rerun CI and merge if green.

Allow faster merge for Dicts

e995fc3

garborg force-pushed the dictmerge branch from 28b7782 to e995fc3 July 13, 2017 01:41

Sacha0 approved these changes Jul 14, 2017

StefanKarpinski added the needs benchmark label Jul 18, 2017

StefanKarpinski added the potential benchmark label (Could make a good benchmark in BaseBenchmarks) and removed the needs benchmark label Jul 18, 2017

nalimilan approved these changes Jul 19, 2017

KristofferC closed this Oct 1, 2017

KristofferC reopened this Oct 1, 2017

KristofferC merged commit e88b264 into JuliaLang:master Oct 1, 2017
