O(n^2) scaling in llvm symbol lookup #15619

yuyichao · 2016-03-24T22:50:05Z

While fiddling #14846 with gdb folks I noticed that there's a O(n^2) scaling in our own JIT too.

The test is done on patched llvm 3.7.1 with this script which is basically emitting a lot of small functions with different names by,

for i in 1:n
    f = symbol("f$i")
    @eval $f() = $i
    @eval $f()
end

Ploting the time vs n on a log-log scale it's very clear that the O(n^2) scaling takes over for n > 10k.

The profile (done with perf since I was profiling gdb with the same command) with n = 50000 is,

and it seems that most of the time is spent in the symbol lookup (some of the other functions are also quite significant but the symbol lookup seems to be the worst).

@vtjnash and @carnaval suggested that llvm has a O(n) object file look up and we might need to replace it with our own hash table in jitlayer.cpp.

Also c.c. @Keno

The text was updated successfully, but these errors were encountered:

timholy · 2016-03-24T23:37:41Z

Nice detective work as always, @yuyichao.

vtjnash · 2016-03-25T02:17:57Z

it's probably worthwhile to memoize the result of findSymbol in a StringMap<orc::JITSymbol> (at https://github.com/JuliaLang/julia/blob/master/src/jitlayers.cpp#L398), since we can end up jitting such an insane number of small object files and this call is linear in the number of object files that have been added

vtjnash · 2016-03-26T01:57:53Z

i've created a version of yuyichao's script that also emits a plot and shows my fix will be O(1): https://gist.github.com/vtjnash/89cff0605d29fe968e56 (with total compile time const on this benchmark roughly 2x the old jit, but at least with constant scaling now)

CompileLayer.findSymbol is O(n) in the number of modules that have been emitted but we can pre-compute the result of findSymbolIn when notified that an object has been emitted and store it in one hash table for the JIT fix #15619

yuyichao · 2016-04-02T23:53:27Z

Reported upstream https://llvm.org/bugs/show_bug.cgi?id=27188

yuyichao added the performance Must go faster label Mar 24, 2016

JeffBezanson added the compiler:codegen Generation of LLVM IR and native code label Mar 24, 2016

jrevels added the potential benchmark Could make a good benchmark in BaseBenchmarks label Mar 24, 2016

StefanKarpinski closed this as completed in 3f4ba52 Mar 28, 2016

KristofferC removed the potential benchmark Could make a good benchmark in BaseBenchmarks label Oct 31, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

O(n^2) scaling in llvm symbol lookup #15619

O(n^2) scaling in llvm symbol lookup #15619

yuyichao commented Mar 24, 2016

timholy commented Mar 24, 2016

vtjnash commented Mar 25, 2016

vtjnash commented Mar 26, 2016

yuyichao commented Apr 2, 2016

O(n^2) scaling in llvm symbol lookup #15619

O(n^2) scaling in llvm symbol lookup #15619

Comments

yuyichao commented Mar 24, 2016

timholy commented Mar 24, 2016

vtjnash commented Mar 25, 2016

vtjnash commented Mar 26, 2016

yuyichao commented Apr 2, 2016