leaf-types cache for ml-matches #36166

vtjnash · 2020-06-05T19:32:15Z

When a method lookup returns a single result, caching that information is easy, since we essentially already have the data structure for it (TypeMapEntry). Putting this information into a hash table instead of a tree (which is still used to handle the general case) lets us shave a bit of time off micro-benchmarks:

julia> @btime methods(+, (Int, Int))
master:  3.322 μs (21 allocations: 912 bytes)
PR:      2.069 μs (18 allocations: 784 bytes)

julia> @btime Base._methods_by_ftype(Tuple{typeof(+), Int, Int}, -1, typemax(UInt64))
master:  1.496 μs (7 allocations: 464 bytes)
PR:      0.175 μs (4 allocations: 336 bytes)

Since we are doing a subtype search (like at the top of the function), the resulting object from the search does not need to be the same.

This lets us put more objects in here without incurring additional search code (just the initial cost of computing the hash for the tuple type lookup computation).
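For readers unfamiliar with the idea, here is a minimal sketch of a hash-keyed cache for fully concrete argument tuples. It is not the code in src/gf.c: `tuple_key_t`, `leaf_cache_t`, the integer type ids, the fixed capacity, and the FNV-style hash are all assumptions made for illustration. The point it shows is that a hit costs one hash computation plus a short probe, with no tree walk.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define LEAF_CACHE_SLOTS 64 /* power of two; hypothetical fixed capacity */
#define MAX_ARGS 4

/* Hypothetical stand-in for a dispatch tuple such as Tuple{typeof(+), Int, Int}:
 * every concrete type is represented by a small integer id. */
typedef struct {
    size_t nargs;
    uint32_t type_ids[MAX_ARGS];
} tuple_key_t;

typedef struct {
    int used;
    tuple_key_t key;
    int method_id; /* the single cached lookup result */
} leaf_slot_t;

typedef struct {
    leaf_slot_t slots[LEAF_CACHE_SLOTS];
    size_t count;
} leaf_cache_t;

/* FNV-style mixing over the type ids: the up-front hashing cost of a lookup. */
static uint32_t hash_key(const tuple_key_t *k)
{
    uint32_t h = 2166136261u;
    for (size_t i = 0; i < k->nargs; i++) {
        h ^= k->type_ids[i];
        h *= 16777619u;
    }
    return h;
}

static int key_equal(const tuple_key_t *a, const tuple_key_t *b)
{
    return a->nargs == b->nargs &&
           memcmp(a->type_ids, b->type_ids, a->nargs * sizeof(uint32_t)) == 0;
}

/* Insert with linear probing. Returns 0 when the table is considered full;
 * one slot is always left empty so that probing terminates. */
static int leaf_cache_insert(leaf_cache_t *c, const tuple_key_t *k, int method_id)
{
    assert(k->nargs <= MAX_ARGS);
    if (c->count >= LEAF_CACHE_SLOTS - 1)
        return 0;
    uint32_t i = hash_key(k) & (LEAF_CACHE_SLOTS - 1);
    while (c->slots[i].used && !key_equal(&c->slots[i].key, k))
        i = (i + 1) & (LEAF_CACHE_SLOTS - 1);
    if (!c->slots[i].used)
        c->count++;
    c->slots[i].used = 1;
    c->slots[i].key = *k;
    c->slots[i].method_id = method_id;
    return 1;
}

/* Returns the cached method id, or -1 on a miss. */
static int leaf_cache_lookup(const leaf_cache_t *c, const tuple_key_t *k)
{
    uint32_t i = hash_key(k) & (LEAF_CACHE_SLOTS - 1);
    while (c->slots[i].used) {
        if (key_equal(&c->slots[i].key, k))
            return c->slots[i].method_id;
        i = (i + 1) & (LEAF_CACHE_SLOTS - 1);
    }
    return -1;
}
```

A caller would try `leaf_cache_lookup` first and fall back to the general (tree) search on a miss, inserting the result afterwards when that search returned exactly one match.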

Keno · 2020-06-08T22:32:56Z

test/channels.jl

@@ -390,6 +390,7 @@ end
        t = Timer(0) do t
            tc[] += 1
        end
+        Libc.systemsleep(0.005)


Is there something better we can do here? Tests that depend on system scheduling behavior almost always turn out flaky.

Without this, the test depends on the scheduling: it assumes that this statement takes at least a millisecond to complete. This completely fixes the test, avoiding that flakiness by ensuring the statement always takes at least that long.
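The same padding idea, sketched in C with POSIX calls rather than the Julia test itself: block the calling thread for a fixed interval so that the elapsed time has a guaranteed lower bound. The helper names (`now_ns`, `block_at_least_ms`, `padded_elapsed_ns`) are hypothetical, and the "statement under test" is only a placeholder comment.

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <errno.h>
#include <time.h>

/* Monotonic clock reading in nanoseconds. */
static long long now_ns(void)
{
    struct timespec ts;
    int rc = clock_gettime(CLOCK_MONOTONIC, &ts);
    assert(rc == 0);
    (void)rc;
    return (long long)ts.tv_sec * 1000000000LL + ts.tv_nsec;
}

/* Block the calling thread for at least `ms` milliseconds,
 * resuming the sleep if a signal interrupts it. */
static void block_at_least_ms(long ms)
{
    struct timespec req = { ms / 1000, (ms % 1000) * 1000000L };
    struct timespec rem;
    while (nanosleep(&req, &rem) == -1 && errno == EINTR)
        req = rem; /* continue with the remaining time */
}

/* Elapsed nanoseconds across a "statement" padded with a blocking sleep. */
static long long padded_elapsed_ns(long pad_ms)
{
    long long start = now_ns();
    /* ... the statement under test would run here ... */
    block_at_least_ms(pad_ms);
    return now_ns() - start;
}
```

With a 5 ms pad, `padded_elapsed_ns(5)` cannot come in under the 1 ms the test assumes, no matter how quickly the placeholder statement runs.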

Keno · 2020-06-08T22:40:14Z

src/gf.c

+            return env.t;
+        }
+    }
+    if (((jl_datatype_t*)unw)->isdispatchtuple) {


Isn't this the same condition as the previous if block?

Textually, yes, in this PR right now. But conceptually, they don't share a common root, so I've listed them separately. That way, if someone alters one, it won't affect the other.

Alright, you're the methodcache czar, it just looked odd as is. Maybe there should be helpers that make clear in which sense this is used, even if the implementation of these helpers is the same at the moment?

True, I'll keep that in mind. It won't be the last time (even this month) that I edit the code, haha.
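A rough sketch of what such helpers could look like. Both names and the two "senses" they document are guesses for illustration, not taken from the PR, and `mock_datatype_t` is a stand-in for the one field of `jl_datatype_t` involved:

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-in: only the flag tested in the diff. */
typedef struct {
    bool isdispatchtuple;
} mock_datatype_t;

/* Sense 1 (assumed): a type like this may have an entry in the leaf cache. */
static inline bool may_be_in_leaf_cache(const mock_datatype_t *t)
{
    return t->isdispatchtuple;
}

/* Sense 2 (assumed): checking the general cache is likely worthwhile.
 * Same implementation today, but free to diverge later. */
static inline bool worth_checking_full_cache(const mock_datatype_t *t)
{
    return t->isdispatchtuple;
}
```

Each `if` would then call the helper that names its intent, so changing one heuristic later does not silently change the other.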

vtjnash · 2020-06-15T22:44:35Z

I'd like to plan to merge this tomorrow (after #36260). Please let me know if review is incomplete.

JeffBezanson · 2020-06-19T03:24:26Z

Looks fine to me. No reason to block merging, but I have a couple thoughts.

I wonder if we can take this further to memoize more of abstract_call_gf_by_type --- i.e. avoid the "double lookup" of ml_matches followed by typeinf_edge.

JeffBezanson · 2020-06-19T03:25:21Z

src/gf.c

        JL_TIMING(METHOD_LOOKUP_FAST);
        mt = jl_gf_mtable(F);
        entry = jl_typemap_assoc_exact(mt->cache, F, args, nargs, jl_cachearg_offset(mt), world);
+        if (entry == NULL) {


I suspect we'll lose a bit of performance from needing two lookups here sometimes. Would be nice to be able to combine the tables somehow.

Combining them would likely hurt both performance and memory usage (I had that situation in an intermediate state of this PR, before I finished separating the tables). What we're doing here is expecting that one of these two tables is most likely empty (functions usually either get specialized on leaf types entirely or they don't), and so in most cases we're just going to bypass the previous table without even attempting a lookup.
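To make the "skip the empty table" argument concrete, here is a toy sketch. `small_table_t`, its linear search, and the `probe_count` counter are all hypothetical and stand in for the real caches only loosely; the point is just that an empty table costs a size check, not a lookup.

```c
#include <assert.h>
#include <stddef.h>

#define TABLE_CAP 8

/* Hypothetical toy table: parallel arrays searched linearly. */
typedef struct {
    size_t count;
    int keys[TABLE_CAP];
    int values[TABLE_CAP];
} small_table_t;

static int probe_count = 0; /* number of real lookups performed */

static void table_add(small_table_t *t, int key, int value)
{
    assert(t->count < TABLE_CAP);
    t->keys[t->count] = key;
    t->values[t->count] = value;
    t->count++;
}

/* Returns the stored value, or -1 on a miss. */
static int table_lookup(const small_table_t *t, int key)
{
    probe_count++;
    for (size_t i = 0; i < t->count; i++)
        if (t->keys[i] == key)
            return t->values[i];
    return -1;
}

/* Consult the general table first, then the leaf table,
 * but never probe a table that is known to be empty. */
static int lookup_two_tables(const small_table_t *general,
                             const small_table_t *leaf, int key)
{
    int result = -1;
    if (general->count != 0)
        result = table_lookup(general, key);
    if (result == -1 && leaf->count != 0)
        result = table_lookup(leaf, key);
    return result;
}
```

When only the leaf table is populated, each call performs a single probe, so the second table adds no lookup cost in the common case described above.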

JeffBezanson · 2020-06-19T03:47:12Z

src/gf.c

+                env.match.ti = mi->specTypes;
+            }
+            else {
+                // TODO: should we use jl_subtype_env instead (since we know that `type <: meth->sig` by transitivity)


👍, plus we can skip this entirely if there are no static params.

Oh, wow, yeah, not sure how I missed that, as it seems so obvious. I'm also thinking of removing this value entirely from the results (since often it should be available from the later typeinf_edge cache lookup)

vtjnash added 6 commits June 5, 2020 15:20

IdDict: handle size zero inputs gracefully

9fbd135

fix bad tests

40bb06d

improve code quality

166031c

remove incorrect assertion

f4a983d

Since we are doing a subtype search (like at the top of the function), the resulting object from the search does not need to be the same.

For dispatch, move from using a tree to a hash lookup of leaf types

be77eb8

This lets us put more objects in here without incurring additional search code (just the initial cost of computing the hash for the tuple type lookup computation).

avoid recomputing ml-matches worlds during method-caching [NFCI]

6c54523

vtjnash requested a review from JeffBezanson June 5, 2020 19:32

Keno reviewed Jun 8, 2020

vtjnash merged commit 92197a7 into master Jun 19, 2020

vtjnash deleted the jn/ml-matches-leaf-cache branch June 19, 2020 01:32

JeffBezanson reviewed Jun 19, 2020

This was referenced Jun 24, 2020

fixes for "leaf-types cache for ml-matches" #36413

Closed

gf: improve ordering of operations based on performance estimates #36436

Merged

JeffBezanson added the compiler:latency Compiler latency label Jul 30, 2020

NHDaly mentioned this pull request Dec 20, 2020

The new stacktrace printing is missing "repeated N times" info for some StackOverflows #37587

Closed
