Micro allocations optimisation in ArC #25835

Sanne · 2022-05-27T17:03:55Z

I've omitted implementing some of the methods in the new Map implementation:
in part because it seemed unneccessary, but in part also because I feel it would be safer to not expose such operations.

independent-projects/arc/runtime/src/main/java/io/quarkus/arc/impl/ContextDataMap.java

manovotn

@Sanne how often does Arc initialize the lazy map we've had before?
It should only happen if the interceptor accesses its bindings - is there some interceptor forcibly doing that for majority of cases?
I guess I am just trying to get the hang of why the previous lazy init was inefficient.

I understand you want to avoid init even in the case of bindings but I don't like that some methods are left out. Technically, that's against specification.

independent-projects/arc/runtime/src/main/java/io/quarkus/arc/impl/ContextDataMap.java

Ladicek · 2022-05-30T09:03:03Z

independent-projects/arc/runtime/src/main/java/io/quarkus/arc/impl/ContextDataMap.java

+    @Override
+    public void putAll(Map m) {
+        if (m.containsKey(ArcInvocationContext.KEY_INTERCEPTOR_BINDINGS)) {
+            throw new IllegalArgumentException("Not allowed to put key '" + ArcInvocationContext.KEY_INTERCEPTOR_BINDINGS);


Nit: missing + "'" at the end, here and on 2 more places below.

👍 thanks!

Ladicek · 2022-05-30T09:07:58Z

Disallowing calling some methods (and/or with some parameters) on the context data map is an interesting idea, as it protects interceptors from stepping on each others toes too much. Too bad the specification doesn't really say much about how this map should work. What's in this PR seems reasonable to me.

manovotn · 2022-05-30T09:26:44Z

Disallowing calling some methods (and/or with some parameters) on the context data map is an interesting idea, as it protects interceptors from stepping on each others toes too much. Too bad the specification doesn't really say much about how this map should work. What's in this PR seems reasonable to me.

Note that the specification says:

The InvocationContext instance allows an interceptor method to save information in the Map obtained via the getContextData method. This information can subsequently be retrieved and/or updated by other interceptor methods in the invocation chain, and thus serves as a means to pass contextual data between interceptors.

Hence it actually allows them to step on each others toes to some extent.

Ladicek · 2022-05-30T09:32:26Z

Right, I mean, sure, the entire purpose of the context data map is for 2 interceptors to be able to exchange information. But they need to be aware of each other, which I assume typically means there's a fixed set of keys the 2 interceptors know and use. But if an interceptor tries to access the whole key set / value collection of the context data map, that is suspicious.

manovotn · 2022-05-30T09:35:18Z

Yes, but the spec doesn't limit that either.
It it true however that I have only seen values/keySet used in debugging (where it is pretty handy to inspect all of the data inside)

Ladicek · 2022-05-30T10:09:29Z

I understand and agree that the spec doesn't restrict the usage of the context data map, so there's a good argument to not restrict it on our side either. At the same time, I believe there's also a good argument to be made to restrict it, and the present PR does a decent job in my opinion. Obviously, Hyrum's law dictates that we shouldn't really change it now :-)

Sanne · 2022-05-30T10:26:11Z

@Sanne how often does Arc initialize the lazy map we've had before? It should only happen if the interceptor accesses its bindings - is there some interceptor forcibly doing that for majority of cases? I guess I am just trying to get the hang of why the previous lazy init was inefficient.

Right, sorry I sent the draft in a rush and was supposed to give some better explanations after finishing tests.

We were running performance tests again, and noticed a recurring pattern in interceptors; take SmallRyeCurrentThreadContextInterceptor for example - which is applied extensively by default in all reactive applications;

The first lines of code start with:

    @AroundInvoke
    public Object manageCurrentContext(InvocationContext ctx) throws Exception {
        CurrentThreadContext config = null;
        Object binding = ctx.getContextData().get("io.quarkus.arc.interceptorBindings");

First thing it does is to try reading from the getContextData, which makes the fact that the Map for it is being "lazily allocated" today a bit pointless as the initialization is triggered for each read operation as well; not to mention that looking up the interceptor bindings is also very likely. So technically the use of the LazyValue just adds processing overhead and an additional object (the LazyValue instance) over a plain Map, when there's high likelyhood of such read operations to be triggered.

The purpose of this PR is to keep us from initializing the Map if all what's needed is read operations.

Also: since the pattern I see in SmallRyeCurrentThreadContextInterceptor seems rather reasonable, I expect similar things to be performed by many other interceptors. I think it's plausible to assume that many such interceptors will mainly require read operations - which we can optimise for.

Also: this optimisation proposal doesn't have performance drawbacks for the write cases either - it just replaces the LazyValue holder with the custom Map - and the custom Map implementation I'm suggesting here has the same memory impact as the LazyValue approach.
It actually might improve on the current state even when write operations are triggered: the map is smaller.

So I believe it provides a win-win situation, and optimised for the more realistic scenario of getContextData being actually triggered in a non-trivial application.

I understand you want to avoid init even in the case of bindings but I don't like that some methods are left out. Technically, that's against specification.

Right - we can certainly implement the additional methods as well, but while looking into it I had a bad feeling about allowing that - for the same points @Ladicek made.

Please think about it and let me know the vertict - I can certainly implement the missing capabilities so to only focus on the performance optimisation at this stage, but I'm inclined to think this approach is safer.

I understand the purpose is for interceptors to "share" some state but I hope - for sake of our sanity - that even among spec writers there's an implied assumption that only one interceptor is the "owner" of a particular key, so while allowing others to read it, it shouldn't be allowed for a different one to write / replace / delete on a key space they don't own.
N.B. this patch still allows 3rd party interceptors to mess with each other's state - so we're not even breaking this assumption in case others have a different opinion about design: we're only hardening access to our own platform's interceptorBindings.

Ladicek · 2022-05-30T10:59:33Z

I think most interceptors actually don't use the context data map at all, so the lazy allocation isn't as inefficient as you think. Unfortunately the "current" interceptor binding annotations are not exposed in a better way -- but in the CDI community, we're talking about exposing the interceptor binding annotations directly from the InvocationContext, so that should help in the future. In the meantime, treating that one key in a special way makes a lot of sense to me.

manovotn · 2022-05-30T11:02:02Z

Thanks for more context Sanne, few more notes:

We were running performance tests again, and noticed a recurring pattern in interceptors; take SmallRyeCurrentThreadContextInterceptor for example - which is applied extensively by default in all reactive applications;

Where does this usage come from? I mean, which project uses the interceptor in question? grep-ing in Quarkus gave me no result and I don't see this in mutiny either yet from your description it sounds like this interceptor is a primary method through which we implement CP in reactive scenarios?

Please think about it and let me know the vertict - I can certainly implement the missing capabilities so to only focus on the performance optimisation at this stage, but I'm inclined to think this approach is safer.

I won't hard press you for that; looks like I am the only one thinking they should be there :)
What we should add though is a javadoc to the map impl class explaining why we do what we do.

manovotn · 2022-05-30T11:12:31Z

I think most interceptors actually don't use the context data map at all, so the lazy allocation isn't as inefficient as you think.

True, most user-defined interceptors won't care much, this is more used by other frameworks that need to introspect values in their respective bindings. For instance JTA (Narayana) was using this particular feature (if memory serves) which basically means any operations with transactions will trigger it as well.

Unfortunately the "current" interceptor binding annotations are not exposed in a better way -- but in the CDI community, we're talking about exposing the interceptor binding annotations directly from the InvocationContext, so that should help in the future.
In the meantime, treating that one key in a special way makes a lot of sense to me.

+1, I agree it makes sense to special-case this scenario.

Sanne · 2022-05-30T11:19:02Z

Thanks for all suggestions :) I've pushed a revised version - same basic principles but:

documented the rationale
implemented the missing methods (althouh not allowing to mutate the interceptor bindings specifically)

Sanne · 2022-05-30T11:23:44Z

Thanks for more context Sanne, few more notes:

We were running performance tests again, and noticed a recurring pattern in interceptors; take SmallRyeCurrentThreadContextInterceptor for example - which is applied extensively by default in all reactive applications;

Where does this usage come from? I mean, which project uses the interceptor in question? grep-ing in Quarkus gave me no result and I don't see this in mutiny either yet from your description it sounds like this interceptor is a primary method through which we implement CP in reactive scenarios?

Right, this interceptor from CP is pulled in and activated by default by many Quarkus extensions by default. It's very likely that any application that has any reactive dependencies will have this interceptor on many endpoints, even when users didn't intentionally opt-in or are aware of this. So this is having wide impact.

franz1981

LGTM!

franz1981 · 2022-05-30T11:54:58Z

independent-projects/arc/runtime/src/main/java/io/quarkus/arc/impl/ContextDataMap.java

+
+    @Override
+    public Set<Map.Entry<String, Object>> entrySet() {
+        final AbstractMap.SimpleImmutableEntry<String, Object> firstEntry = new AbstractMap.SimpleImmutableEntry<>(


This can be lazily allocated (and reused) as well, given that users cannot modify it?

yes it could - but I don't think it's worth it: hopefully nobody is using entrySet heavily on this map.

manovotn

Looks good

Ladicek · 2022-05-30T13:09:19Z

Technically if we really want the context data map to be a full map, then entrySet(), keySet() and values() should be write-through views on the original map :-) Also clear() could possibly just set the delegate to null, but I realize I'm just nit-picking here. Ignoring that, it looks fine.

Sanne · 2022-05-30T13:20:18Z

Technically if we really want the context data map to be a full map, then entrySet(), keySet() and values() should be write-through views on the original map :-)

oh that would certainly be simpler. But are you saying that the interceptors entry doesn't need to be listed? And what about the size() and isEmpty() methods then? I don't think we can ignore the "virtual" entry from the interceptors, but if we do these should probably be consistent.

Also clear() could possibly just set the delegate to null, but I realize I'm just nit-picking here. Ignoring that, it looks fine.

I could do that but it depends on the previous doubt - I think how we treat the "interceptors entry" needs to be consistent.

Sanne · 2022-05-30T13:22:05Z

ah sorry you were referring to fact one can modify the keyset and see changes reflected in the Map. Right I ignored that.

mkouba

I like the idea in general but speaking of optimizations and SmallRyeCurrentThreadContextInterceptor it might be more effective to provide a quarkus-specific version of this interceptor that would simply use io.quarkus.arc.ArcInvocationContext#getInterceptorBindings() instead.

mkouba · 2022-05-31T07:28:41Z

independent-projects/arc/runtime/src/main/java/io/quarkus/arc/impl/ContextDataMap.java

+    }
+
+    @Override
+    public Collection values() {


Is there a reason to use so many raw types in this class? I get a lot of useless warnings in my IDE ;-)

quarkus-bot bot added area/arc Issue related to ARC (dependency injection) area/hibernate-orm Hibernate ORM area/hibernate-reactive Hibernate Reactive area/persistence OBSOLETE, DO NOT USE labels May 27, 2022

Sanne force-pushed the ArcOpt branch 2 times, most recently from 76f2e36 to 6a061e5 Compare May 27, 2022 17:08

Sanne removed area/persistence OBSOLETE, DO NOT USE area/hibernate-orm Hibernate ORM area/hibernate-reactive Hibernate Reactive labels May 27, 2022

Sanne requested a review from manovotn May 27, 2022 17:09

manovotn reviewed May 30, 2022

View reviewed changes

independent-projects/arc/runtime/src/main/java/io/quarkus/arc/impl/ContextDataMap.java Show resolved Hide resolved

manovotn reviewed May 30, 2022

View reviewed changes

independent-projects/arc/runtime/src/main/java/io/quarkus/arc/impl/ContextDataMap.java Show resolved Hide resolved

manovotn requested review from Ladicek and mkouba May 30, 2022 08:23

Ladicek reviewed May 30, 2022

View reviewed changes

ArC: memory optimisations for storing of context data map

95f70e8

Sanne force-pushed the ArcOpt branch from 6a061e5 to 95f70e8 Compare May 30, 2022 11:17

Sanne marked this pull request as ready for review May 30, 2022 11:19

franz1981 approved these changes May 30, 2022

View reviewed changes

manovotn approved these changes May 30, 2022

View reviewed changes

Sanne merged commit 15180ec into quarkusio:main May 30, 2022

quarkus-bot bot added this to the 2.10 - main milestone May 30, 2022

Sanne deleted the ArcOpt branch May 30, 2022 16:42

mkouba reviewed May 31, 2022

View reviewed changes

mkouba mentioned this pull request May 31, 2022

ContextDataMap - get rid of raw types #25877

Merged

Micro allocations optimisation in ArC #25835

Micro allocations optimisation in ArC #25835

Uh oh!

Conversation

Sanne commented May 27, 2022

Uh oh!

Uh oh!

manovotn left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Ladicek May 30, 2022

Choose a reason for hiding this comment

Uh oh!

Sanne May 30, 2022

Choose a reason for hiding this comment

Uh oh!

Ladicek commented May 30, 2022

Uh oh!

manovotn commented May 30, 2022

Uh oh!

Ladicek commented May 30, 2022

Uh oh!

manovotn commented May 30, 2022

Uh oh!

Ladicek commented May 30, 2022

Uh oh!

Sanne commented May 30, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Ladicek commented May 30, 2022

Uh oh!

manovotn commented May 30, 2022

Uh oh!

manovotn commented May 30, 2022

Uh oh!

Sanne commented May 30, 2022

Uh oh!

Sanne commented May 30, 2022

Uh oh!

franz1981 left a comment

Choose a reason for hiding this comment

Uh oh!

franz1981 May 30, 2022

Choose a reason for hiding this comment

Uh oh!

Sanne May 30, 2022

Choose a reason for hiding this comment

Uh oh!

manovotn left a comment

Choose a reason for hiding this comment

Uh oh!

Ladicek commented May 30, 2022

Uh oh!

Sanne commented May 30, 2022

Uh oh!

Sanne commented May 30, 2022

Uh oh!

mkouba left a comment

Choose a reason for hiding this comment

Uh oh!

mkouba May 31, 2022

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Sanne commented May 30, 2022 •

edited

Loading