Excess contention in ExecutorService #2118
Comments
@ejona86 suggested using ForkJoinPool. While this is not possible in general, since we are limited to Java 6 APIs, running a local server/client with this does in fact reduce the contention.
And it's unknown how much better ForkJoinPool does when receiving Runnables from threads outside of the pool, but it seems worth a check.
A spot check shows the contention gone, but QPS plummets to half (86kqps -> 42kqps). Run with:
Hmmm, running on a 32-thread machine it does speed up a lot. Maybe there is a threshold.
On how many cores and for how long did this client run? Also, I believe those numbers might be cumulative over all threads. So if, say, 32 threads are trying to add to the queue concurrently and one gets the lock for 100 micros, then that means 3.1 millis of contention.
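The cumulative accounting described above can be made concrete. The assumed model (which is what the comment implies, not something stated by the profiler's docs) is that every waiting thread is charged the full hold time:

```java
// Back-of-the-envelope for cumulative lock contention: one thread holds the
// lock for the full duration, and each of the remaining threads is charged
// roughly that entire hold time while it waits.
public class ContentionMath {
  public static void main(String[] args) {
    int threads = 32;
    double holdMicros = 100;
    // 31 waiters, each blocked for ~100 micros.
    double cumulativeMicros = (threads - 1) * holdMicros;
    System.out.println(cumulativeMicros / 1000 + " ms"); // prints "3.1 ms"
  }
}
```

This is why per-thread contention that looks tiny in isolation can sum to minutes of reported wait time under high thread counts.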
@buchgr The numbers are cumulative. It is a 32-core client talking to a 32-core server. I can't recall if I looked at the client or the server, but since they both use the executor in the same way it doesn't matter which. The contention profiler records how long a thread waits for a lock to become available before acquiring it (so the thread that already holds the lock and releases it will not be recorded). Running last night with FJP showed a 3x perf jump (~460kqps), so this contention matters a lot in high-QPS cases.
It might be worth mentioning that Netty backported the FJP so that it can be used with Java 1.6: https://github.com/netty/netty/blob/4.1/common/src/main/java/io/netty/util/internal/chmv8/ForkJoinPool.java We could check if Netty's FJP is on the classpath and, if so, use it?
@buchgr FJP depends heavily on the number of cores actually available for use. For example, running on a 32-core machine that is under 50% load from other processes, FJP does worse at parallelism level 32 than at 16. Picking the number too high or too low causes painful performance swings, so it would be hard to set as a default. Also, blocking calls are going to make it act poorly. Only Future/async clients (and servers) really benefit from it. It's a good optimization, but only after recognizing it as applicable to the use case.
When profiling a client with 200K active RPCs, there is a point of contention on the Executor. Each RPC gets its own SerializingExecutor, which executes work on an underlying executor. Currently, that executor is a ThreadPoolExecutor in almost all cases, which itself has a BlockingQueue. That queue is heavily contended, showing up as minutes of wasted time:
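For context, the serializing-wrapper pattern described above can be sketched roughly as follows. This is not grpc-java's actual SerializingExecutor, just a minimal illustration of the shape: every RPC's wrapper funnels its drain passes into the one shared delegate, which is why a single contended BlockingQueue sits behind 200K RPCs.

```java
import java.util.concurrent.ConcurrentLinkedQueue;
import java.util.concurrent.Executor;
import java.util.concurrent.atomic.AtomicBoolean;

// Sketch of a per-RPC serializing wrapper: tasks run one at a time, in
// submission order, on a shared delegate executor.
final class SerializingExecutorSketch implements Executor, Runnable {
  private final Executor delegate;
  private final ConcurrentLinkedQueue<Runnable> queue = new ConcurrentLinkedQueue<>();
  private final AtomicBoolean running = new AtomicBoolean();

  SerializingExecutorSketch(Executor delegate) {
    this.delegate = delegate;
  }

  @Override public void execute(Runnable task) {
    queue.add(task);
    // Schedule a drain pass on the delegate unless one is already in flight.
    // This delegate.execute() call is where every RPC's wrapper hits the
    // shared executor's queue.
    if (running.compareAndSet(false, true)) {
      delegate.execute(this);
    }
  }

  @Override public void run() {
    try {
      Runnable task;
      while ((task = queue.poll()) != null) {
        task.run();
      }
    } finally {
      running.set(false);
      // Re-check: a task may have been enqueued after poll() returned null
      // but before the running flag was cleared.
      if (!queue.isEmpty() && running.compareAndSet(false, true)) {
        delegate.execute(this);
      }
    }
  }
}
```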
An idea to fix this is to use some sort of striping executor to prevent this contention from happening.
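One possible shape for that striping idea (names and stripe-selection policy are hypothetical, not a proposed API): partition work across N independently queued executors so that threads contend only on their stripe's queue instead of one shared BlockingQueue.

```java
import java.util.concurrent.Executor;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

// Hypothetical striped executor: each key (e.g. an RPC) hashes to one of
// N single-threaded stripes. Tasks for the same key stay serialized on the
// same stripe; contention is spread across N queues instead of one.
final class StripedExecutorSketch {
  private final ExecutorService[] stripes;

  StripedExecutorSketch(int nStripes) {
    stripes = new ExecutorService[nStripes];
    for (int i = 0; i < nStripes; i++) {
      stripes[i] = Executors.newSingleThreadExecutor();
    }
  }

  // Stable key -> stripe mapping; floorMod handles negative hash codes.
  Executor stripeFor(Object key) {
    return stripes[Math.floorMod(key.hashCode(), stripes.length)];
  }

  void shutdown() {
    for (ExecutorService s : stripes) {
      s.shutdown();
    }
  }
}
```

The trade-off is that a hot key can saturate its single stripe while others sit idle, so stripe count and key distribution would both matter in practice.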