Change TaskRunner to limit context switches. #5532

swankjesse · 2019-10-05T03:47:56Z

Now we don't have to alternate between the coordinator thread and the task
thread between task runs if the task returns 0. Instead the task thread can
stay resident.

This implementation works by having task runnables that can switch from
the coordinator role (sleeping until the next task starts) and the executor
role.

#5512

yschimke · 2019-10-05T04:41:30Z

okhttp/src/main/java/okhttp3/internal/concurrent/Task.kt

@@ -56,8 +56,6 @@ abstract class Task(
  /** Undefined unless this is in [TaskQueue.futureTasks]. */
  internal var nextExecuteNanoTime = -1L

-  internal var runRunnable: Runnable? = null
-
  /** Returns the delay in nanoseconds until the next execution, or -1L to not reschedule. */


nit: Negative values < -1 seem to be effectively a priority, is that ok? Or should be an error?

Not a big concern given this is internal.

Intended contract is for implementations to never return such values! If ever we wanted to make this more broadly usable we’d need to enforce that requirement.

yschimke · 2019-10-05T04:48:51Z

okhttp/src/main/java/okhttp3/internal/concurrent/TaskRunner.kt

+      while (true) {
+        val task = synchronized(this@TaskRunner) {
+          awaitTaskToRun()
+        } ?: return


I think my biggest concern is that the delay of the time to execute a solitary task becomes the limiting time to execute a later (but soon) task on another queue.

Or am I missing something?

Do you schedule a kick for next next task time somewhere?

Yeah, awaitTaskToRun might be misnamed. It’ll always schedule a thread if there’s something to do now (run or sleep) that competes with something to do later (run or sleep).

yschimke · 2019-10-05T04:52:24Z

okhttp/src/main/java/okhttp3/internal/concurrent/TaskRunner.kt

+      var readyTask: Task? = null
+      var multipleReadyTasks = false
+
+      // Decide what to run. This loop's goal wants to:


Crazy Idea: Instead of this loop goal, could you store the next two scheduled times, and loop to find the third at this point?

Maybe the question I should answer is, why two items?

The first one is what we’re going to execute on the current thread.
The second one justifies starting a new thread that’ll itself look for up to two items.

We stop after two because once we know we’re going to fork a thread, it becomes that thread’s problem to fork a thread for the third thing.

yschimke · 2019-10-05T05:07:56Z

Overall, I'd love to see real world thread usage for this. Any thoughts of how it looks in a profiler? Or whether we could output something like the Google Chrome Trace Event format?

yschimke

Hard for me to conclusively review in-situ. Needs more eyes or enough bake time and testing before release.

swankjesse · 2019-10-05T13:46:23Z

Before & after effects of fewer context switches. These are measurements for the cost of 100 sequential empty tasks on the same queue.

BEFORE

Benchmark                                          (executionCount)    Mode    Cnt     Score    Error  Units
SelectBenchmark.executeTasks                                    100  sample  10730   930.829 ± 10.726  us/op
SelectBenchmark.executeTasks:executeTasks·p0.00                 100  sample          655.360           us/op
SelectBenchmark.executeTasks:executeTasks·p0.50                 100  sample          761.856           us/op
SelectBenchmark.executeTasks:executeTasks·p0.90                 100  sample         1476.608           us/op
SelectBenchmark.executeTasks:executeTasks·p0.95                 100  sample         1673.216           us/op
SelectBenchmark.executeTasks:executeTasks·p0.99                 100  sample         2012.549           us/op
SelectBenchmark.executeTasks:executeTasks·p0.999                100  sample         3156.435           us/op
SelectBenchmark.executeTasks:executeTasks·p0.9999               100  sample         3675.513           us/op
SelectBenchmark.executeTasks:executeTasks·p1.00                 100  sample         3678.208           us/op

AFTER

Benchmark                                          (executionCount)    Mode    Cnt     Score   Error  Units
SelectBenchmark.executeTasks                                    100  sample  24266   412.118 ± 1.106  us/op
SelectBenchmark.executeTasks:executeTasks·p0.00                 100  sample          375.808          us/op
SelectBenchmark.executeTasks:executeTasks·p0.50                 100  sample          394.752          us/op
SelectBenchmark.executeTasks:executeTasks·p0.90                 100  sample          445.952          us/op
SelectBenchmark.executeTasks:executeTasks·p0.95                 100  sample          488.960          us/op
SelectBenchmark.executeTasks:executeTasks·p0.99                 100  sample          635.218          us/op
SelectBenchmark.executeTasks:executeTasks·p0.999                100  sample          959.896          us/op
SelectBenchmark.executeTasks:executeTasks·p0.9999               100  sample         1423.714          us/op
SelectBenchmark.executeTasks:executeTasks·p1.00                 100  sample         1501.184          us/op

swankjesse · 2019-10-05T17:47:56Z

Figuring out how to mature this up is the next challenge. The best way to do that is some real world exercise. I’m going to run our crawler a bunch, and maybe write some torture tests.

Now we don't have to alternate between the coordinator thread and the task thread between task runs if the task returns 0. Instead the task thread can stay resident. This implementation works by having task runnables that can switch from the coordinator role (sleeping until the next task starts) and the executor role. #5512

swankjesse · 2019-10-06T03:15:12Z

okhttp/src/main/java/okhttp3/internal/concurrent/TaskQueue.kt

@@ -87,7 +81,7 @@ class TaskQueue internal constructor(
  fun awaitIdle(delayNanos: Long): Boolean {
    val latch = CountDownLatch(1)

-    val task = object : Task("awaitIdle") {
+    val task = object : Task("awaitIdle", cancelable = false) {


this was the problem with flaky tests!

swankjesse · 2019-10-06T03:19:18Z

okhttp/src/test/java/okhttp3/CallKotlinTest.kt

@@ -134,6 +134,9 @@ class CallKotlinTest {
    recordedRequest = server.takeRequest()
    assertEquals("HEAD", recordedRequest.method)

+    recordedRequest = server.takeRequest()
+    assertThat(recordedRequest.failure).isNotNull()


this fixes a green/green collision with two earlier pull requests

yschimke reviewed Oct 5, 2019

View reviewed changes

yschimke approved these changes Oct 5, 2019

View reviewed changes

swankjesse force-pushed the jwilson.1004.new_task_runner branch from b785000 to 264e932 Compare October 6, 2019 01:59

swankjesse force-pushed the jwilson.1004.new_task_runner branch from 264e932 to ef4b5ec Compare October 6, 2019 03:14

swankjesse commented Oct 6, 2019

View reviewed changes

swankjesse merged commit afd9db3 into master Oct 6, 2019

swankjesse mentioned this pull request Oct 6, 2019

TaskRunner tracking bug #5512

Closed

15 tasks

swankjesse deleted the jwilson.1004.new_task_runner branch January 1, 2020 19:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change TaskRunner to limit context switches. #5532

Change TaskRunner to limit context switches. #5532

swankjesse commented Oct 5, 2019

yschimke Oct 5, 2019

swankjesse Oct 5, 2019

yschimke Oct 5, 2019

swankjesse Oct 5, 2019

yschimke Oct 5, 2019 •

edited

Loading

swankjesse Oct 5, 2019

yschimke commented Oct 5, 2019

yschimke left a comment

swankjesse commented Oct 5, 2019

swankjesse commented Oct 5, 2019

swankjesse Oct 6, 2019

swankjesse Oct 6, 2019

Change TaskRunner to limit context switches. #5532

Change TaskRunner to limit context switches. #5532

Conversation

swankjesse commented Oct 5, 2019

yschimke Oct 5, 2019

Choose a reason for hiding this comment

swankjesse Oct 5, 2019

Choose a reason for hiding this comment

yschimke Oct 5, 2019

Choose a reason for hiding this comment

swankjesse Oct 5, 2019

Choose a reason for hiding this comment

yschimke Oct 5, 2019 • edited Loading

Choose a reason for hiding this comment

swankjesse Oct 5, 2019

Choose a reason for hiding this comment

yschimke commented Oct 5, 2019

yschimke left a comment

Choose a reason for hiding this comment

swankjesse commented Oct 5, 2019

BEFORE

AFTER

swankjesse commented Oct 5, 2019

swankjesse Oct 6, 2019

Choose a reason for hiding this comment

swankjesse Oct 6, 2019

Choose a reason for hiding this comment

yschimke Oct 5, 2019 •

edited

Loading