
TaskRunner tracking bug #5512

Closed
15 tasks done
swankjesse opened this issue Sep 29, 2019 · 7 comments
Labels: enhancement (Feature, not a bug)
Milestone: 4.3

Comments

@swankjesse
Collaborator

swankjesse commented Sep 29, 2019

This issue is intended to help me organize what's done and what still needs doing for TaskRunner.

See also

swankjesse added the enhancement label on Sep 29, 2019
swankjesse added this to the 4.3 milestone on Sep 29, 2019
swankjesse pushed a commit that referenced this issue Oct 5, 2019
Now we don't have to alternate between the coordinator thread and the task
thread between task runs if the task returns 0. Instead the task thread can
stay resident.

This implementation works by having task runnables that can switch between
the coordinator role (sleeping until the next task starts) and the executor
role.

#5512
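For readers coming to this later, here's a rough sketch of that resident-thread idea: one thread alternates between the coordinator role (sleeping until the earliest task is due) and the executor role (running it), so no handoff between threads is needed between task runs. This is illustrative only; MiniTaskQueue and every detail below are assumptions for the sketch, not OkHttp's actual TaskRunner code.

```kotlin
import java.util.PriorityQueue
import java.util.concurrent.locks.ReentrantLock
import kotlin.concurrent.thread
import kotlin.concurrent.withLock

// Hypothetical MiniTaskQueue, for illustration only (not OkHttp's TaskRunner).
// A task returns its next delay in nanos, or -1L to stop repeating; 0L means
// "run again immediately", the case the commit message calls out.
class MiniTaskQueue {
  private val lock = ReentrantLock()
  private val taskAdded = lock.newCondition()
  private val queue = PriorityQueue(compareBy<Pair<Long, () -> Long>> { it.first })
  private var workerRunning = false

  fun schedule(delayNanos: Long, task: () -> Long) {
    lock.withLock {
      queue.add((System.nanoTime() + delayNanos) to task)
      if (!workerRunning) {
        workerRunning = true
        thread(isDaemon = true) { runWorker() }
      } else {
        taskAdded.signalAll() // wake the coordinator; the new task may be due sooner
      }
    }
  }

  private fun runWorker() {
    while (true) {
      var task: (() -> Long)? = null
      lock.withLock {
        // Coordinator role: wait until the earliest task is due.
        while (task == null) {
          val head = queue.peek() ?: run {
            workerRunning = false
            return // nothing left to coordinate; this thread exits
          }
          val waitNanos = head.first - System.nanoTime()
          if (waitNanos <= 0L) {
            queue.poll()
            task = head.second
          } else {
            taskAdded.awaitNanos(waitNanos)
          }
        }
      }
      // Executor role: run the task outside the lock, then reschedule if asked.
      val nextDelayNanos = task!!.invoke()
      if (nextDelayNanos >= 0L) schedule(nextDelayNanos, task!!)
    }
  }
}
```

The key point is just that the thread that ran the task falls back into the waiting loop rather than handing control to a separate coordinator thread.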
@swankjesse
Collaborator Author

To improve visibility I think I'll borrow from what we do for HTTP/2 frames: use logger.fine() to print events and metrics. If you know it exists you can turn it on, but there's no public API.
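A minimal sketch of that pattern, assuming java.util.logging like the HTTP/2 frame logging; the logger name "okhttp3.TaskRunner", the helper name, and the message format are placeholders, not the final output:

```kotlin
import java.util.logging.Level
import java.util.logging.Logger

// Placeholder logger name; the real name would be whatever TaskRunner ends up using.
val taskLogger: Logger = Logger.getLogger("okhttp3.TaskRunner")

/** Runs [block], logging the start and the elapsed time only when FINE is enabled. */
fun <T> logElapsed(queueName: String, taskName: String, block: () -> T): T {
  if (!taskLogger.isLoggable(Level.FINE)) return block() // no cost when logging is off
  taskLogger.fine("$queueName starting: $taskName")
  val startNs = System.nanoTime()
  val result = block()
  val tookMs = (System.nanoTime() - startNs) / 1_000_000
  taskLogger.fine("$queueName executed in $tookMs ms: $taskName")
  return result
}
```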

swankjesse pushed a commit that referenced this issue Dec 7, 2019
This was a regression introduced with the TaskRunner changes.
I couldn't find other places where daemon threads were likely
to cause problems.

#5512
@swankjesse
Collaborator Author

I wanna build debug logging for tasks. Here’s a sketch of a sample log:

Q1 ▌ scheduled after 100 ms: OkHttp example.com ping
Q1 ▌▌ scheduled after 200 ms: OkHttp example.com[3] writeSynReset
Q1 ▌▌ starting: OkHttp example.com ping
Q1 ▌▌ executed in 130 ms (next after 100 ms): OkHttp example.com ping
Q1 ▌▌ starting (31 ms late): OkHttp example.com[3] writeSynReset
Q1 ▌▌ executed in 2 ms: OkHttp example.com[3] writeSynReset
Q2 ▌ posted: OkHttp ConnectionPool

Notes:

  • Q1, Q2, etc. are queue names; each is a process-wide ID.
  • ▌▌ is the queue size in tasks, including the task that was just added or just completed.
  • scheduled is logged for delayed tasks, posted for immediate tasks.
  • Units are ms, µs, or s. No nanos in the logs; anything smaller just truncates to 0 µs (see the formatting sketch after this list).
  • "starting" includes a start delay like (31 ms late) when the task starts more than 10 ms later than scheduled. That's typically a busy CPU or a preceding task running long.
  • need to measure execution time!
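Here's the duration-formatting sketch referenced in the units bullet. It truncates into the three allowed units; the exact thresholds and the lack of rounding are assumptions to be settled in review:

```kotlin
// Truncate a duration into s / ms / µs; nanos are never printed.
fun formatDuration(nanos: Long): String = when {
  nanos >= 1_000_000_000L -> "${nanos / 1_000_000_000L} s"
  nanos >= 1_000_000L -> "${nanos / 1_000_000L} ms"
  else -> "${nanos / 1_000L} µs" // anything under 1 µs truncates to "0 µs"
}
```

For example, formatDuration(130_000_000L) is "130 ms" and formatDuration(500L) is "0 µs".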

These logs will be off by default. Enable FINE logging on TaskRunner to see them.
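To turn them on from a test or an app, something like this should do, assuming java.util.logging and the same placeholder logger name as in the sketch above (the real name may differ):

```kotlin
import java.util.logging.ConsoleHandler
import java.util.logging.Level
import java.util.logging.Logger

fun enableTaskRunnerLogging() {
  val logger = Logger.getLogger("okhttp3.TaskRunner") // placeholder name
  logger.level = Level.FINE
  logger.addHandler(ConsoleHandler().apply { level = Level.FINE })
}
```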

@Egorand
Collaborator

Egorand commented Dec 9, 2019

A couple thoughts:

  • Probably worth keeping logs indented to the same level after the queue size, otherwise it feels like logs with the same indentation are grouped when they're not. Perhaps a uniform indentation that grows as the queue gets bigger could work (see the padding sketch after this list)?
Q1 ▌     scheduled after 100 ms: OkHttp example.com ping
Q1 ▌▌    scheduled after 200 ms: OkHttp example.com[3] writeSynReset
Q1 ▌▌▌   starting: OkHttp example.com ping
Q1 ▌▌▌▌  executed in 130 ms (next after 100 ms): OkHttp example.com ping
Q1 ▌▌    starting (31 ms late): OkHttp example.com[3] writeSynReset
Q1 ▌▌▌▌▌ executed in 2 ms: OkHttp example.com[3] writeSynReset
Q2 ▌▌▌▌▌▌▌▌▌▌ posted: OkHttp ConnectionPool
Q2 ▌▌▌▌▌▌▌    posted: OkHttp ConnectionPool
Q2 ▌▌▌▌▌▌▌▌▌  posted: OkHttp ConnectionPool
Q2 ▌     posted: OkHttp ConnectionPool
  • How about printing scheduled, starting, executed and posted in bold?
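A quick sketch of the uniform-indentation idea from the first bullet; the minimum width of 5 bars and the helper name are arbitrary choices for illustration:

```kotlin
// Pad the queue-size bars to a fixed minimum width so the messages line up;
// the column still grows when a queue gets deeper than the minimum.
fun formatLogLine(queueName: String, queueSize: Int, message: String): String {
  val bars = "▌".repeat(queueSize).padEnd(maxOf(queueSize, 5))
  return "$queueName $bars $message"
}
```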

@yschimke
Collaborator

@swankjesse Nice work on this, impressive to see the progression!

@swankjesse
Collaborator Author

I think I'm ready to call this complete. Anything else you wanna see for observability or troubleshooting?

@yschimke
Collaborator

@swankjesse let's follow up on that while working through flaky CI tests. That's my main goal: can we use our own tools to debug after the fact?

@swankjesse
Collaborator Author

No further action for this issue. Will work through flaky tests!
