Gc/compaction thread pool, take 2 #1933
Conversation
I like this approach more than the previous one and find it generally simpler, but I'm biased, so take my opinion with a grain of salt.
I find the new GC and compaction task management somewhat concerning, but if it fixes the thread issues on staging, never mind my concerns.
Looks good otherwise.
Thanks! I like the approach. To me, it is a better option than take 1, but in that regard I'm as biased as @SomeoneToIgnore :) +1 for @SomeoneToIgnore's comments.
Thank you for fixing all this.
I think we're getting roughly on par with what was there before, but we lack some error observability: let's add more error logging before anyhow::bail in both gc_loop and compaction_loop.
AFAIK, we were not restarting errored GC and compaction threads before either, so all my other comments are nice to have, but nothing is required for this PR.
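A minimal sketch of the kind of logging meant above, assuming a per-iteration error path; run_one_gc_iteration, gc_step, and the messages are placeholders, not the actual code in gc_loop/compaction_loop:

```rust
use anyhow::bail;
use tracing::error;

// Placeholder standing in for one GC/compaction iteration.
fn run_one_gc_iteration() -> anyhow::Result<()> {
    Ok(())
}

fn gc_step() -> anyhow::Result<()> {
    // Log the error before bailing so it is visible in the pageserver log,
    // not only in the task's join result.
    if let Err(e) = run_one_gc_iteration() {
        error!("GC iteration failed: {:#}", e);
        bail!("GC iteration failed: {:#}", e);
    }
    Ok(())
}
```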
Personally I'm OK with this RwLock approach, though I'm for unification where possible, so if the walreceiver approach with cancellation channels works here too, I would use it here as well. Or, if we're changing it to something different, let's change it in both places. Current tenant state management already has some problems; we've discussed possible improvements with @SomeoneToIgnore, so we can polish it separately. What is important is that this patch should solve the thread count issue. Do you have ideas on tests we can add to check that all this works correctly?
Most tests that I'd like to write I already know will fail; for example, there are the state management race conditions. I also tried testing idle->active->idle->active transitions, but the only way to make a tenant idle is to detach it; stopping the compute node doesn't make it idle. We need to rework tenant state management. I think @SomeoneToIgnore is already working on an RFC, so I don't want to get in too deep here.
I'll write a test to assert that tasks started and stopped, at least.
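For context on the cancellation-channel pattern being discussed, here is a rough sketch of spawning a new task while requesting cancellation of the old one; compaction_loop, spawn_compaction, and the sleep interval are illustrative, not the PR's exact code:

```rust
use std::time::Duration;
use tokio::sync::watch;

// Illustrative loop: exit when the cancellation channel fires, otherwise
// run one iteration per tick.
async fn compaction_loop(mut cancel: watch::Receiver<()>) {
    loop {
        tokio::select! {
            _ = cancel.changed() => break, // cancellation requested
            _ = tokio::time::sleep(Duration::from_secs(10)) => {
                // run one compaction iteration here
            }
        }
    }
}

// Spawn a new task, requesting cancellation of the old one if it exists.
fn spawn_compaction(old_cancel: &mut Option<watch::Sender<()>>) {
    if let Some(sender) = old_cancel.take() {
        let _ = sender.send(());
    }
    let (cancel_send, cancel_recv) = watch::channel(());
    tokio::spawn(compaction_loop(cancel_recv));
    *old_cancel = Some(cancel_send);
}
```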
pageserver/src/tenant_tasks.rs
// Spawn new task, request cancellation of the old one if exists
let (cancel_send, cancel_recv) = watch::channel(());
// TODO this instrument doesn't work
Not sure why. info! traces from inside the future don't show up with compaction loop as context. Will fix before merging.
Replacing trace_span with info_span fixes the issue. What are the logging level semantics here? Does a span with a certain level apply only to events of an equal or stronger level? The docs don't say anything.
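A possible explanation (not verified against this branch): spans are subject to the subscriber's max-level filter just like events, so under an info filter a trace_span! is disabled and never attached as context, even though the info! events inside it still fire. A minimal sketch of the working variant, with the span name taken from the discussion above:

```rust
use tracing::{info, info_span, Instrument};

// An INFO-level span survives an `info` max-level filter, so the event below
// should be reported with "compaction loop" attached as its span context.
let task = async {
    info!("starting compaction iteration");
}
.instrument(info_span!("compaction loop"));
tokio::spawn(task);
```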
Hmm, that's weird. I tried the example from the docs here https://docs.rs/tracing/latest/tracing/index.html#configuring-attributes and it worked for me; the info message was emitted. Though it does not use futures. I'll try to reproduce it tomorrow with the code from this branch.
Looks good to me, thanks!
One nit: I think the place with the new file lock may benefit from an extended comment mentioning the cases that are not covered by the lock. Maybe even point to an issue.
It should conflict with my PR #1936, so go ahead and I'll rebase my patch on top of yours :)
Staging metrics, FYI: https://observer.zenith.tech/d/GOx33ve7z/tenant-tasks?orgId=1
40 worker threads and 100 max_blocking_threads mean that we can actually do 100 compactions in parallel, which seems to be too much. The question is what the proper values of these parameters are. Even two parallel compactions can significantly degrade pageserver performance, but if we limit it to 1, then compaction will be performed too slowly, which can also hurt performance because of read amplification and extra page reconstructions.
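For reference, a hedged sketch of the tokio knobs being discussed; the numbers are just the ones mentioned above, not a recommendation:

```rust
// worker_threads bounds the async workers; max_blocking_threads bounds how many
// spawn_blocking jobs (and thus blocking compaction/GC work) can run at once.
let runtime = tokio::runtime::Builder::new_multi_thread()
    .worker_threads(40)
    .max_blocking_threads(100)
    .enable_all()
    .build()
    .expect("failed to build tenant task runtime");
```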
My three main concerns about this patch:
- The compaction thread performs both compaction and materialization. These are different operations, and I still think they should be separated.
- Besides compaction, a lot of IO is produced by frozen layer flushes, and those are performed by other threads that are not controlled by this pool. If the main goal of this patch is to reduce the number of threads, this may not be a big problem (although the number of spawned flush threads can be as large as the number of tenants, so it can still be too much). But if we also want to avoid a "write storm" by reducing the number of expensive IO operations performed concurrently, then we should take these flush threads into account.
- This thread pool is used for both GC and compaction tasks, but these operations have completely different costs. GC just iterates layers and reads layer metadata; it is relatively fast and not IO intensive. Compaction, by contrast, is very IO intensive and can take a lot of time. Maybe it is not such a good idea to put them in a single thread pool (see the concurrency-cap sketch below).
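One hedged option for keeping heavy compactions from crowding out cheap GC without separate pools would be a per-operation concurrency cap; the constant, function name, and limit of 2 below are illustrative, not what the PR does:

```rust
use std::sync::Arc;
use tokio::sync::Semaphore;

// Hypothetical cap: at most 2 compactions run concurrently regardless of how
// many blocking threads the runtime allows; GC tasks would not take a permit.
const MAX_CONCURRENT_COMPACTIONS: usize = 2;

async fn run_compaction(sem: Arc<Semaphore>) {
    let _permit = sem.acquire().await.expect("semaphore closed");
    // perform one compaction iteration while holding the permit
}
```

Usage would be one Arc::new(Semaphore::new(MAX_CONCURRENT_COMPACTIONS)) per pageserver, cloned into each compaction task.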
Resolves #1646
Resolves #1623
Previous attempt: #1794