client: Support TSO RPC Parallelizing #8432

MyonKeminta · 2024-07-23T10:59:35Z

Development Task

The main tracking issue in TiDB repo: TSO Request Parallelizing pingcap/tidb#54960

We have found that in some OLTP workloads with high QPS and simple queries, the TSO Wait duration often becomes a significant portion of the total query duration. In TiDB, TSO loading already runs concurrently with other work such as compiling. When the queries are simple, it is hard to optimize further by overlapping it with more phases of SQL execution. A practical alternative is to optimize it on the TSO client side.

Currently, a TSO client object has a goroutine that collects GetTS (and GetTSAsync) calls (tsoRequests) into a batch, sends it to PD, waits for the response, and dispatches the results to these tsoRequests, all serially. As a result, each GetTS call may need to wait up to 1x the TSO RPC time before it is collected into the next batch.
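The serial loop described above can be sketched roughly as follows. This is a minimal illustration, not the actual PD client code: `tsoRequest`, `allocator`, and `serialDispatch` are hypothetical names, and the RPC to PD is replaced by an in-memory counter.

```go
package main

import "fmt"

// tsoRequest is a hypothetical stand-in for one pending GetTS call.
type tsoRequest struct {
	done chan int64 // buffered; receives the allocated timestamp
}

// allocator is a hypothetical in-memory stand-in for PD's TSO allocator.
type allocator struct {
	next int64 // next timestamp to hand out
	rpcs int   // number of simulated RPCs served
}

// rpc simulates one TSO RPC: it reserves count consecutive timestamps
// and returns the first one.
func (a *allocator) rpc(count int) int64 {
	first := a.next
	a.next += int64(count)
	a.rpcs++
	return first
}

// serialDispatch collects whatever is queued into one batch, performs one
// RPC, dispatches the results, and only then starts the next batch.
func serialDispatch(reqs <-chan *tsoRequest, a *allocator) {
	for first := range reqs {
		batch := []*tsoRequest{first}
	collect:
		for {
			select {
			case r, ok := <-reqs:
				if !ok {
					break collect
				}
				batch = append(batch, r)
			default:
				break collect // nothing else queued right now
			}
		}
		base := a.rpc(len(batch)) // the loop blocks here for the whole RPC
		for i, r := range batch {
			r.done <- base + int64(i)
		}
	}
}

func main() {
	reqs := make(chan *tsoRequest, 3)
	var pending []*tsoRequest
	for i := 0; i < 3; i++ {
		r := &tsoRequest{done: make(chan int64, 1)}
		pending = append(pending, r)
		reqs <- r
	}
	close(reqs)

	a := &allocator{next: 100}
	serialDispatch(reqs, a)
	for _, r := range pending {
		fmt.Println("ts =", <-r.done)
	}
	fmt.Println("RPCs sent:", a.rpcs)
}
```

In this sketch, any request that arrives while `a.rpc` is in progress stays in the channel until the loop comes back around, which is the waiting time the issue aims to reduce.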

Consider the case where PD's TSO allocator is not the bottleneck and can handle more TSO requests (so that most of the TSO RPC's time cost is on the network). In that case, it is possible to start collecting the next batch and send it before receiving the response to the previous batch. Each GetTS call then waits less time to be batched and has a shorter total duration.

So this approach reduces the duration of GetTS & GetTSAsync - Wait at the expense of higher TSO RPC OPS and higher pressure on PD. It is not suitable to enable by default, but we can provide it as an option for when the TSO Wait duration becomes a problem.

Subtasks

Side changes:

Merge the two xxxTSOStream types so that the error handling and metrics reporting logic for PD server deployment and TSO service deployment can be reused (client: Merge the two tsoStream types to reuse the same error handling and metrics reporting code #8433)
Split the sending and receiving parts of tsoStream into separate goroutines
Let tsoDispatcher support batching according to the estimated TSO RPC duration from tsoStream
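As an illustration of why splitting sending and receiving matters, the sketch below sends several batches before any response arrives and lets a separate receiver goroutine match responses to batches in FIFO order. It is only a sketch under assumptions: `batch`, `fakeStream`, and `runPipelined` are hypothetical names, the stream is an in-memory fake that answers in request order, and error handling is omitted.

```go
package main

import (
	"fmt"
	"sync"
)

// batch is a hypothetical group of GetTS calls sent in one request.
type batch struct {
	count  int
	result chan int64 // buffered; receives the first timestamp of the batch
}

// fakeStream is a hypothetical in-memory stand-in for a bidirectional
// stream whose responses arrive in the same order as the requests.
type fakeStream struct {
	reqs  chan int
	resps chan int64
}

func newFakeStream(start int64) *fakeStream {
	s := &fakeStream{reqs: make(chan int, 16), resps: make(chan int64, 16)}
	go func() { // fake server: allocate consecutive timestamp ranges
		next := start
		for c := range s.reqs {
			s.resps <- next
			next += int64(c)
		}
		close(s.resps)
	}()
	return s
}

// runPipelined sends every batch without waiting for earlier responses.
// The receiver goroutine pairs each response with the oldest pending batch.
func runPipelined(s *fakeStream, batches []*batch) {
	pending := make(chan *batch, len(batches))
	var wg sync.WaitGroup
	wg.Add(1)
	go func() { // receiving side
		defer wg.Done()
		for b := range pending {
			b.result <- <-s.resps
		}
	}()
	for _, b := range batches { // sending side
		pending <- b // record the order before sending
		s.reqs <- b.count
	}
	close(pending)
	close(s.reqs)
	wg.Wait()
}

func main() {
	batches := []*batch{
		{count: 2, result: make(chan int64, 1)},
		{count: 3, result: make(chan int64, 1)},
		{count: 1, result: make(chan int64, 1)},
	}
	runPipelined(newFakeStream(10), batches)
	for i, b := range batches {
		fmt.Printf("batch %d: first ts = %d, count = %d\n", i, <-b.result, b.count)
	}
}
```

The FIFO queue of pending batches is one simple way to match responses to requests when the stream preserves ordering; a dispatcher could then decide when to start the next batch from an estimate of the RPC duration, which this sketch does not attempt.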


…g and metrics reporting code (#8433) ref #8432 client: Merge the two tsoStream types to reuse the same error handling and metrics reporting code This commit merges the two `xxxTSOStream` types so that the error handling and metrics reporting logic for PD server deployment and TSO service deployment can be reused. Signed-off-by: MyonKeminta <MyonKeminta@users.noreply.github.com> Co-authored-by: ti-chi-bot[bot] <108142056+ti-chi-bot[bot]@users.noreply.github.com>

ref #8432 client: Make tsoStream receives asynchronously. This makes it possible to allow the tsoDispatcher send multiple requests and wait for their responses concurrently. Signed-off-by: MyonKeminta <MyonKeminta@users.noreply.github.com> Co-authored-by: ti-chi-bot[bot] <108142056+ti-chi-bot[bot]@users.noreply.github.com>

MyonKeminta added the type/development The issue belongs to a development tasks label Jul 23, 2024

This was referenced Jul 23, 2024

client: Merge the two tsoStream types to reuse the same error handling and metrics reporting code #8433

Merged

TSO Request Parallelizing pingcap/tidb#54960

Closed

MyonKeminta mentioned this issue Aug 30, 2024

client: Make tsoStream receives asynchronously #8483

Merged

MyonKeminta mentioned this issue Sep 12, 2024

client: Add benchmark for tsoStream and tsoDispatcher #8618

Closed

ti-chi-bot bot closed this as completed in #8633 Sep 26, 2024

ti-chi-bot bot closed this as completed in 642f0e9 Sep 26, 2024
