vtgate/buffer: Fix leakage of buffer pool slots due to canceled requests. #2599

michael-berlin · 2017-02-26T06:24:47Z

Canceled requests, e.g. those with a deadline shorter than the failover duration, would not return their buffer pool slot. Eventually, all slots would have leaked and the buffer would start and stop buffering as usual but reject all requests immediately because it assumed there are other pending failovers which are holding the needed slots.

Previously, the code path for canceled requests differed from the common method unblockAndWait() which would unblock a request and also release its buffer pool slot after the request finished its retry. This made this oversight possible. I've changed this now: canceled requests also use unblockAndWait() now.

The code was also out of sync with the documentation for WaitForFailoverEnd(): The RetryDoneFunc() must not be returned when the function returns an error as well. However, for canceled requests both were returned.

…sts. Canceled requests, e.g. those with a deadline shorter than the failover duration, would not return their buffer pool slot. Eventually, all slots would have leaked and the buffer would start and stop buffering as usual but reject all requests immediately because it assumed there are other pending failovers which are holding the needed slots. Previously, the code path for canceled requests differed from the common method unblockAndWait() which would unblock a request and also release its buffer pool slot after the request finished its retry. This made this oversight possible. I've changed this now: canceled requests also use unblockAndWait() now. The code was also out of sync with the documentation for WaitForFailoverEnd(): The RetryDoneFunc() must not be returned when the function returns an error as well. However, for canceled requests both were returned.

sougou

Some drive-by comments.

sougou · 2017-02-26T20:54:02Z

go/sync2/semaphore.go

+
+// Size returns the current number of available slots.
+func (sem *Semaphore) Size() int {
+ return len(sem.slots)


I think this racy, unless the language guarantees it. I couldn't find any text about this.

sougou · 2017-02-26T20:55:53Z

go/vt/vtgate/buffer/buffer_test.go

+// returned when the request has finished. See also shardBuffer.unblockAndWait().
+func waitForPoolSlots(b *Buffer, want int) error {
+ start := time.Now()
+ for {


This is a tight loop and will hang in non-preemptive implementations of the runtime. Even otherwise, it's something we should avoid.

Addressed code review comment in vitessio#2599.

michael-berlin requested a review from alainjobart February 26, 2017 06:24

googlebot added the cla: yes label Feb 26, 2017

alainjobart approved these changes Feb 26, 2017

View reviewed changes

michael-berlin merged commit 5705d03 into vitessio:master Feb 26, 2017

michael-berlin deleted the fix_buffer_leak branch February 26, 2017 18:40

sougou reviewed Feb 26, 2017

View reviewed changes

michael-berlin added a commit to michael-berlin/vitess that referenced this pull request Apr 19, 2017

vtgate: Avoid tight loop in unit tests.

cec0142

Addressed code review comment in vitessio#2599.

michael-berlin mentioned this pull request Apr 19, 2017

vtgate: Avoid tight loop in unit tests. #2787

Merged

frouioui pushed a commit to planetscale/vitess that referenced this pull request Mar 26, 2024

cherry pick of 12963 (vitessio#2599)

2adf195

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vtgate/buffer: Fix leakage of buffer pool slots due to canceled requests. #2599

vtgate/buffer: Fix leakage of buffer pool slots due to canceled requests. #2599

michael-berlin commented Feb 26, 2017

sougou left a comment

sougou Feb 26, 2017

sougou Feb 26, 2017

vtgate/buffer: Fix leakage of buffer pool slots due to canceled requests. #2599

vtgate/buffer: Fix leakage of buffer pool slots due to canceled requests. #2599

Conversation

michael-berlin commented Feb 26, 2017

sougou left a comment

Choose a reason for hiding this comment

sougou Feb 26, 2017

Choose a reason for hiding this comment

sougou Feb 26, 2017

Choose a reason for hiding this comment