Correct tryTimeoutAndWriteError to write timeout regardless of prior writes by Fedosin · Pull Request #15900 · knative/serving

Fedosin · 2025-05-26T16:13:58Z

Proposed Changes

Previously, the function comment suggested it would only write errors if nothing had been written, but the implementation correctly only checks the timedOut flag. This allows timeout errors to be written even after a response has started, which is the desired behavior for handling slow responses.

- Fixed misleading function comment
- Updated test to match actual behavior
- Added comprehensive test coverage

Release Note

NONE

codecov · 2025-05-26T16:18:58Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 80.93%. Comparing base (c36383e) to head (2ce75a8).
Report is 10 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main   #15900      +/-   ##
==========================================
- Coverage   80.94%   80.93%   -0.01%     
==========================================
  Files         210      210              
  Lines       16769    16769              
==========================================
- Hits        13573    13572       -1     
- Misses       2844     2845       +1     
  Partials      352      352

matzew · 2025-05-27T05:07:09Z

pkg/http/handler/timeout.go

 	defer tw.mu.Unlock()

-	if tw.lastWriteTime.IsZero() {
+	if !tw.timedOut {


is this always true when nothing has been written?

Looking at the changes to the comment and the removal of lastWriteTime: it sounded to me like the "has not been written" part was added because of lastWriteTime?

Just wondering 😅

Well, there are two situations when we set the timeout flag:
1. When we reach the response start timeout: https://github.com/knative/serving/blob/main/pkg/http/handler/timeout.go#L248
2. When we hit the regular timeout: https://github.com/knative/serving/blob/main/pkg/http/handler/timeout.go#L230

Currently, both functions (tryResponseStartTimeoutAndWriteError and tryTimeoutAndWriteError) are identical and do the following:

Timeout (write an error header and set the timeout flag to true) if nothing has been written to the response.

This means that we never get inside the condition on line 228: either we write something and lastWriteTime.IsZero() is false, or we don’t write anything and fail earlier by response start timeout.

In my vision, tryTimeoutAndWriteError should always write the error, ignoring whether anything was written before or not. Checking the timedOut flag is needed to make this function idempotent.
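The difference between the two checks can be illustrated with a minimal sketch. This is not the code in pkg/http/handler/timeout.go: the `sketchWriter` type, its fields, and the method bodies are assumptions reconstructed from the description in this thread, and `httptest.ResponseRecorder` stands in for a real connection.

```go
package main

import (
	"fmt"
	"net/http"
	"net/http/httptest"
	"sync"
	"time"
)

// sketchWriter is a hypothetical stand-in for the package's timeout writer.
type sketchWriter struct {
	w             http.ResponseWriter
	mu            sync.Mutex
	timedOut      bool
	lastWriteTime time.Time
}

// Write records the time of the write and refuses writes after a timeout.
func (tw *sketchWriter) Write(p []byte) (int, error) {
	tw.mu.Lock()
	defer tw.mu.Unlock()
	if tw.timedOut {
		return 0, http.ErrHandlerTimeout
	}
	tw.lastWriteTime = time.Now()
	return tw.w.Write(p)
}

// tryResponseStartTimeout times out only if nothing has been written yet.
func (tw *sketchWriter) tryResponseStartTimeout(msg string) bool {
	tw.mu.Lock()
	defer tw.mu.Unlock()
	if tw.lastWriteTime.IsZero() {
		tw.timeoutLocked(msg)
		return true
	}
	return false
}

// tryTimeout ignores prior writes; the timedOut check only makes it idempotent.
func (tw *sketchWriter) tryTimeout(msg string) bool {
	tw.mu.Lock()
	defer tw.mu.Unlock()
	if !tw.timedOut {
		tw.timeoutLocked(msg)
		return true
	}
	return false
}

// timeoutLocked writes the error and sets the flag; the caller holds tw.mu.
func (tw *sketchWriter) timeoutLocked(msg string) {
	tw.w.WriteHeader(http.StatusGatewayTimeout)
	fmt.Fprint(tw.w, msg)
	tw.timedOut = true
}

// demo writes part of a response, then fires both timeouts in order.
func demo() (startFired, first, second bool, code int, body string) {
	rec := httptest.NewRecorder()
	tw := &sketchWriter{w: rec}
	tw.Write([]byte("partial")) // the handler has started responding
	startFired = tw.tryResponseStartTimeout("timeout")
	first = tw.tryTimeout("timeout")
	second = tw.tryTimeout("timeout")
	return startFired, first, second, rec.Code, rec.Body.String()
}

func main() {
	fmt.Println(demo()) // prints: false true false 200 partialtimeout
}
```

In this sketch the second call to `tryTimeout` is a no-op, which is the idempotence mentioned above. Note that the recorder keeps the 200 status set by the first write, so the timeout message is appended to a response that has already started.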

@Fedosin thanks for the further explanation. It makes sense to me even though I don't have a deep knowledge of Serving's timeout handling.

In my vision, tryTimeoutAndWriteError should always write the error, ignoring whether anything was written before or not.

That would be unexpected for end-users - so we shouldn't do this.

If something is written on the wire already and a timeout occurs we should just close the connection.
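For reference, one way Go's standard library lets a handler drop the connection after a partial write is to panic with `http.ErrAbortHandler`. The sketch below only demonstrates that mechanism; the handler name and the simulated timeout are hypothetical, and nothing here implies the package does or should work this way.

```go
package main

import (
	"fmt"
	"io"
	"net/http"
	"net/http/httptest"
)

// abortAfterPartialWrite is a hypothetical handler that starts a response
// and then gives up, as if a timeout had fired mid-response.
func abortAfterPartialWrite(w http.ResponseWriter, _ *http.Request) {
	io.WriteString(w, "partial")
	if f, ok := w.(http.Flusher); ok {
		f.Flush() // headers and the partial body are now on the wire
	}
	// Abort the response without logging a stack trace; the server
	// closes the connection instead of finishing the body.
	panic(http.ErrAbortHandler)
}

// fetch runs the handler on a loopback test server and reads the response.
func fetch() (status int, body string, err error) {
	srv := httptest.NewServer(http.HandlerFunc(abortAfterPartialWrite))
	defer srv.Close()

	resp, err := http.Get(srv.URL)
	if err != nil {
		return 0, "", err
	}
	defer resp.Body.Close()

	data, err := io.ReadAll(resp.Body)
	return resp.StatusCode, string(data), err
}

func main() {
	status, body, err := fetch()
	// The client sees the original status and a read error for the
	// truncated body, rather than an error message appended to the payload.
	fmt.Println(status, body, err)
}
```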

dsimansk

/lgtm

dsimansk · 2025-05-28T11:31:03Z

PTAL
/assign @dprotaso

matzew

/lgtm
/approve

Thanks for the details, @Fedosin

dsimansk · 2025-06-05T08:57:28Z

/approve

knative-prow · 2025-06-05T08:57:37Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: dsimansk, Fedosin, matzew

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details

Needs approval from an approver in each of these files:

~~OWNERS~~ [dsimansk]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

dprotaso · 2025-06-05T18:30:55Z

One thing I find odd about this package is that if a timeout happens we should exit the for loop and close the connection, but we only do that conditionally for some reason, which doesn't seem right. The idle timeout should be the only one that is looping.

for {
	select {
	// ...
	case <-timeout.C():
		timeoutDrained = true
		if tw.tryTimeoutAndWriteError(h.body) {
			return
		}
	}
}
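The loop shape described in this comment could look roughly like the following sketch. It is an illustration only: `waitForHandler`, its parameters, and the string results are hypothetical, and error writing and connection handling are left out so that only the control flow remains.

```go
package main

import (
	"fmt"
	"time"
)

// waitForHandler is a hypothetical reduction of the select loop. The overall
// timeout returns unconditionally; only the idle timeout may loop.
func waitForHandler(done <-chan struct{}, overall, idle time.Duration, lastWrite func() time.Time) string {
	overallTimer := time.NewTimer(overall)
	defer overallTimer.Stop()
	idleTimer := time.NewTimer(idle)
	defer idleTimer.Stop()

	for {
		select {
		case <-done:
			return "completed"
		case <-overallTimer.C:
			// No condition here: the overall timeout always ends the loop.
			return "timeout"
		case <-idleTimer.C:
			// A recent write means the handler is not idle yet, so wait
			// for the remainder of the idle window and loop again.
			if remaining := idle - time.Since(lastWrite()); remaining > 0 {
				idleTimer.Reset(remaining)
				continue
			}
			return "idle timeout"
		}
	}
}

// demo runs the loop against four simple scenarios.
func demo() []string {
	never := make(chan struct{})
	finished := make(chan struct{})
	close(finished)

	start := time.Now()
	busy := func() time.Time { return time.Now() } // handler keeps writing
	quiet := func() time.Time { return start }     // handler stopped writing

	return []string{
		waitForHandler(finished, time.Hour, time.Hour, busy),
		waitForHandler(never, 20*time.Millisecond, time.Hour, busy),
		waitForHandler(never, time.Hour, 20*time.Millisecond, quiet),
		waitForHandler(never, 60*time.Millisecond, 20*time.Millisecond, busy),
	}
}

func main() {
	fmt.Println(demo()) // prints: [completed timeout idle timeout timeout]
}
```

In the last scenario the idle timer fires repeatedly while the handler stays busy, and the loop ends only when the overall timer fires.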

* Fix flakes in TestIdleTimeoutHandler (knative#15918)

* Correct tryTimeoutAndWriteError to write timeout regardless of prior writes (knative#15900)

Previously, the function comment suggested it would only write errors if nothing had been written, but the implementation correctly only checks the timedOut flag. This allows timeout errors to be written even after a response has started, which is the desired behavior for handling slow responses.

- Fixed misleading function comment
- Updated test to match actual behavior
- Added comprehensive test coverage

* Fix request hanging after response start timeout expires (knative#15899)

When a response starts before the responseStartTimeout but the timeout still fires, the timeout handler would not properly continue processing the request. This caused requests to hang indefinitely if they started responding just before the responseStartTimeout expired.

The issue occurred because after handling the responseStartTimeout case, the select loop would continue but without properly waiting for the handler to complete. Setting responseStartTimeoutDrained ensures the timer is properly cleaned up and the loop continues to process other events (completion, overall timeout, or idle timeout).

Fixes requests that start responding before responseStartTimeout but take longer than responseStartTimeout to complete.

---------

Co-authored-by: Mike Fedosin <mfedosin@redhat.com>

knative-prow bot added the size/L Denotes a PR that changes 100-499 lines, ignoring generated files. label May 26, 2025

knative-prow bot requested review from dprotaso, gauron99 and skonto May 26, 2025 16:14

matzew reviewed May 27, 2025

Fedosin force-pushed the tryTimeoutAndWriteError branch from bad6248 to d2f583e Compare May 27, 2025 08:22

Fedosin force-pushed the tryTimeoutAndWriteError branch from d2f583e to 2ce75a8 Compare May 27, 2025 08:22

dsimansk reviewed May 28, 2025

knative-prow bot assigned dsimansk May 28, 2025

knative-prow bot added the lgtm Indicates that a PR is ready to be merged. label May 28, 2025

knative-prow bot assigned dprotaso May 28, 2025

dsimansk mentioned this pull request May 29, 2025

Activator sometimes crashing when requests time out #15850

Open

matzew approved these changes Jun 5, 2025

knative-prow bot assigned matzew Jun 5, 2025

knative-prow bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jun 5, 2025

knative-prow bot merged commit 80d7335 into knative:main Jun 5, 2025
68 checks passed

Fedosin mentioned this pull request Jun 10, 2025

Fix flakes in TestIdleTimeoutHandler #15918

Merged
