
[Experimental-Feature] DeliverySpec.RetryAfter #5813

Conversation

travis-minke-sap
Contributor

@travis-minke-sap travis-minke-sap commented Oct 14, 2021

This PR addresses parts of #5811 by creating the experimental-feature in the Alpha stage.

TL;DR: A new experimental feature allowing users to opt in to respecting the Retry-After header when calculating the appropriate backoff duration for 429 and 503 responses.

Proposed Changes

  • 🎁 Add new delivery-retryafter feature flag.
  • 🎁 Enhance DeliverySpec with new optional RetryAfterMax component
  • 🎁 Enforce DeliverySpec.RetryAfterMax usage against feature flag in WebHook validation
  • 🎁 Add new DeliverySpec.RetryAfterMax to Subscription Controller/Reconciler updating of Channel.Spec.Subscribers
  • 🎁 Enhance RetryConfig struct to include RetryAfterMax configuration.
  • 🎁 Enhance RetryConfigFromDeliverySpec() to parse DeliverySpec.RetryAfterMax into RetryConfig.
  • 🎁 Enhance SendWithRetries() to consider Retry-After headers and RetryConfig settings when calculating backoff durations.
  • 🎁 Update codegen (local docs)
  • 🎁 Update experimental-features documentation

Open Questions

  1. I modified pkg/reconciler/subscription/subscription_test.go to cover the changes in the Subscription controller. This introduced the need for the test to be aware of experimental-features. I think this is a win but wanted to confirm it was acceptable (i.e., no need to keep the controller test "pure").
  2. This implementation mirrors that of the Timeout experimental-feature, which also updated the v1beta1 version of the DeliverySpec without any experimental-feature gates. I wasn't sure of the rationale for doing so and have NOT done so in this PR. If someone could explain why we'd want to do this, I'll be happy to include it.

Pre-review Checklist

  • At least 80% unit test coverage
  • E2E tests for any new behavior
  • Docs PR for any user-facing impact
  • Spec PR for any new API feature (Will be created if experimental-feature graduates to Stable state.)
  • Conformance test for any change to the spec. (If needed will be added in experimental-feature Beta release.)

Release Note

The new experimental-feature flag "delivery-retryafter" allows use of "DeliverySpec.retryAfterMax" to configure handling of Retry-After headers in 429 / 503 responses. See https://github.com/knative/docs/blob/main/docs/eventing/experimental-features.md

Docs

knative/docs#4361

Holding for the experimental-features review process and review of the refactoring approach
/hold

@knative-prow-robot knative-prow-robot added do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. labels Oct 14, 2021
@google-cla google-cla bot added the cla: yes Indicates the PR's author has signed the CLA. label Oct 14, 2021
@codecov

codecov bot commented Oct 14, 2021

Codecov Report

Merging #5813 (5be0e20) into main (2cda8f4) will increase coverage by 0.21%.
The diff coverage is 96.00%.


@@            Coverage Diff             @@
##             main    #5813      +/-   ##
==========================================
+ Coverage   82.02%   82.23%   +0.21%     
==========================================
  Files         220      220              
  Lines        7527     7572      +45     
==========================================
+ Hits         6174     6227      +53     
+ Misses        918      911       -7     
+ Partials      435      434       -1     
Impacted Files Coverage Δ
pkg/kncloudevents/message_sender.go 90.19% <92.00%> (+0.54%) ⬆️
pkg/apis/duck/v1/delivery_types.go 93.54% <100.00%> (+1.88%) ⬆️
pkg/kncloudevents/retries.go 100.00% <100.00%> (+13.63%) ⬆️
pkg/reconciler/subscription/subscription.go 82.46% <100.00%> (+2.23%) ⬆️

Legend
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 2cda8f4...5be0e20.

@knative-prow-robot knative-prow-robot added the area/test-and-release Test infrastructure, tests or release label Oct 21, 2021
// header value will be used when calculating the next backoff duration. This will
// only be considered when a 429 (Too Many Requests) or 503 (Service Unavailable)
// response code is received and Retry is greater than 0.
Enabled bool `json:"enabled"`
Member

shouldn't we always respect the Retry-After?

I am not really sure I like the toggling here.

Contributor Author

This PR is stale - based on the original design, which has evolved (and is evolving in the associated Issue). Yes, everyone was against the "enabled" flag and in the latest design it is gone. I'm waiting to update this PR until the new design settles - I need to summarize and push for agreement again...

// - https://www.iso.org/iso-8601-date-and-time-format.html
// - https://en.wikipedia.org/wiki/ISO_8601
// +optional
MaxDuration *string `json:"maxDuration,omitempty"`
Member

Assuming we do not toggle (just thinking out loud) and have respecting Retry-After as the default:

Setting MaxDuration: -1 could disable it.
If nothing (or a value greater than 0) is provided, we do as stated in the comments.

🤷

Contributor Author

Yes the idea is that at least in GA it would be always respected and you could opt out by specifying "PT0S" or something similar.

@matzew
Member

matzew commented Nov 9, 2021

/assign @matzew

@travis-minke-sap
Contributor Author

PR has been refactored to align with current plan and is ready for review!

Contributor

@odacremolbap odacremolbap left a comment


Awesome piece of code.
I added minor comments and learned some stuff about testing in eventing, thanks for that.

Could we add to the docs PR (not sure if also to the API) that when the backoff and Retry-After durations compete to set the duration, the larger is honored?

// Return The Larger Of The Two Backoff Durations
if retryAfterDuration > backoffDuration {
	return retryAfterDuration
} else {
	return backoffDuration
}

pkg/apis/duck/v1/delivery_types.go Outdated Show resolved Hide resolved
pkg/apis/duck/v1/delivery_types.go Outdated Show resolved Hide resolved
@@ -112,6 +119,15 @@ func RetryConfigFromDeliverySpec(spec v1.DeliverySpec) (RetryConfig, error) {
retryConfig.RequestTimeout, _ = timeout.Duration()
}

if spec.RetryAfterMax != nil {
Contributor

Not sure if this is too picky, but I think it would be a nice policy to re-check the feature enablement here. If an admin enables a feature, then disables it and restarts the controller, I think it would be expected that this value is not used.

This would be important for security related features, probably not for this one. Leaving the comment here for your consideration.

Contributor Author

While this might be nice to do it is a bit problematic. The goal was to do all validation of the experimental-feature in the Webhook in order to avoid requiring downstream implementations from having to load/watch the config-features ConfigMap into their context and then plumb that context through. I suppose we could build a separate ConfigMap lookup inline here, but that feels heavyweight and non-standard with other ConfigMap tracking logic. I'm inclined to not recheck the feature flags here but we can let others weigh in?

Contributor

I think you are right, and I hadn't thought of other implementations.
On my side, this can be resolved.

Contributor

@devguyio devguyio Jan 25, 2022

While this might be nice to do it is a bit problematic. The goal was to do all validation of the experimental-feature in the Webhook in order to avoid requiring downstream implementations from having to load/watch the config-features ConfigMap into their context and then plumb that context through. I suppose we could build a separate ConfigMap lookup inline here, but that feels heavyweight and non-standard with other ConfigMap tracking logic. I'm inclined to not recheck the feature flags here but we can let others weigh in?

I think we actually must check if the feature is enabled. New experimental API fields will be stored as unknown fields if the feature is disabled, and the webhook will ignore them. This means that in an "enable-disable" scenario, this code will find a valid value for RetryAfterMax and will honor it, even though the feature is disabled.

I believe experimental features are designed to be part of eventing core as:

  1. API changes that are optionally implemented in alternative implementations.
  2. A change in the reference implementation. This is to validate the feature, as well as provide a reference implementation for other alternative implementations.

That means that, by design, if an alternative implementation wants to adopt an experimental feature, it needs to watch the features ConfigMap and provide its own implementation. Otherwise, it can wait until the feature graduates and becomes "on by default".

pkg/kncloudevents/message_sender.go Outdated Show resolved Hide resolved
pkg/kncloudevents/message_sender.go Outdated Show resolved Hide resolved
@knative-prow-robot knative-prow-robot added lgtm Indicates that a PR is ready to be merged. needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. labels Nov 10, 2021
@pierDipi
Member

pierDipi commented Nov 18, 2021 via email

@travis-minke-sap
Contributor Author

@lionelvillard - are you still planning on reviewing? I realize it's a holiday week so it can wait till next week if you're out - just checking ; )

docs/eventing-api.md Outdated Show resolved Hide resolved
@lionelvillard
Member

Can you replace "fixes #5811" with "parts of #5811", to show this PR is one step towards fully implementing #5811?

/unhold

@knative-prow-robot knative-prow-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Nov 23, 2021
@lionelvillard
Member

/lgtm
/approve

@knative-prow-robot knative-prow-robot added the lgtm Indicates that a PR is ready to be merged. label Nov 23, 2021
@knative-prow-robot
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: lionelvillard, odacremolbap, travis-minke-sap

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@knative-prow-robot knative-prow-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Nov 23, 2021
@knative-prow-robot knative-prow-robot removed the lgtm Indicates that a PR is ready to be merged. label Nov 23, 2021
@travis-minke-sap
Contributor Author

Can you replace "fixes #5811" with "parts of #5811", to show this PR is one step towards fully implementing #5811?

yep - good call - thanks!

@knative-metrics-robot

The following is the coverage report on the affected files.
Say /test pull-knative-eventing-go-coverage to re-run this coverage report

File Old Coverage New Coverage Delta
pkg/apis/duck/v1/delivery_types.go 95.2% 96.3% 1.1
pkg/kncloudevents/message_sender.go 88.2% 92.5% 4.3
pkg/kncloudevents/retries.go 94.7% 100.0% 5.3
pkg/reconciler/subscription/subscription.go 86.1% 87.2% 1.0

@travis-minke-sap
Contributor Author

/retest

@lionelvillard
Member

/lgtm

@knative-prow-robot knative-prow-robot added the lgtm Indicates that a PR is ready to be merged. label Nov 23, 2021
@knative-prow-robot knative-prow-robot merged commit 820db20 into knative:main Nov 23, 2021
Member

@pierDipi pierDipi left a comment


This doesn't work for a Channel-based Broker since the filter handler doesn't propagate response headers back to the channel.

func (h *Handler) send(ctx context.Context, writer http.ResponseWriter, headers http.Header, target string, reportArgs *ReportArgs, event *cloudevents.Event, ttl int32) {
	// send the event to trigger's subscriber
	response, err := h.sendEvent(ctx, headers, target, event, reportArgs)
	if err != nil {
		h.logger.Error("failed to send event", zap.Error(err))
		writer.WriteHeader(http.StatusInternalServerError)
		_ = h.reporter.ReportEventCount(reportArgs, http.StatusInternalServerError)
		return
	}
	h.logger.Debug("Successfully dispatched message", zap.Any("target", target))

	// If there is an event in the response write it to the response
	statusCode, err := h.writeResponse(ctx, writer, response, ttl, target)
	if err != nil {
		h.logger.Error("failed to write response", zap.Error(err))
	}
	_ = h.reporter.ReportEventCount(reportArgs, statusCode)
}

func (h *Handler) sendEvent(ctx context.Context, headers http.Header, target string, event *cloudevents.Event, reporterArgs *ReportArgs) (*http.Response, error) {
	// Send the event to the subscriber
	req, err := h.sender.NewCloudEventRequestWithTarget(ctx, target)
	if err != nil {
		return nil, fmt.Errorf("failed to create the request: %w", err)
	}

	message := binding.ToMessage(event)
	defer message.Finish(nil)

	additionalHeaders := utils.PassThroughHeaders(headers)
	// Following the spec https://github.com/knative/specs/blob/main/specs/eventing/data-plane.md#derived-reply-events
	additionalHeaders.Set("prefer", "reply")

	err = kncloudevents.WriteHTTPRequestWithAdditionalHeaders(ctx, message, req, additionalHeaders)
	if err != nil {
		return nil, fmt.Errorf("failed to write request: %w", err)
	}

	start := time.Now()
	resp, err := h.sender.Send(req)
	dispatchTime := time.Since(start)
	if err != nil {
		err = fmt.Errorf("failed to dispatch message: %w", err)
	}

	sc := 0
	if resp != nil {
		sc = resp.StatusCode
	}

	_ = h.reporter.ReportEventDispatchTime(reporterArgs, sc, dispatchTime)
	return resp, err
}

// The return values are the status
func (h *Handler) writeResponse(ctx context.Context, writer http.ResponseWriter, resp *http.Response, ttl int32, target string) (int, error) {
	response := cehttp.NewMessageFromHttpResponse(resp)
	defer response.Finish(nil)

	if response.ReadEncoding() == binding.EncodingUnknown {
		// Response doesn't have a ce-specversion header nor a content-type matching a cloudevent event format
		// Just read a byte out of the reader to see if it's non-empty, we don't care what it is,
		// just that it is not empty. This means there was a response and it's not valid, so treat
		// as delivery failure.
		body := make([]byte, 1)
		n, _ := response.BodyReader.Read(body)
		response.BodyReader.Close()
		if n != 0 {
			// Note that we could just use StatusInternalServerError, but to distinguish
			// between the failure cases, we use a different code here.
			writer.WriteHeader(http.StatusBadGateway)
			return http.StatusBadGateway, errors.New("received a non-empty response not recognized as CloudEvent. The response MUST be either empty or a valid CloudEvent")
		}
		h.logger.Debug("Response doesn't contain a CloudEvent, replying with an empty response", zap.Any("target", target))
		writer.WriteHeader(resp.StatusCode)
		return resp.StatusCode, nil
	}

	event, err := binding.ToEvent(ctx, response)
	if err != nil {
		// Like in the above case, we could just use StatusInternalServerError, but to distinguish
		// between the failure cases, we use a different code here.
		writer.WriteHeader(http.StatusBadGateway)
		// Malformed event, reply with err
		return http.StatusBadGateway, err
	}

	// Reattach the TTL (with the same value) to the response event before sending it to the Broker.
	if err := broker.SetTTL(event.Context, ttl); err != nil {
		writer.WriteHeader(http.StatusInternalServerError)
		return http.StatusInternalServerError, fmt.Errorf("failed to reset TTL: %w", err)
	}

	eventResponse := binding.ToMessage(event)
	defer eventResponse.Finish(nil)

	if err := cehttp.WriteResponseWriter(ctx, eventResponse, resp.StatusCode, writer); err != nil {
		return http.StatusInternalServerError, fmt.Errorf("failed to write response event: %w", err)
	}

	h.logger.Debug("Replied with a CloudEvent response", zap.Any("target", target))
	return resp.StatusCode, nil
}

@lionelvillard
Member

@pierDipi can you open an issue?

@pierDipi
Member

Done

@travis-minke-sap travis-minke-sap deleted the retry-after-experimental-feature branch November 24, 2021 15:21
Contributor

@devguyio devguyio left a comment


I believe we need a follow-up to check if the feature is enabled. The current behavior will be problematic in the case of enabling and then disabling the feature.


Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. area/test-and-release Test infrastructure, tests or release cla: yes Indicates the PR's author has signed the CLA. lgtm Indicates that a PR is ready to be merged. size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files.