[providers-cncf-kubernetes] Throttle HTTPError during consume pod logs #54761

AutomationDev85 · 2025-08-21T06:34:37Z

Overview

We use Airflow to scale different kind of tasks and some tasks are running for a long time (> 5 hours) and we are using the KubernetesPodOperator to start a Pod and track the log output of the task. We also use Keda autoscaling to scale up and down the cluster nodes. The issue is that during scale up and down the consume log http connection is disturbed as the API backend also scales up and down. We see then an exception in the log file which confuses the user, as he thinks it has something to do with his task.

As the pod manager already tries to reconnect automatically the idea is to add the HTTPError only it it occurred more than 2 times in 60 seconds to not confuse a user log with exceptions which are ok an handled.

Details of change:

Track the amount of HTTP errors.
Only add exception text to the log file, if more than 2 HTTPError occurred in the last 60 Seconds.
With that the normal reconnect is not visible in the log file.
Warning about possible duplication of the log file lines due to short log file read interruption is still added to the log file.

jscheffl

Looks good for me. Whereas the readability of the threshold is a bit hard. I am thinking a bit about if it would be possible to put the check for threshold in a small utility that can be shared for more similar cases? But not blocking...

potiuk

Good for me as well.

Co-authored-by: AutomationDev85 <AutomationDev85>

Throttle HTTPError during consume pod logs

548d441

AutomationDev85 requested review from hussein-awala and jedcunningham as code owners August 21, 2025 06:34

boring-cyborg bot added area:providers provider:cncf-kubernetes Kubernetes (k8s) provider related issues labels Aug 21, 2025

jscheffl approved these changes Aug 21, 2025

View reviewed changes

potiuk approved these changes Aug 22, 2025

View reviewed changes

potiuk merged commit 1cb057f into apache:main Aug 22, 2025
86 checks passed

mangal-vairalkar pushed a commit to mangal-vairalkar/airflow that referenced this pull request Aug 30, 2025

Throttle HTTPError during consume pod logs (apache#54761)

4786a24

Co-authored-by: AutomationDev85 <AutomationDev85>

eladkal mentioned this pull request Sep 5, 2025

Status of testing Providers that were prepared on September 05, 2025 #55285

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[providers-cncf-kubernetes] Throttle HTTPError during consume pod logs #54761

[providers-cncf-kubernetes] Throttle HTTPError during consume pod logs #54761

Uh oh!

AutomationDev85 commented Aug 21, 2025

Uh oh!

jscheffl left a comment

Uh oh!

potiuk left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[providers-cncf-kubernetes] Throttle HTTPError during consume pod logs #54761

[providers-cncf-kubernetes] Throttle HTTPError during consume pod logs #54761

Uh oh!

Conversation

AutomationDev85 commented Aug 21, 2025

Overview

Details of change:

Uh oh!

jscheffl left a comment

Choose a reason for hiding this comment

Uh oh!

potiuk left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants