-
Notifications
You must be signed in to change notification settings - Fork 14.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
When an ECS Task fails to start, ECS Operator raises a CloudWatch exception #15000
Comments
Thanks for opening your first issue here! Be sure to follow the issue template! |
Yes. And Fargate tasks are more likely to experience this
But luckily we only see around 1% Fargate tasks failed to start due to this random issue. I didn't close #11663 just to remind me to create a new issue here but apparently @kanga333 beat me to it 😁 |
Apache Airflow version: 1.10.13
Environment:
uname -a
): 4.14.209-160.339.amzn2.x86_64What happened:
When an ECS Task exits with
stopCode: TaskFailedToStart
, the ECS Operator will exit with a ResourceNotFoundException for the GetLogEvents operation. This is because the task has failed to start, so no log is created.What you expected to happen:
ResourceNotFoundException is misleading because it feels like a problem with CloudWatchLogs. Expect AirflowException to indicate that the task has failed.
How to reproduce it:
This can be reproduced by running an ECS Task that fails to start, for example by specifying a non-existent entry_point.
Anything else we need to know:
I suspect Issue #11663 has the same problem, i.e. it's not a CloudWatch issue, but a failure to start an ECS Task.
The text was updated successfully, but these errors were encountered: