Make gantt view to show also retries #17487
Comments
I've been thinking about how to make the Gantt view more useful (I think it has a lot of potential). Showing the full state history of a task instance would be a great part of that.
I would like to try implementing this.
Sweet. Just assigned you. Feel free to reach out if you need anything!
@ShakaibKhan @bbovenzi see this example. As you can see, there is a significant gap between the tasks. When you hover over the task, you see it is the 2nd retry. The log shows that the first retry ran for the duration of the purple rectangle.
@alexInhert makes sense to me, and I agree that retries should be shown. Could you please share the DAG you used to make the above Gantt chart?
Sorry, I do not think I can take on this issue at this time. Unassigning myself.
Yes, we record Task Fails, and they should be included in the Gantt chart. We also record Task Reschedules, which would probably be helpful. But I think that's the most we could do with the existing tables in Airflow.
TaskFail entries are now displayed in the Gantt chart with #37918.
Any other improvements will come with AIP-64 |
Description
Currently the Gantt view shows only the last successful try of each task.
For example, with t1 >> t2 >> t3, suppose we had:
t1 started at 08:00, finished at 09:00
t2 1st retry started at 09:01, failed at 13:00
t2 2nd retry started at 13:02, finished at 15:00
t3 started at 15:01, finished at 15:02
What the Gantt view will show us is:
t1 08:00-09:00
t2 13:02-15:00
t3 15:01-15:02
This makes it really hard to understand what actually happened, because the chart omits the failed first try of t2 and leaves an unexplained gap between 09:00 and 13:02.
This is very annoying for DAGs with many tasks, because you can't tell whether the gap is due to tasks not being scheduled or to something else. I understand the value of the current display, but maybe a toggle could be added to also show all the task instances?
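To make the difference concrete, here is a minimal sketch of the example above in plain Python. It does not use Airflow's models or API; the `Attempt` class and the helper functions are hypothetical names made up for illustration. It only contrasts a "last try only" view with an "all tries" view of the same data.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Attempt:
    """One try of a task. Hypothetical structure, not an Airflow model."""
    task_id: str
    try_number: int
    start: str  # zero-padded "HH:MM", so string order matches time order
    end: str
    state: str


# The timeline from the description above.
ATTEMPTS = [
    Attempt("t1", 1, "08:00", "09:00", "success"),
    Attempt("t2", 1, "09:01", "13:00", "failed"),
    Attempt("t2", 2, "13:02", "15:00", "success"),
    Attempt("t3", 1, "15:01", "15:02", "success"),
]


def last_attempt_only(attempts):
    """Keep only the highest try_number per task (the current display)."""
    latest = {}
    for a in attempts:
        if a.task_id not in latest or a.try_number > latest[a.task_id].try_number:
            latest[a.task_id] = a
    return sorted(latest.values(), key=lambda a: a.start)


def all_attempts(attempts):
    """Keep every try, ordered by start time (the requested display)."""
    return sorted(attempts, key=lambda a: (a.start, a.try_number))


def bars(rows):
    """Render each attempt as one text 'bar' of the chart."""
    return [
        f"{a.task_id} (try {a.try_number}, {a.state}) {a.start}-{a.end}"
        for a in rows
    ]


if __name__ == "__main__":
    print("Last try only:")
    print("\n".join(bars(last_attempt_only(ATTEMPTS))))
    print("All tries:")
    print("\n".join(bars(all_attempts(ATTEMPTS))))
```

In the first view, t2 appears only as 13:02-15:00, so the four hours after t1 look idle. In the second view, the failed try of t2 fills that gap.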
Use case / motivation
The Gantt view should somehow show what actually happened.
Are you willing to submit a PR?
Don't have the skills for that.