Bug: Using active deadline does not wait for container to terminate before finishing #64
Labels
area/workloads
Related to workload execution (e.g. jobs, tasks)
component/execution
Issues or PRs related exclusively to the Execution component (Job, JobConfig)
kind/bug
Categorizes issue or PR as related to a bug.
Milestone
Currently, we use active deadlines to kill Pods, which apparently does not ensure that the container is already terminated before
The following JobConfig allows us to replicate this issue. We use https://github.com/irvinlim/signalbin to test interactions with signal handlers.
When killing the Job with
killTimestamp
, we see that the Job had reached a Killed phase even while the container is running.Once the container is complete, we can see that the logs stopped (meaning that the container exited), and the Pod's containerStatuses
running
moved toterminated
.The implications of this include:
Possible solutions:
status.phase
of the Pod to determine the task state. In this case, we need to look at thecontainerStatuses
AND thephase
to determine if all containers are dead AND they will not be recreated.The text was updated successfully, but these errors were encountered: