Job processing failing after booting new worker #1499
Closed
bagedevimo
started this conversation in
General
Replies: 1 comment 2 replies
-
That's strange! The Are you seeing the process in the |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
We're seeing situations where a job process (or multiple) is no longer processing jobs after being booted during a deploy.
Our platform runs on ECS in containers, with multiple containers booted to run jobs with some configuration on which queues each set of workers runs from and which priority, via CLI flags. Sporadically, we see situations where the good_job process is booted but does not participate in running any jobs. We've only had a few attempts to try and debug this, and so far discovered very little. We're seeing nothing of interest in the logs or in our error monitoring.
What I'm hoping to figure out is: what tips do y'all have for debugging this?
Beta Was this translation helpful? Give feedback.
All reactions