Improve job acquisition job execution logging #2666
Comments
I would like to pick this one to start contributing. Please let me know if that's okay.
Great to hear you would like to make a contribution! We are always happy to assist you in doing so. Best,
Hi @yanavasileva, thank you for responding. I found this issue interesting when browsing through the open issues. I am going through the contribution guidelines and will reach out in case of questions. Thank you!
Hi @yanavasileva, hope you are doing well. I was able to set up a local Camunda cluster (Camunda 7.20) with the nodes pointing to a single PostgreSQL DB instance, and I was able to reproduce the issue. I used a model with an async continuation task that calls a delegate which takes longer to complete than the value of the lock-time-in-millis property. I'm working with a single process instance of the model, and hence a single job to be executed. I was able to observe the below from the logs:
The difference I observed compared to the logs in the description is that my logs report 1 acquired job, whereas the description shows 0 acquired jobs followed by Execute Job logs. Since the number of acquired jobs conflicts with the Observed Behavior stated in the description, I wanted to check with you. Attaching my log snippets below for reference.
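The reproduction scenario in the comment above (a job whose delegate runs longer than its lock) can be illustrated with a small plain-Java sketch. This is not engine code and not the attached logs: the class, the `Job` fields, and the timing values are all hypothetical, and the revision check only stands in for the optimistic locking the issue refers to as OLEs.

```java
// Sketch only: plain Java, no Camunda classes. All names are hypothetical.
final class LockExpirySketch {

    /** A job row reduced to the two fields that matter for this scenario. */
    static final class Job {
        long lockExpiresAtMillis = 0; // 0 means "not locked"
        int revision = 1;             // bumped on every successful update
    }

    /** True if the job's lock has lapsed (or was never set) at nowMillis. */
    static boolean isAcquirable(Job job, long nowMillis) {
        return job.lockExpiresAtMillis <= nowMillis;
    }

    /** Locks the job and returns the revision the acquiring node now holds. */
    static int acquire(Job job, long nowMillis, long lockTimeMillis) {
        job.lockExpiresAtMillis = nowMillis + lockTimeMillis;
        job.revision++;
        return job.revision;
    }

    /** Succeeds only if nobody updated the row since expectedRevision was read. */
    static boolean complete(Job job, int expectedRevision) {
        if (job.revision != expectedRevision) {
            return false; // stands in for an optimistic locking failure
        }
        job.revision++;
        return true;
    }

    public static void main(String[] args) {
        long lockTimeMillis = 1_000; // assumed lock duration
        long pollAtMillis = 2_000;   // assumed time at which node B polls

        Job job = new Job();
        // Node A acquires at t=0; its delegate is assumed to finish at t=3000.
        int revA = acquire(job, 0, lockTimeMillis);

        // Node B polls after the lock expired but before node A has finished.
        boolean bCanAcquire = isAcquirable(job, pollAtMillis);
        if (bCanAcquire) {
            acquire(job, pollAtMillis, lockTimeMillis);
        }

        // Node A finishes and tries to complete with the revision it read.
        boolean aCompleted = complete(job, revA);

        System.out.println("B acquired same job: " + bCanAcquire);
        System.out.println("A completed: " + aCompleted);
    }
}
```

With these assumed timings, node B can pick up the same job while node A is still running it, and node A's completion is then rejected. Whether the engine logs this as 0 or 1 acquired jobs is exactly the question raised in the comment and is not modeled here.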
Sounds great that you're getting started!
You do have the Best,
@yanavasileva: I have created the PR below for this issue. Could you please take a look? Thanks!
This issue was imported from JIRA:
Environment (Required on creation):
Camunda Platform, cluster of 2 or more nodes.
Description (Required on creation; please attach any relevant screenshots, stacktraces, log files, etc. to the ticket):
As described in SUPPORT-12684, with:
It is possible for the nodes to become locked in a cycle of constantly processing the same jobs, getting OptimisticLockingExceptions (OLEs), and producing debug logging like:
Steps to reproduce (Required on creation):
Analysis from this comment regarding the scenario that causes the above logging:
Observed Behavior (Required on creation):
The DEBUG logs are unhelpful and a bit confusing:
If 0 jobs were acquired, how were there 3 jobs to execute?
Where are the OLE exception errors?
Expected behavior (Required on creation):
Helpful DEBUG logs, including:
Root Cause (Required on prioritization):
Solution Ideas (Optional):
Hints (optional):
Links:
Pull requests