k8s hook runner fails occasionally #41
Comments
Can you please provide an example repository to reproduce this? The only thing I can think of that can fail with an error message like this is this line:
We probably want to wrap this in try/catch as well, but I am unsure. Is it possible that something is breaking the JSON, causing JSON.parse to throw and the promise to be rejected? If you can provide a reproduction repo, or just an example I can create, it would be really helpful!
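One way to check the JSON theory is to guard the parse so that a malformed payload produces a descriptive error rather than a bare rejected promise. The following is only a minimal sketch under that assumption, not the hook's actual code; `parseHookInput` and the wording of the error message are hypothetical.

```typescript
// Hypothetical helper, not taken from the hook's source: parse the raw hook
// input and attach context to the error if the payload is not valid JSON.
function parseHookInput(raw: string): unknown {
  try {
    return JSON.parse(raw)
  } catch (err) {
    const reason = err instanceof Error ? err.message : String(err)
    // Report the payload length so truncated input is easy to spot in logs.
    throw new Error(
      `Failed to parse hook input (${raw.length} characters): ${reason}`
    )
  }
}
```

If the payload turns out to be cut off, the reported length should make that visible without having to log the payload itself.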
Hello @nikola-jokic
This information is really useful! If you find more information, it would be amazing. But I will start trying to reproduce this issue myself. I think I might have an idea where this might occur. I'll let you know if I get stuck and need more information. In the meantime, if you find anything else that might help us, please let us know.
Hey @nikola-jokic
Hey @atahardjebbar-ledger, thanks! Happy new year to you as well!
Hello @nikola-jokic
I would love to dig deeper. Can you please provide an example workflow that I can use to reproduce it? Can you see the error? The fix was meant to pull the input read from stdin into the try/catch so that the exception does not obscure the error for further inspection. By heavy workflows, do you mean that they run for a long time, or that they send a lot of data to the hook in a call? Also, is it possible that the runner runs out of memory on that machine, causing this exception? Can you please provide the runner's diagnostic logs? That might be helpful.
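The idea of moving the stdin read inside the try/catch can be sketched roughly as follows. This is an illustration only and does not reproduce the fix itself; `readAll`, `runHook`, and the `handle` callback are hypothetical names, and the input is modelled as a generic async iterable of string chunks.

```typescript
// Hypothetical sketch: names and structure are illustrative, not the hook's code.

// Collect every chunk from the input source into a single string.
async function readAll(source: AsyncIterable<string>): Promise<string> {
  let data = ''
  for await (const chunk of source) {
    data += chunk
  }
  return data
}

// Run the hook and return a process-style exit code (0 = success, 1 = failure).
async function runHook(
  source: AsyncIterable<string>,
  handle: (input: unknown) => Promise<void>
): Promise<number> {
  try {
    // Reading and parsing both happen inside the try block, so a broken
    // stream or malformed JSON is reported like any other hook failure.
    const raw = await readAll(source)
    await handle(JSON.parse(raw))
    return 0
  } catch (err) {
    const detail = err instanceof Error ? err.stack ?? err.message : String(err)
    console.error(`Hook failed: ${detail}`)
    return 1
  }
}
```

In a Node.js process, `process.stdin` could presumably be passed as `source` after calling `process.stdin.setEncoding('utf8')`, since readable streams are async iterable.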
Hi @nikola-jokic, Regarding the workflow file, I can't share it publicly, but it consists of multiple jobs/actions that do the compilation. The workflow has around 20 jobs, each with a runtime of 5 min to 1h30 (for a total workflow time of approximately 4h). We now have more information about the error from the debug log (sent to GitHub); in this case, it happened during the "Post-Clone" operation:
Thanks for your help; also, do you have a recommended way to extract logs from the
Hey @AEnguerrand, Thank you for providing this! I'll try my best to reproduce it, but this might require another change to see exactly what is happening. The only place that is outside the
Hey @AEnguerrand, @atahardjebbar-ledger, Can you try the fix from PR #65?
Hi @nikola-jokic, Yes, I'm setting up the GitHub Actions runners to use this version; thanks. I will post the result after a few runs 👍
Hello,
We're having an issue on our self-hosted runner. We're using the k8s hook to execute our jobs, but some jobs fail occasionally with this error, and we can't seem to understand what the issue is.
Image: summerwind/actions-runner:v2.296.1-ubuntu-20.04
GHES version: 3.6.1
Any idea what the issue is?
Thank you