-
Notifications
You must be signed in to change notification settings - Fork 37
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hanging forever when low memory #103
Comments
Hi @gygabyte017, thanks for the report. I'm not sure if this is possible for you, but it would be helpful to see if any logging is collected (but not displayed) before it hangs. Are you able to reproduce the issue from a python repl? If so, the instructions in this issue might yield some extra info that would be helpful (#36 (comment)). If possible, what would be most helpful would be a reproducible example consisting of:
Thanks! |
Hi, unfortunately it is hard for me to give you what you asked, sorry about that :( because since it doesn't happen on my local pc while testing and it only happens on serverless containers spawned on EKS, andthe plotting happens after a lot of complex calculations involving other resources.
(Not sure about how could I access the frozen container and send an interrupt and interact with repl to provide more info). Thanks |
Thanks for this info @gygabyte017, that's helpful. Marking as a bug. |
Same happening for the |
@Bhanuchander210, are you seeing this behavior being related to low memory as well? |
@jonmmease |
Ok, thanks @Bhanuchander210. |
Notes: We're already refreshing the page when the heap reaches 50% of the maximum allowed. But I don't know whether this maximum limit (as returned by
|
Hi, Is there any progress on that topic? Here is the code that I use to limit the virtual memory. (Thats something we need to do for that specific program to make sure that i wont get in conflict with the productive processes...)
Since it is the first export anyways, I cannot use the proposed workaround with Im running on
|
Hi @gygabyte017 Did you ever resolve this issue? Did downgrading to v0.1.0 work to solve this issue? Thanks. |
Hi @MaartenBW, unfortunately I didn't, any version seems random, I don't believe there are reasons to prefeer 0.1.0 over 0.2.1 or whatever, it's just luck depending on the machine resources. I managed to develop a ugly workaround, that is: 1) Increase the maximum ram on the containers, even though it wouldn't be necessary, and 2) the write_image is executed in a separate process with a timeout, if after i.e. 30 seconds it is still working, I kill the separate process and try again up to 5 tries. In this way it's very rare that all the 5 tries fails, however it may still happen. Now I want to try the solution described here, maybe it can work in a stable way? #110 (comment) |
@gygabyte017 Wow, thanks for your fast reply. |
Hi, am experiencing kaleido randomly freezing on our production environment (unix with kubernetes).
I noticed that when the container has low memory, perhaps because the main python program consumed a lot of resources, for instance for holding the dataframe data needed to be plotted, when it calls
write_image
it will hang forever.The kaleido process never terminates, there are no errors about a low memory condition, it just sits there with zero cpu consumption forever.
How can this be improved?
This behavior is very frustrating because sometimes I just find containers stuck running forever, that if I manually relaunch with the very same conditions they may run correctly, so I have no way to monitor if they got stuck,.
It would be ok that kaleido returns a memory error or a process failed exception, then it could be handled. But freezing forever... is just bad.
Any advice? Thank you
The text was updated successfully, but these errors were encountered: