-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
'Application not available' when starting a Che workspace #17542
Comments
This is a known issue that is supposed to be addressed in the upstream #17002 |
I have some doubts about #17002 and the current issue. Theia is a special server that is tested before reported that workspace is Running. |
I am wondering if there is a global latency between the time the server is available internally (from the same cluster) and the time when it is available externally. I have also noticed that issue when opening an app preview in Che. it is happening randomly but still quite frequently. Could we do a |
Application not available
when starting a Che workspace
or is readiness probe missing? |
@skabashnyuk this might be related to route flakiness in this case and we might need to enable retries (e.g. mark theia as ready only after 3 successful checks). As I recall the retry functionality should be already in place and available for configuration |
The issue should be fixed on Hosted Che end by increasing the success threshold - redhat-developer/rh-che#2009 |
@ibuziuk would that make sense to have that value the default one in Che ? |
@sunix it depends how easy it to repro against upstream + some default infra |
TL;DR this issue is infra related and the optimal config may vary |
I frequently have that issue on openshift 4.4 |
frequently => maybe 1/10 workspace start, but it is very random obviously. |
yeah, so we can increase it in the success threshold in the upstream, but it would result in slower performance - so it is a tradeoff. |
This is a central outage issue and if can not affect it. Closing since there was no issue reported during the periodic test summary. |
I understand but I don't think changing threshold is the fix. The dashboard javascript client should check if the route works or not and retry. |
@ibuziuk ok to reopen that one ? |
@sunix the real solution is single-host, we just need to wait till it becomes the default option. You can reopen, but I doubt that anything apart from what has been already done will be improved |
Let's reopen it and close it when we have a proper fix (single host or anything else) |
I've encountered that issue again when testing CRW. |
I faced a bit different but similar issue #19059 |
@sleshchenko I put the UD label |
@Katka92 @sleshchenko folks, isn't this problem multi-host specific (should not be reproducible against single-host configuration)? |
IDK, I'm not load testing with single-host config. Maybe it would be worth adding that test case to load tests, but with lower priority than multi-host. Now I'm testing config with multi-host (~100 workspace startups for now) and I haven't seen that yet, but it was seen for testing CRW 2.9 (based on Che 7.30.x). I can try with single-host if I have time and let you know. |
@skabashnyuk do you have any input on when we will have a single-host enabled by default? |
I don't know |
I think now single host is the default suggested exposure strategy, so it should not happen and from dashboard point of view it would be overhead. Closing it. |
Describe the bug
Starting a workspace it happens randomly about 3 times a day in che.openshift.io
I guess there is a synch issue, che-theia may be available internally but not externally maybe?
The text was updated successfully, but these errors were encountered: