-
Notifications
You must be signed in to change notification settings - Fork 7.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Envoy issuing 503's intermittently #15285
Comments
Just been re-reading this @chrismckean, the reason you're seeing a connection reset by the source envoy is because it received a 503 from the destination envoy and http1.1 spec dictates that connection must be closed. So that part of the picture can be disregarded, and we need to focus on the connection issue in the destination envoy to destination app. |
This issue has been automatically marked as stale because it has not had activity in the last 90 days. It will be closed in the next 30 days unless it is tagged "help wanted" or other activity occurs. Thank you for your contributions. |
@chrismckean, @Stono does the issue still appears in 1.3? |
Yes, we still see occasional 503s but not to the level we used to, and the investigation here is likely stale. I'll close this ticket and re-open another with more recent info if we feel we need to. Ta |
Karl is correct. The errors are present still but are at a manageable rate. I'll close this issue. |
@Stono @chrismckean great, thanks! |
@vadimeisenbergibm we are still seeing 503 UC errors #18043 |
Bug description
Hi All,
@Stono @martinhighton and myself have been looking into an issue with some of our K8 applications getting 503 errors. We're seeing roughly 5-10 an hour between the 2 applications we've been targeting in this investigation. I have obtained the istio debug logs and some packet captures. From the captures, you can see that the source application is the one sending the reset (below). The applications talking aren't particularly busy, so I'm guessing there's some sort of connection timeout issue.
I've looked at the debug logs for a while and I can't figure out how to read them. I've not managed to find a lot of documentation on them online. I'm assuming that the 'Cxx' value is a connection. So, all of the entries under, say, C22 relate to one connection between the source and destination envoy? If that is the case should the connection number on the destination logs match the source logs??
There's not a lot to see from the packet capture. Just that line above. Here are what I believe to be the relevant debug logs from the istio container
Source
Destination
We're having trouble figuring out exactly what is going on here. Could someone possibly help read these debug logs? The source envoy just seems to reset the connection. If the connection numbers should match then the source envoy is trying to use a connection that maybe doesn't exist on the destination envoy anymore? If this isn't the case I'm not sure why the source envoy is resetting the connection
Please let me know if somebody needs any more info on this problem.
--
Cheers
Chris
Affected product area (please put an X in all that apply)
[ ] Configuration Infrastructure
[ ] Docs
[ ] Installation
[x] Networking
[x ] Performance and Scalability
[ ] Policies and Telemetry
[ ] Security
[ ] Test and Release
[ ] User Experience
[ ] Developer Infrastructure
Expected behavior
Envoy not to issue a 503 unless there is a problem with the source, destination app
Steps to reproduce the bug
This happens randomly
Version (include the output of
istioctl version --remote
andkubectl version
)Kubelet v1.13
Istio v.1.2.2
Environment where bug was observed (cloud vendor, OS, etc)
GKE
master 1.13.6-gke.13 (Nodes v1.12.7-gke.24)
Container-Optimized OS from Google Kernel v.4.14.127+ docker://17.3.2
The text was updated successfully, but these errors were encountered: