Lost logcontexts during federation send timeouts #5604
Comments
The basis of the problem is that when the request is cancelled, only the .well-known fetch is actually cancelled: the SRV lookup and main request are attempted anyway, but there is now nothing to unpick the log context. To be honest, the logcontext feels like the least of the problems here. Failing test on the
Likewise, if the SRV lookup takes ages, then when the federation request times out, there is nothing left to catch and report the (eventual) failure of the DNS lookup, which makes it hard to tell what the failure was.
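The failure mode described above can be sketched in a few lines. This is an asyncio analogy only, not Synapse's Twisted code: `slow_step`, `send_request`, `run_with_timeout` and the `swallow_cancel` flag are hypothetical names invented for illustration, and the delays are arbitrary. It assumes the bug has the shape "cancellation of the first step is handled like an ordinary failure, so the later steps still run".

```python
import asyncio


async def slow_step(name, events, delay):
    """Stand-in for one network step (hypothetical; not a Synapse function)."""
    events.append(f"{name}: start")
    await asyncio.sleep(delay)
    events.append(f"{name}: done")


async def send_request(events, swallow_cancel):
    try:
        await slow_step("well-known", events, delay=10)
    except asyncio.CancelledError:
        if not swallow_cancel:
            raise  # propagate: the later steps never start
        # Buggy variant: cancellation is treated like any other failed
        # .well-known fetch, so the code falls through to the next steps.
        events.append("well-known: cancelled, falling back")
    await slow_step("srv-lookup", events, delay=0)
    await slow_step("main-request", events, delay=0)


async def run_with_timeout(swallow_cancel):
    events = []
    task = asyncio.ensure_future(send_request(events, swallow_cancel))
    await asyncio.sleep(0.01)  # the caller's timeout fires here
    task.cancel()
    try:
        await task
    except asyncio.CancelledError:
        events.append("caller: request cancelled")
    return events


buggy = asyncio.run(run_with_timeout(swallow_cancel=True))
fixed = asyncio.run(run_with_timeout(swallow_cancel=False))
```

In the `swallow_cancel=True` run the SRV lookup and main request still execute after the caller has given up, which is the situation where nothing remains to restore the log context or report a late failure. In the `swallow_cancel=False` run the cancellation reaches the caller and no further steps start.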
ok, please remember to add references to any relevant issues when you create a PR, to make it easier to cross-reference things later ;) I'm going to go ahead and close this, and open new issues for the remaining problems.
filed as #10855
we're still seeing reports of this in Synapse 1.45, so I guess it's not fully fixed.
it appears this was an incorrect report: the server in question was in fact running 1.43 at the time the messages were logged.
My logs are full of warnings like this:
Evidently something is dropping a logcontext that shouldn't be.