EPoll error capture - store in NetEvent for later processing. #7803

SolidWallOfCode · 2021-05-10T19:14:40Z

Testing some HostDB issues, it turns out that if there is an error reported epoll that is simply dropped on the floor and the error isn't detected until a subsequent I/O operation. The best way to handle this is debatable, but it was agreed the first step is to store the error in a way that can be found in later processing.

With the current code if the socket is being checked for being writable, on error epoll will report both writable and error. Because the error indication is dropped, this generates a WRITE_READY as if there were no error. This in turn causes the state machine to validate the TCP handshake even though it did not occur, which means the subsequent error on writing the proxy request causes a 502 response to the user agent but has no effect on marking the upstream as dead.

@shinrich notes that we may want to handle this differently for read vs. write, because in a read situation there may be useful data on the socket, but that's never the case for a write. Therefore it might be reasonable to ignore the error for read, but signal it for write.

SolidWallOfCode · 2021-05-10T20:11:47Z

Further testing reveals this is a problem when there is a negative ICMP response. In that case epoll reports an error. This is a bit more serious as it would be quite reasonable in this case to indicate the upstream IP address is dead.

SolidWallOfCode · 2021-05-13T03:06:57Z

Superceded by #7809.

SolidWallOfCode added Core Cleanup labels May 10, 2021

SolidWallOfCode added this to the 10.0.0 milestone May 10, 2021

SolidWallOfCode requested review from bryancall and shinrich May 10, 2021 19:14

SolidWallOfCode self-assigned this May 10, 2021

EPoll error capture - store in NetEvent for later processing.

a5c2b4a

SolidWallOfCode force-pushed the netevent-store-epoll-error-indicator branch from 68c3a45 to a5c2b4a Compare May 10, 2021 20:10

shinrich mentioned this pull request May 11, 2021

Save and propagate epoll network error #7809

Merged

SolidWallOfCode closed this May 13, 2021

zwoop modified the milestones: 10.0.0, 9.2.0 Sep 23, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

EPoll error capture - store in NetEvent for later processing. #7803

EPoll error capture - store in NetEvent for later processing. #7803

Uh oh!

SolidWallOfCode commented May 10, 2021 •

edited

Loading

Uh oh!

SolidWallOfCode commented May 10, 2021 •

edited

Loading

Uh oh!

SolidWallOfCode commented May 13, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

EPoll error capture - store in NetEvent for later processing. #7803

EPoll error capture - store in NetEvent for later processing. #7803

Uh oh!

Conversation

SolidWallOfCode commented May 10, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SolidWallOfCode commented May 10, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SolidWallOfCode commented May 13, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

SolidWallOfCode commented May 10, 2021 •

edited

Loading

SolidWallOfCode commented May 10, 2021 •

edited

Loading