I think ThriftClientManager's ClientMap has some problem, it will be caused Connection reset by peer
#5413
Labels
affects/none
PR/issue: this bug affects none version.
process/fixed
Process of bug
severity/none
Severity of bug
type/bug
Type: something is unexpected
Describe the bug (required)
As a title and reference the follow topic:
https://discuss.nebula-graph.com.cn/t/topic/10803
How To Reproduce(required)
Occasionally
Expected behavior
graphd send rpc to storage, it can be successful.
Additional context
If the cluster run long time, graphd send rpc to storage, it will be show the
Connection reset by peer
, if we restart graphd, do not touch any other things, the error was gone, so I guess from the following code:nebula/src/common/thrift/ThriftClientManager-inl.h
Line 42 in 9fee385
the ThriftClientManager's ClientMap AsyncClient judgement, it has some problem here:
I try to dig the folly AsyncSocket, find some method to judge the connection normal or not, maybe be can help, but need you guys to confirm this worked or not, thx:
The AsyncSocket has some method:
I think the above check in
nebula/src/common/thrift/ThriftClientManager-inl.h
Line 42 in 9fee385
this line can be changed such like this can be fixed this problem, Raise this problem becuase our ClientMap connection has some problem can not be removed in here:
Anyway, thanks, and pls discuss this problem.
The text was updated successfully, but these errors were encountered: