fix config error in cluster_basic_test #205
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
What problem does this PR solve?
in cluster_basic_test
mds start error on InitEtcdClient(), and produced coredump
Problem Summary:
log:
{"level":"warn","ts":"2020-12-29T11:17:52.264+0800","caller":"clientv3/retry_interceptor.go:61","msg":"retrying of unary invoker failed","target":"endpoint://client-e074fa9d-964b-4feb-ad0f-23a198366627/127.0.0.1:2301","attempt":0,"error":"rpc error: code = Canceled desc = context canceled"}
2020-12-29 11:17:52.264637 I | session of current mds(127.0.0.1:2310) occur error
E 2020-12-29T11:17:52.264657+0800 1779304 leader_election.cpp:79] 127.0.0.1:2310 leader observe for prefix:07leader occur error, errcode: 23
I 2020-12-29T11:17:52.264678+0800 1779304 leader_election.cpp:86] mds is existing due to the error of leader observation
I 2020-12-29T11:17:52.264693+0800 1779256 heartbeat_manager.cpp:88] heartbeatManager not running.
I 2020-12-29T11:17:52.319905+0800 1771645 cluster.cpp:787] probe 127.0.0.1:2310 fail.
I 2020-12-29T11:17:54.320030+0800 1771645 cluster.cpp:161] stop mds 127.0.0.1:2310 success
I 2020-12-29T11:17:54.320063+0800 1771645 cluster.cpp:148] stop mds 127.0.0.1:2311 begin...
I 2020-12-29T11:17:54.323937+0800 1771645 cluster.cpp:787] probe 127.0.0.1:2311 fail.
2020-12-29 11:17:55.255706 I | etcd do NewCient get err:context deadline exceeded, errCode:DeadlineExceeded
F 2020-12-29T11:17:55.255734+0800 1780290 mds.cpp:234] init etcd client err! etcdaddr: 127.0.0.1:2379int, etcdaddr len: 14, etcdtimeout: 5000, operation timeout: 5000, etcd retrytimes: 3
*** Check failure stack trace: ***
*** Aborted at 1609211875 (unix time) try "date -d @1609211875" if you are using GNU date ***
PC: @ 0x0 (unknown)
*** SIGABRT (@0x1b2a42) received by PID 1780290 (TID 0x7f5d501c8a00) from PID 1780290; stack trace: ***
@ 0x7f5d4e9720e0 (unknown)
@ 0x7f5d4d062fff gsignal
@ 0x7f5d4d06442a abort
@ 0x55aa10d27f56 google::FlushAndAbort()
@ 0x55aa10d21aa8 google::LogMessage::Fail()
@ 0x55aa10d21962 google::LogMessage::SendToLog()
@ 0x55aa10d20deb google::LogMessage::Flush()
@ 0x55aa10d270da google::LogMessageFatal::~LogMessageFatal()
@ 0x55aa0fe7c615 curve::mds::MDS::InitEtcdClient()
@ 0x55aa0fe79923 curve::mds::MDS::StartCompaginLeader()
@ 0x55aa0fe71a2b main
What's Changed:
How it Works:
Side effects(Breaking backward compatibility? Performance regression?):
Check List