Add retry and timeout in unit test #1504

ianco · 2021-01-15T15:43:48Z

Signed-off-by: Ian Costanzo ian@anon-solutions.ca

Increases likelihood of test passing from roughly 50% to 90%

Signed-off-by: Ian Costanzo <ian@anon-solutions.ca>

sovbot · 2021-01-15T15:43:50Z

Can one of the admins verify this patch?

WadeBarnes · 2021-01-15T15:49:08Z

[ci] please test

WadeBarnes · 2021-01-15T15:50:21Z

(ci) please test

WadeBarnes · 2021-01-15T15:53:34Z

(ci) test this please

Toktar

Hello Ian,
thanks for your emphasize an issue with failed tests. I really apologize if the logic of these tests is a bit implicit. But these are simulation tests for checking the consensus. A random seed is selected with each new launch. That is, if you just restart a test, then you ignore a possible critical issue in the consensus logic (or view change in this case). Could you please write down the seed on which the test falls in the list of exceptions if you do not have time to figure out what is wrong?
I suggest not to put into the master branch rule to ignore tests that can find real bugs in the code.
As far as I understand it's a start of creating simulation tests:
#1242
#1246
It's a random article about seeds in simulation or randomization testing: https://sciprincess.wordpress.com/2019/03/14/how-to-select-a-seed-for-simulation-or-randomization/
Tomorrow I'll ask an author of these tests to get a better link to describing this approach.
UPD: The link to article from our expert: https://alexwlchan.net/2016/06/hypothesis-intro/

ianco · 2021-01-20T20:26:22Z

Hello Ian,
thanks for your emphasize an issue with failed tests. I really apologize if the logic of these tests is a bit implicit. But these are simulation tests for checking the consensus. A random seed is selected with each new launch. That is, if you just restart a test, then you ignore a possible critical issue in the consensus logic (or view change in this case). Could you please write down the seed on which the test falls in the list of exceptions if you do not have time to figure out what is wrong?
I suggest not to put into the master branch rule to ignore tests that can find real bugs in the code.
As far as I understand it's a start of creating simulation tests:
#1242
#1246
It's a random article about seeds in simulation or randomization testing: https://sciprincess.wordpress.com/2019/03/14/how-to-select-a-seed-for-simulation-or-randomization/
Tomorrow I'll ask an author of these tests to get a better link to describing this approach.

@Toktar The unit test creates a simulated pool using a random number of nodes, and then creates a random number of votes, so there is not a single "seed" per se that causes the test to fail. We suspect it is a timing issue hence the retries.

There are too many layers of logic for me to try to dig into it unfortunately :-( If you can get someone to take a look, they just need to retry the test several times and they should get some failures - for me & Wade this test fails about half the time (or about 10% of the time with the retries).

Toktar · 2021-01-22T13:20:00Z

@ianco In our case a seed is a random parameter and the number that you can see near the name of a failed test in logs. Looks like
...
test_new_view_combinations(440868)
...
The fails reproduce locally for me. And it's really a consensus problem for a ViewChange protocol but just with a medium risk.
I created an issue with description of the failed case #1506
And I'll send a PR with skipping tests for problem seeds. Could you please extend this exception list if find a new one?

ianco · 2021-01-22T14:43:01Z

@Toktar thanks for the notes, I'll re-test and try to track the failing seeds.

Add retry and timeout in unit test

c65a14f

Signed-off-by: Ian Costanzo <ian@anon-solutions.ca>

ianco requested review from lampkin-diet, ashcherbakov, skhoroshavin and Toktar as code owners January 15, 2021 15:43

WadeBarnes approved these changes Jan 15, 2021

View reviewed changes

Toktar requested changes Jan 19, 2021

View reviewed changes

ianco closed this Feb 8, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add retry and timeout in unit test #1504

Add retry and timeout in unit test #1504

ianco commented Jan 15, 2021

sovbot commented Jan 15, 2021

WadeBarnes commented Jan 15, 2021

WadeBarnes commented Jan 15, 2021

WadeBarnes commented Jan 15, 2021

Toktar left a comment •

edited

Loading

ianco commented Jan 20, 2021

Toktar commented Jan 22, 2021 •

edited

Loading

ianco commented Jan 22, 2021

Add retry and timeout in unit test #1504

Add retry and timeout in unit test #1504

Conversation

ianco commented Jan 15, 2021

sovbot commented Jan 15, 2021

WadeBarnes commented Jan 15, 2021

WadeBarnes commented Jan 15, 2021

WadeBarnes commented Jan 15, 2021

Toktar left a comment • edited Loading

Choose a reason for hiding this comment

ianco commented Jan 20, 2021

Toktar commented Jan 22, 2021 • edited Loading

ianco commented Jan 22, 2021

Toktar left a comment •

edited

Loading

Toktar commented Jan 22, 2021 •

edited

Loading