Fix Windows conflict test failures on main #1777

teor2345 · 2021-02-19T00:06:56Z

Motivation

In #1770, we unconditionally kill the conflicted node in the conflict acceptance tests. But on Windows, killing a node that has already exited makes `was_killed` return `true`, which fails the tests. We didn't catch this error in that PR because of testnet unreliability, which will be fixed by #1222.

Solution

Only kill the conflicted node if it is still running
Keep the section of code where both nodes are running as short as possible
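The first point can be sketched roughly as follows. This is only an illustration, not the code in this PR: the `Node` trait, `force_kill`, `kill_if_running`, and `FakeNode` are hypothetical names, and the only real APIs used are `std::process::Child::try_wait` and `std::process::Child::kill`.

```rust
use std::io;
use std::process::Child;

/// Hypothetical minimal interface for a test node (not the `TestChild` API).
trait Node {
    /// Returns `true` if the process has not exited yet.
    fn is_running(&mut self) -> io::Result<bool>;
    /// Forcibly stops the process.
    fn force_kill(&mut self) -> io::Result<()>;
}

impl Node for Child {
    fn is_running(&mut self) -> io::Result<bool> {
        // `try_wait` returns `Ok(None)` while the process is still alive.
        Ok(self.try_wait()?.is_none())
    }

    fn force_kill(&mut self) -> io::Result<()> {
        self.kill()
    }
}

/// Kill `node` only if it is still running.
/// Returns `Ok(true)` if a kill was actually issued.
fn kill_if_running<N: Node>(node: &mut N) -> io::Result<bool> {
    if node.is_running()? {
        node.force_kill()?;
        Ok(true)
    } else {
        // The node already exited, so there is nothing to kill.
        Ok(false)
    }
}

/// In-memory stand-in, so the sketch can be exercised without spawning a process.
struct FakeNode {
    running: bool,
    kills: u32,
}

impl Node for FakeNode {
    fn is_running(&mut self) -> io::Result<bool> {
        Ok(self.running)
    }

    fn force_kill(&mut self) -> io::Result<()> {
        self.running = false;
        self.kills += 1;
        Ok(())
    }
}
```

With this shape, a node that has already exited is never sent a kill, so a later "was this process killed?" check is not affected by the cleanup step.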

Review

CI is consistently failing on main, so this PR is a critical priority.

Since I don't have a Windows box locally, we should make sure Windows, macOS, and Linux CI pass on this PR.

Follow Up Work

Maybe we should disable the testnet large sync tests, or disable fail_fast in CI.

Fix the Windows-specific bugs in TestChild::kill and TestChild::is_running #1781

yaahc

Looks good. One UX suggestion for the error messages, then I think we can merge this.

zebrad/tests/acceptance.rs

teor2345 · 2021-02-19T07:48:38Z

The macOS testnet sync failure will be fixed by #1222

The Windows testnet sync will be disabled by #1782

On Windows, if a process is killed after it is dead, it returns `true` for `was_killed`. Instead, check if the process is running before killing it. Also make the section where processes are running as short as possible, and include context for both processes in every error.
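As a rough illustration of "context for both processes in every error", the sketch below attaches the captured output of both nodes to any failed check. It uses only the standard library; `ConflictError` and `check_with_context` are hypothetical names, and the actual tests may attach context differently.

```rust
use std::fmt;

/// Hypothetical error type that carries captured output from both nodes.
#[derive(Debug)]
struct ConflictError {
    reason: String,
    node1_output: String,
    node2_output: String,
}

impl fmt::Display for ConflictError {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        // Always print both nodes, regardless of which one triggered the failure.
        writeln!(f, "{}", self.reason)?;
        writeln!(f, "node1 output:\n{}", self.node1_output)?;
        write!(f, "node2 output:\n{}", self.node2_output)
    }
}

impl std::error::Error for ConflictError {}

/// Turn a failed check into an error that includes both nodes' output.
fn check_with_context(
    ok: bool,
    reason: &str,
    node1_output: &str,
    node2_output: &str,
) -> Result<(), ConflictError> {
    if ok {
        Ok(())
    } else {
        Err(ConflictError {
            reason: reason.to_string(),
            node1_output: node1_output.to_string(),
            node2_output: node2_output.to_string(),
        })
    }
}
```

Collecting the output first and running the checks afterwards also keeps the window in which both processes are alive short.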

teor2345 · 2021-02-19T07:49:59Z

I cherry-picked #1776 so I could actually see the Windows tests succeed, even if macOS failed.

`node2.is_running()` can return `true` on Windows, even though `node2` has logged a panic. This cleanup code only runs if `node2` fails to panic and exit as expected. So it's ok for us to skip it. See ZcashFoundation#1781 for details.
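One way to skip a cleanup step on a single platform is the standard `cfg!(windows)` macro. The sketch below is an assumption about the general shape, not the code in this commit: `cleanup_node2` is a hypothetical helper, and the two closures stand in for the real `is_running` check and kill call.

```rust
/// Hypothetical cleanup helper: kills node2 only if it is still running,
/// and skips the whole step on Windows, where the running check is unreliable.
/// Returns `true` if a kill was issued.
fn cleanup_node2(is_running: impl FnOnce() -> bool, kill: impl FnOnce()) -> bool {
    // `cfg!(windows)` expands to a boolean literal at compile time.
    if cfg!(windows) {
        return false;
    }

    if is_running() {
        kill();
        true
    } else {
        false
    }
}
```

Because `cfg!` keeps both branches in the compiled code, the skipped path is still type-checked on every platform, unlike an `#[cfg(...)]` attribute, which removes it entirely.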

teor2345 · 2021-02-19T08:02:52Z

I did an admin-merge because the code had already been reviewed; I just needed to disable some cleanup on Windows.

teor2345 added the C-bug (Category: This is a bug), A-rust (Area: Updates to Rust code), P-Critical, and I-integration-fail (Continuous integration fails, including build and test failures) labels Feb 19, 2021

teor2345 added this to the 2021 Sprint 3 milestone Feb 19, 2021

teor2345 requested a review from a team February 19, 2021 00:06

teor2345 self-assigned this Feb 19, 2021

teor2345 changed the title ~~Fix Windows conflict tests kill code~~ Fix Windows conflict tests failures on main Feb 19, 2021

teor2345 changed the title ~~Fix Windows conflict tests failures on main~~ Fix Windows conflict test failures on main Feb 19, 2021

teor2345 mentioned this pull request Feb 19, 2021

Disable fail-fast from test job #1776

Merged

teor2345 force-pushed the conflict-fixes branch from 5f7302d to 0955eb5 Compare February 19, 2021 00:14

yaahc previously approved these changes Feb 19, 2021


teor2345 dismissed yaahc’s stale review via 9377ff7 February 19, 2021 00:30

yaahc previously approved these changes Feb 19, 2021


teor2345 commented Feb 19, 2021


teor2345 dismissed yaahc’s stale review via a6b8ffb February 19, 2021 00:57

yaahc previously approved these changes Feb 19, 2021


teor2345 dismissed yaahc’s stale review via 79dee40 February 19, 2021 06:04

teor2345 mentioned this pull request Feb 19, 2021

TestChild::is_running can return true after a panic log #1781

Closed

teor2345 force-pushed the conflict-fixes branch from 79dee40 to d48ecb2 Compare February 19, 2021 06:51

teor2345 mentioned this pull request Feb 19, 2021

Set ZEBRA_SKIP_NETWORK_TESTS using Windows syntax #1782

Merged

teor2345 and others added 2 commits February 19, 2021 17:58

Skip node2.is_running() on Windows

e57185f

`node2.is_running()` can return `true` on Windows, even though `node2` has logged a panic. This cleanup code only runs if `node2` fails to panic and exit as expected. So it's ok for us to skip it. See ZcashFoundation#1781 for details.

remove fail-fast from test job

7734bd7

teor2345 force-pushed the conflict-fixes branch from 48f6ab8 to 7734bd7 Compare February 19, 2021 08:02

teor2345 merged commit a9e4768 into ZcashFoundation:main Feb 19, 2021

teor2345 mentioned this pull request Feb 23, 2021

zebrad 1.0.0-alpha.3 Release #1804

Merged

18 tasks
