Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve the reliability of the zebrad testnet sync tests #1207

Closed
4 tasks done
teor2345 opened this issue Oct 26, 2020 · 0 comments · Fixed by #1223
Closed
4 tasks done

Improve the reliability of the zebrad testnet sync tests #1207

teor2345 opened this issue Oct 26, 2020 · 0 comments · Fixed by #1223
Assignees
Labels
A-rust Area: Updates to Rust code C-bug Category: This is a bug

Comments

@teor2345
Copy link
Contributor

teor2345 commented Oct 26, 2020

Version

zebra main as of 26 October 2020.

Platform

ZcashFoundation/zebra CI tests.

Description

The zebrad sync tests frequently fail on testnet.

I tried this:

Submitting PR #1206.

I expected to see this happen:

The tests succeeded, like they did on my development machine.

Instead, this happened:

The testnet sync test failed, which happens frequently, and more often than mainnet.

TODO

  • Activate Hedge as soon as Zebra starts (from the genesis block download), rather than waiting until the histogram has 20 entries
    • this change improves request latency at genesis, immediately after a restart, and after long network delays
  • Improve genesis block download diagnostics at info level
  • Check genesis block retry interval / behaviour
    • A new genesis Download Set request is launched if the old request fails, times out, or the syncer genesis depth request happens before the state commits the genesis block
  • Check genesis block download timeout
    • It's the same as every other block

Alternatives

  • Increase the quick sync test timeout
    • It's currently 30 seconds, which is already a bit too much for a quick test

Follow-Up

@teor2345 teor2345 added C-bug Category: This is a bug A-rust Area: Updates to Rust code A-infrastructure Area: Infrastructure changes A-devops Area: Pipelines, CI/CD and Dockerfiles S-needs-triage Status: A bug report needs triage labels Oct 26, 2020
@teor2345 teor2345 added this to the First Alpha Release milestone Oct 26, 2020
@teor2345 teor2345 changed the title Disable testnet sync tests, until we have deployed more zcashd instances on testnet Improve the reliability of the zebrad testnet sync tests Oct 28, 2020
@teor2345 teor2345 removed A-devops Area: Pipelines, CI/CD and Dockerfiles A-infrastructure Area: Infrastructure changes S-needs-triage Status: A bug report needs triage labels Oct 28, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
A-rust Area: Updates to Rust code C-bug Category: This is a bug
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants