ingest/ledgerbackend: Restart captive core when a new version of core is detected on disk #3687

tamirms · 2021-06-11T15:04:14Z

PR Checklist

PR Structure

This PR has reasonably narrow scope (if not, break it down into smaller PRs).
This PR avoids mixing refactoring changes with feature changes (split into two PRs
otherwise).
This PR's title starts with name of package that is most changed in the PR, ex.
services/friendbot, or all or doc if the changes are broad or impact many
packages.

Thoroughness

This PR adds tests for the most critical parts of the new functionality or fixes.
I've updated any docs (developer docs, .md
files, etc... affected by this change). Take a look in the docs folder for a given service,
like this one.

Release planning

I've updated the relevant CHANGELOG (here for Horizon) if
needed with deprecations, added features, breaking changes, and DB schema changes.
I've decided if this PR requires a new major/minor version according to
semver, or if it's mainly a patch change. The PR is targeted at the next
release branch if it's not a patch change.

What

Close #3602

Add a ticker to stellarCoreRunner which fires every 10 seconds. The stellarCoreRunner context interrupts the goroutine attached to the ticker, so if captive core shuts down the ticker also stops.
On every tick, we check whether the stellar core binary has been modified. If it has, we log a warning announcing that captive core will be shut down and then call stellarCoreRunner.close() to terminate the captive core instance. Once stellarCoreRunner.close() is called, any pending operations (PrepareRange() or GetLedger()) should return immediately with an error, and the ingestion state machine will handle the error by retrying.
When the retry happens, it creates a new instance of stellarCoreRunner, which should use the newest stellar core binary.

Why

See #3602 for motivation.

ingest/ledgerbackend/stellar_core_runner.go

Shaptic

👍

services/horizon/CHANGELOG.md

bartekn · 2021-06-14T13:44:19Z

@tamirms please merge when ready.

Co-authored-by: George <Shaptic@users.noreply.github.com>

Restart captive core when a new version of core is detected on disk

85387b6

tamirms force-pushed the captive-core-restart branch from 73f0497 to 85387b6 June 11, 2021 15:11

tamirms requested a review from a team June 11, 2021 15:17

bartekn approved these changes Jun 11, 2021

Shaptic reviewed Jun 11, 2021

ingest/ledgerbackend/stellar_core_runner.go (outdated, resolved)

Shaptic approved these changes Jun 11, 2021

services/horizon/CHANGELOG.md (outdated, resolved)

services/horizon/CHANGELOG.md (outdated, resolved)

tamirms and others added 4 commits June 14, 2021 16:04

Update services/horizon/CHANGELOG.md

6ac423c

Co-authored-by: George <Shaptic@users.noreply.github.com>

fix pr link

64cb6e0

Make binarywatcher optional in case of error

45131a2

Merge branch 'master' into captive-core-restart

9ed48d1

tamirms merged commit 226d70a into stellar:master Jun 14, 2021

tamirms deleted the captive-core-restart branch June 14, 2021 15:51
