Skip to content

Investigate Production Issues and complete Prod Release #635

@darunrs

Description

@darunrs

The prod release recently done filed in production. It highlighted three issues:

  1. Coordinator logging is incomplete. Many actions are somehow not being logged.
  2. Block Streamer ran out of memory.
  3. Runner most likely also ran out of memory, though we couldn't see any explicit logs related to that. But, the machine was inaccesible through ssh.

The Prod Release was directly responsible for the third one. It inexplicably increased memory usage by Runner by A LOT. It needs to be investigated what change is causing this problem.

The first two are unrelated to the prod release. We've beefed up the Block Streamer machine for now.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions