streamingccl: prevent node lag replanning starvation #114525

Merged
merged 1 commit into from
Nov 16, 2023

msbutler
Collaborator

This patch stops the lastNodeLagCheck time from being updated every time the frontier processor receives a checkpoint, which can happen every few seconds. Previously, these updates kept the node lag replanning check from ever triggering, because the check only runs once this timestamp is more than 10 minutes old. The timestamp should instead be updated only when we actually compute the lag check.

Fixes #114341

Release note: none

@msbutler msbutler self-assigned this Nov 15, 2023
@msbutler msbutler requested a review from a team as a code owner November 15, 2023 20:57
@msbutler msbutler added the backport-23.2.x Flags PRs that need to be backported to 23.2. label Nov 15, 2023
@msbutler msbutler force-pushed the butler-debug-auto-replan branch from 5a6084f to 68c90d2 Compare November 15, 2023 21:22
blathers-crl bot commented Nov 15, 2023

It looks like your PR touches production code but doesn't add or edit any test code. Did you consider adding tests to your PR?

🦉 Hoot! I am a Blathers, a bot for CockroachDB. My owner is dev-inf.

Collaborator

@stevendanna stevendanna left a comment

Aha, thanks for looking into this. Left some nits about the logging.

@msbutler msbutler force-pushed the butler-debug-auto-replan branch from 68c90d2 to b61cbb9 Compare November 16, 2023 01:24
@msbutler
Collaborator Author

TFTR!

bors r=stevendanna

@craig
Contributor

craig bot commented Nov 16, 2023

Build succeeded:

@craig craig bot merged commit c748278 into cockroachdb:master Nov 16, 2023
8 checks passed
Labels
backport-23.2.x Flags PRs that need to be backported to 23.2. T-disaster-recovery
Development

Successfully merging this pull request may close these issues.

roachtest: c2c/BulkOps/full failed
3 participants