Fix VReplication logging to file and db #8521

rohit-nayak-ps · 2021-07-22T18:30:06Z

Description

Errors like duplicate key errors while bulk inserting during the copy phase can generate huge log lines. Since we also write to _vt.vreplication_log that table also gets filled up fast since we keep retrying within vreplication. This PR truncates the logs both in the file logs and in the _vt.vreplication_log table
The logic for updating count of recurring errors in _vt.vreplication_log was not working because the controller would set the state of the restarted workflow again, which was getting logged to _vt.vreplication_log alternating with the actual error. This has been fixed by ignoring this spurious state change log
Some unnecessary/repetitive log lines have been removed

Aside

One reason why log files were filling up is that we retry the workflow on error every 5 seconds, by default. The reason we retry is that transient errors like network failures or server restarts may occur and we don't want the vreplication workflow to error out permanently but keep retrying. We may want to increase this default since, in general, most such situations appear to be non-recoverable and the user needs to intervene to fix it. This needs more discussion though.

For now users can setup -vreplication_retry_delay to change this.

Checklist

Tests were added or are not required
Documentation was added or is not required

Deployment Notes

…mming the logs. Fix logic for incrementing count of recurring error messages. Signed-off-by: Rohit Nayak <rohit@planetscale.com>

deepthi

LGTM

Backports of #8403 #8483 #8489 #8401 #8521 #8396 from main into release 11.0

rohit-nayak-ps added Component: VReplication Type: Bug labels Jul 22, 2021

rohit-nayak-ps force-pushed the rn-vr-logs branch from cd2d8cb to 99e49e0 Compare July 23, 2021 17:37

Truncate logs and db log messages to prevent huge failing queries spa…

d6be345

…mming the logs. Fix logic for incrementing count of recurring error messages. Signed-off-by: Rohit Nayak <rohit@planetscale.com>

rohit-nayak-ps force-pushed the rn-vr-logs branch from 99e49e0 to d6be345 Compare July 25, 2021 16:05

rohit-nayak-ps requested review from shlomi-noach, deepthi and a team July 25, 2021 17:37

rohit-nayak-ps marked this pull request as ready for review July 25, 2021 17:38

deepthi approved these changes Jul 26, 2021

View reviewed changes

deepthi merged commit 1053cff into vitessio:main Jul 26, 2021

deepthi deleted the rn-vr-logs branch July 26, 2021 03:53

rohit-nayak-ps mentioned this pull request Jul 26, 2021

Backports of #8403 #8483 #8489 #8401 #8521 #8396 from main into release 11.0 #8536

Merged

rohit-nayak-ps added a commit that referenced this pull request Jul 26, 2021

Merge pull request #8536 from planetscale/rn-8403-release-11

cc2de83

Backports of #8403 #8483 #8489 #8401 #8521 #8396 from main into release 11.0

frouioui added the release notes label Sep 28, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix VReplication logging to file and db #8521

Fix VReplication logging to file and db #8521

rohit-nayak-ps commented Jul 22, 2021 •

edited

Loading

deepthi left a comment

Fix VReplication logging to file and db #8521

Fix VReplication logging to file and db #8521

Conversation

rohit-nayak-ps commented Jul 22, 2021 • edited Loading

Description

Aside

Checklist

Deployment Notes

deepthi left a comment

Choose a reason for hiding this comment

rohit-nayak-ps commented Jul 22, 2021 •

edited

Loading