VReplication: ability to compress gtid when stored in _vt.vreplication's pos column #7877
Conversation
I have a meta-question about the implementation details, but the approach of allowing forward/backward compatibility around using compression is really smart!
dataLength = binary.LittleEndian.Uint32(dataLengthBytes)

// uncompress using zlib
inputData := inputBytes[4:]
I got curious why this bit math was required and not provided by compress/zlib, so I started to play around with this.
From my reading and some brief experimentation writing benchmarks, golang's zlib library expects the zlib header to be present, so chopping off the first four bytes will always result in a zlib.ErrHeader from zlib.NewReader: https://play.golang.org/p/p6XWE12Um89
So I commented out the header truncation and was able to exercise the rest of the decompression code as written, but it would still fail on the length check (line 740), which I haven't looked into yet. In any case, the benchmark results for doing this vs. just calling into NewReader and then checking whether the error is zlib.ErrHeader (which would indicate the data was not compressed with zlib) seem negligible. Benchmark repo.
Note: I did not check this against gtids compressed by MySQL yet, so we should verify whether the behavior of MySQL's compress() actually differs from golang's zlib compression. From the docs it seems (though I am very unfamiliar here, so happy to be told I'm wrong) that MySQL doesn't guarantee it is using zlib: the docs say "a compression library such as zlib" rather than "this compression is always done specifically with zlib".
The four bytes are additionally added by MySQL; they are not related to the header of the zlib-generated data. Some references here:
https://bugs.mysql.com/bug.php?id=79400
https://www.bennadel.com/blog/3152-running-mysql-compress-and-uncompress-compatible-methods-in-coldfusion.htm
https://dev.mysql.com/doc/refman/8.0/en/innodb-compression-internals.html says zlib is used. Maybe there is a way to use a different library while compiling MySQL, but I haven't seen references to anyone using a different library, and I have seen references to tools and drivers that assume zlib.
So if it is possible to link in other compression libraries and someone does it, we will have to update this implementation.
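Putting the two comments together, the full format is a 4-byte little-endian uncompressed-length prefix added by MySQL, followed by an ordinary zlib stream. A self-contained sketch of decoding (and, for round-trip testing, encoding) that format in Go; the function names are mine, not Vitess's:

```go
package main

import (
	"bytes"
	"compress/zlib"
	"encoding/binary"
	"fmt"
	"io"
)

// decodeMySQLCompress decodes MySQL COMPRESS() output: a 4-byte
// little-endian uncompressed-length prefix followed by a zlib stream.
func decodeMySQLCompress(input []byte) ([]byte, error) {
	if len(input) < 5 {
		return nil, fmt.Errorf("too short for MySQL COMPRESS() data")
	}
	wantLen := binary.LittleEndian.Uint32(input[:4])
	r, err := zlib.NewReader(bytes.NewReader(input[4:]))
	if err != nil {
		return nil, err // zlib.ErrHeader here suggests non-zlib data
	}
	defer r.Close()
	out, err := io.ReadAll(r)
	if err != nil {
		return nil, err
	}
	if uint32(len(out)) != wantLen {
		return nil, fmt.Errorf("length header says %d, got %d bytes", wantLen, len(out))
	}
	return out, nil
}

// encodeMySQLCompress mirrors the same format for round-trip testing.
func encodeMySQLCompress(data []byte) []byte {
	var buf bytes.Buffer
	hdr := make([]byte, 4)
	binary.LittleEndian.PutUint32(hdr, uint32(len(data)))
	buf.Write(hdr)
	w := zlib.NewWriter(&buf)
	w.Write(data)
	w.Close()
	return buf.Bytes()
}

func main() {
	pos := []byte("MySQL56/uuid:1-615") // illustrative position, not real data
	dec, err := decodeMySQLCompress(encodeMySQLCompress(pos))
	fmt.Println(string(dec), err)
}
```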
Thanks for the clarification! This makes sense to me now.
looks good!
…et option Signed-off-by: Rohit Nayak <rohit@planetscale.com>
Signed-off-by: Rohit Nayak rohit@planetscale.com
Description
During the replication phase of a workflow the gtid of every transaction is recorded in the pos column of the _vt.vreplication table so that the target can keep track of its current position with respect to the source. We also have a heartbeat (default 1 second, configurable up to a maximum of one minute) that updates the time_updated column as a proof-of-life of the stream. For long-running workflows this can cause a significant increase in the size of binlogs, especially in cases where there is a high write QPS or where many reparenting operations have increased the size of each gtid. The problem is accentuated because in the full mode of RBR (which Vitess requires) the gtids are present in both the before and after images.
This PR adds an option to vttablet to compress gtids before storing them in the pos column. We use the same (zlib) algorithm as implemented by MySQL's compress() function. This makes it easy to inspect the column in SQL using select uncompress(pos) from _vt.vreplication. It also means we can use the (presumably efficient) MySQL compress() function to compress, and only need to write Go code for the decompression. The compression achieved should be between 60-80%, since the gtid is a hex string.
We put this functionality behind the -vreplication_store_compressed_gtid boolean flag (default: false), since users may prefer clear-text positions for readability.
Independent of this option, we support both compressed and clear-text gtids while reading the pos column. So if a vttablet is first run with the option on and it is later turned off, we will still be able to read the compressed gtid, and future updates will store the pos in clear text (and vice versa).
Note that MoveTables and Reshard workflows tend to be short-lived. Most of the data migration is done during the copy phase, which does not generate a lot of pos updates. So if you don't expect to use long-running Materialize flows, it may not be very beneficial to enable this feature.
Checklist
Impacted Areas in Vitess
Components that this PR will affect: