HBASE-25869 WAL value compression #3244
Conversation
Clean up the commit message.
I broke trunk on an unrelated issue last night. Rebased. Removed some precommit noise that resulted.
Force-pushed from ca44233 to b8ff445.
So we will only compress the value? As we do batching when writing WAL entries out, is it possible to compress when flushing? The data will be larger and compression may perform better. The structure of a WAL file would then be multiple compressed blocks. I think for AsyncFSWAL this is easy, as we buffer all the entries in memory and flush all of the in-memory data out when flush is called. For FSHLog it will be a bit harder, as we rely on the DFSOutputStream to flush if the data is too big before we actually call hflush.
This is an enhancement to existing WAL compression. As you know, existing WAL compression already compresses the other components of WAL entries, but not the value. This patch adds support for compressing values too.
This is not possible for two reasons:
Way back in the distant past our WAL format was based on Hadoop's SequenceFile, which supported both record-by-record and block-based compression, where the blocks would contain multiple records. I don't remember why we moved away from it, but I imagine it was because if there are corruptions of the WAL, a record-by-record codec is able to skip over the corrupt record and we lose only that record (or as many records as are actually corrupt), whereas with a block format we would lose the whole block and all of the edits contained within it, especially if compression or encryption is enabled.
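To make the record-by-record approach concrete, here is a minimal illustrative sketch of per-record value compression and decompression using java.util.zip. This is not the patch's actual WALCellCodec code (which plugs into HBase's Compression.Algorithm support); the class and method names here are hypothetical.

import java.io.ByteArrayOutputStream;
import java.util.zip.Deflater;
import java.util.zip.Inflater;

public class PerRecordValueCompression {

  // Compress one cell value independently of its neighbors, so a corrupt
  // record can be skipped without losing a whole block of edits.
  static byte[] compressValue(byte[] value) {
    Deflater deflater = new Deflater(Deflater.BEST_SPEED);
    deflater.setInput(value);
    deflater.finish();
    ByteArrayOutputStream out = new ByteArrayOutputStream(value.length);
    byte[] buf = new byte[4096];
    while (!deflater.finished()) {
      out.write(buf, 0, deflater.deflate(buf));
    }
    deflater.end();
    return out.toByteArray();
  }

  // Decompress a single record's value; in a real codec the uncompressed
  // length would be read from the record header.
  static byte[] decompressValue(byte[] compressed, int uncompressedLength) throws Exception {
    Inflater inflater = new Inflater();
    inflater.setInput(compressed);
    byte[] value = new byte[uncompressedLength];
    int off = 0;
    while (off < uncompressedLength && !inflater.finished()) {
      off += inflater.inflate(value, off, uncompressedLength - off);
    }
    inflater.end();
    return value;
  }
}

Because each value is compressed independently, a reader that hits a corrupt record can skip it and continue, which is the property discussed above.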
@bharathv I wrote a simple bounded delegating input stream impl to avoid the unnecessary copy at decompression time. Rebased on master.
I am redoing the microbenchmarks with the latest patch and will update here soon. The improvements have unlocked an IO performance gain from the compression.
Before, SNAPPY+copy had a 10x loss; now SNAPPY+no-copy has a 10x gain. I double-checked these findings and they seem correct. I will re-run the benchmark when measuring the other codec types for comparison. Microbenchmarks are collected with this change.
SNAPPY or ZSTD are recommended; all other options are provided for comparison. (LZMA is included as a sanity check that an expensive algorithm really is expensive.) When using SNAPPY or ZSTD we derive a performance benefit due to reduced IO for the large values in the test case. Microbenchmarks are collected with this change.
Holy guacamole! Is this because of the reduced disk IO with compressed values? I'm glad we regained all the lost performance by eliding the copy. Edit: Just saw your last comment: "When using SNAPPY or ZSTD we derive a performance benefit due to reduced IO for the large values in the test case." Sweeeeet!
+1 with one pending suggestion and green QA.
 * is finished.
 */
@InterfaceAudience.Private
public class BoundedDelegatingInputStream extends DelegatingInputStream {
Can we merge this and DelegatingInputStream? I don't think we need both of them separately.
public class BoundedDelegatingInputStream extends FilterInputStream {
  // ... bounding methods ...
  setDelegate() {}
}
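For illustration, a minimal sketch of the merged class suggested above, assuming a FilterInputStream base with a resettable delegate and a read bound; this is not the committed implementation.

import java.io.FilterInputStream;
import java.io.IOException;
import java.io.InputStream;

public class BoundedDelegatingInputStream extends FilterInputStream {

  private long limit;   // bytes that may be read from the current delegate
  private long pos;     // bytes read so far

  public BoundedDelegatingInputStream(InputStream in, long limit) {
    super(in);
    this.limit = limit;
  }

  // Reuse the same wrapper across records instead of allocating a new one.
  public void setDelegate(InputStream in, long limit) {
    this.in = in;
    this.limit = limit;
    this.pos = 0;
  }

  @Override
  public int read() throws IOException {
    if (pos >= limit) {
      return -1;
    }
    int result = in.read();
    if (result >= 0) {
      pos++;
    }
    return result;
  }

  @Override
  public int read(byte[] b, int off, int len) throws IOException {
    if (pos >= limit) {
      return -1;
    }
    int bounded = (int) Math.min(len, limit - pos);
    int count = in.read(b, off, bounded);
    if (count > 0) {
      pos += count;
    }
    return count;
  }

  @Override
  public int available() throws IOException {
    return (int) Math.min(in.available(), limit - pos);
  }
}

Resetting one wrapper per reader with setDelegate avoids allocating a new stream, or copying the compressed bytes into a temporary buffer, for every value.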
I thought we could keep them both.
💔 -1 overall
This message was automatically generated.
The unit test failures are not related. It looks like master has recently been unstable, related to RS groups.
That failure looks suspicious. Let me grab the logs before you merge/close the PR.
I have what's available.
How can it be related? That test neither enables WAL compression nor WAL value compression.
Test log is full of:
Passes on my dev branch:
Pushed a fix for whitespace and javadoc issues introduced in the last change. No additional changes are anticipated from this point.
Pardon, Andrew. I did not mean for "suspicious" to imply "caused by this changeset," merely that I was interested in looking at it further. All the PR build artifacts are purged from Jenkins after the PR is closed.
Thanks. Unless there is an objection, I will merge this tonight or tomorrow morning, to master and then to branch-2 (for the future 2.5.0).
WAL storage can be expensive, especially if the cell values represented in the edits are large, consisting of blobs or significant lengths of text. Such WALs might need to be kept around for a fairly long time to satisfy replication constraints on a space-limited (or space-contended) filesystem.
We have a custom dictionary compression scheme for cell metadata that is engaged when WAL compression is enabled in site configuration. This is fine for that application, where we can expect the universe of values and their lengths in the custom dictionaries to be constrained. For arbitrary cell values it is better to use one of the available compression codecs, which are suitable for arbitrary albeit compressible data.
Signed-off-by: Bharath Vissapragada <bharathv@apache.org>
Signed-off-by: Duo Zhang <zhangduo@apache.org>
Signed-off-by: Nick Dimiduk <ndimiduk@apache.org>
Microbenchmark Results
Site configuration used:
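The configuration block itself did not survive extraction. As a hedged reconstruction only, the relevant hbase-site.xml properties would look roughly like the following; the value-compression property names and values are assumptions to be checked against the final patch and reference guide, with the compression type varied per run (snappy, zstd, deflate, lzma, and so on).

hbase.regionserver.wal.enablecompression = true
hbase.regionserver.wal.value.enablecompression = true
hbase.regionserver.wal.value.compression.type = snappy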
Loader: IntegrationTestLoadCommonCrawl
Input: s3n://commoncrawl/crawl-data/CC-MAIN-2021-10/segments/1614178347293.1/warc/CC-MAIN-20210224165708-20210224195708-00000.warc.gz
SNAPPY or ZSTD at level 1 are recommended; all other options are provided for comparison.
Microbenchmarks are collected with this change.
Statistics are collected over the lifetime of the regionserver and are dumped at the end of the test, at shutdown. Statistics are updated under synchronization, but this is done in a way that excludes that overhead from measurement. The normal patch does not contain either the instrumentation or the synchronization point. Nanoseconds are converted to milliseconds for the table.
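A rough sketch of that measurement pattern (hypothetical names; the instrumentation was a temporary change and is not part of the final patch): the timed region ends before the synchronized update, so lock overhead is excluded from each sample, and nanoseconds are converted to milliseconds only when reporting.

public class ValueCompressionStats {
  private long count;
  private double meanNanos;
  private double m2;   // running sum of squared deviations (Welford's method)

  // Fold one sample into the shared statistics under synchronization.
  public void record(long elapsedNanos) {
    synchronized (this) {
      count++;
      double delta = elapsedNanos - meanNanos;
      meanNanos += delta / count;
      m2 += delta * (elapsedNanos - meanNanos);
    }
  }

  // Convert nanoseconds to milliseconds only at report time.
  public synchronized String report() {
    double stddevNanos = count > 1 ? Math.sqrt(m2 / (count - 1)) : 0.0;
    return String.format("mean=%.3f ms stddev=%.3f ms over %d samples",
        meanNanos / 1_000_000.0, stddevNanos / 1_000_000.0, count);
  }

  // At the call site, the timed region excludes the synchronized update.
  public static void example(ValueCompressionStats stats) {
    long start = System.nanoTime();
    // ... compress and write the value ...
    long elapsed = System.nanoTime() - start;
    stats.record(elapsed);   // synchronization happens after timing stops
  }
}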
Compression enabled, value compression enabled, v1 patch, Deflate (best speed): 1,209,947,515 (76.4%); 12.694 ms (stddev 8.48)