HBASE-25824 IntegrationTestLoadCommonCrawl #3208
Conversation
Force-pushed from ccfd9f6 to b08d1ce
Skimmed. Looks excellent. Test failures are all in backup.
Thanks for the review @saintstack. The last precommit looked good, and the earlier test failures weren't related anyway. I made a push of improvements. I see a round of checkstyle and whitespace fixes is due. Let me address them and then close this out.
This issue was only found recently because you have to get ~1000 files into the set before it manifests:
We are accepting very large row keys from the crawl data during the load phase; the job needs to be configured not to choke on them.
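For illustration only, and not necessarily how this patch handles it: one way to keep the load from choking is to check each candidate row key against HBase's hard row key limit and skip oversized records instead of failing the task. A minimal Java sketch, with a hypothetical class and method name:

import org.apache.hadoop.hbase.HConstants;

// Hypothetical guard; the real loader may handle oversized keys differently.
final class RowKeyGuard {
  // HConstants.MAX_ROW_LENGTH is HBase's hard cap on row key length (Short.MAX_VALUE).
  static boolean tooLong(byte[] row) {
    return row.length > HConstants.MAX_ROW_LENGTH;
  }
}

A mapper could call a check like this before emitting a Put, bump a job counter for the skipped records, and move on.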
- Javadoc warnings
- Missing @Override
- Checkstyle nits
- Whitespace
- Remove the line length limitation in WARCRecord#readLine. It is rare, but the CC data includes lines that are longer. If there is a real format error, like a corrupted file, let an EOFException or OOME signal the problem (a sketch of the unbounded read follows below).
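Roughly, a cap-free readLine just keeps buffering bytes until it sees CRLF. The following is a minimal sketch of that shape, assuming the reader is a DataInput positioned at the start of a line; it is not the actual WARCRecord code.

import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.IOException;

// Sketch only. Reads bytes until CRLF with no upper bound on line length; a
// truncated or corrupt file surfaces as an EOFException from readByte (or, in
// the pathological case, an OutOfMemoryError) rather than a spurious
// "line too long" rejection.
final class UnboundedLineReader {
  static String readLine(DataInput in) throws IOException {
    ByteArrayOutputStream buf = new ByteArrayOutputStream();
    while (true) {
      byte b = in.readByte();
      if (b == '\r') {
        byte next = in.readByte();
        if (next == '\n') {
          break;              // CRLF terminates the line
        }
        buf.write(b);
        buf.write(next);
      } else {
        buf.write(b);
      }
    }
    return buf.toString("UTF-8");
  }
}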
This integration test loads successful resource retrieval records from
the Common Crawl (https://commoncrawl.org/) public dataset into an HBase
table and writes records that can be used to later verify the presence
and integrity of those records.
Run like:

./bin/hbase org.apache.hadoop.hbase.test.IntegrationTestLoadCommonCrawl \
  -Dfs.s3n.awsAccessKeyId=<AWS access key> \
  -Dfs.s3n.awsSecretAccessKey=<AWS secret key> \
  /path/to/test-CC-MAIN-2021-10-warc.paths.gz \
  /path/to/tmp/warc-loader-output
Access to the Common Crawl dataset in S3 is made available to anyone by
Amazon AWS, but Hadoop's S3N filesystem still requires valid access
credentials to initialize.
The input path can either specify a directory or a file. The file may
optionally be compressed with gzip. If a directory, the loader expects
the directory to contain one or more WARC files from the Common Crawl
dataset. If a file, the loader expects a list of Hadoop S3N URIs which
point to S3 locations for one or more WARC files from the Common Crawl
dataset, one URI per line. Lines should be terminated with the UNIX line
terminator.
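For illustration, with placeholder segment and file names (see the bundled paths file for real values), each line of such an input file looks something like:

s3n://commoncrawl/crawl-data/CC-MAIN-2021-10/segments/<segment-id>/warc/<warc-file-name>.warc.gz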
Included in hbase-it/src/test/resources/CC-MAIN-2021-10-warc.paths.gz
is a list of all WARC files comprising the Q1 2021 crawl archive. There
are 64,000 WARC files in this data set, each containing ~1GB of gzipped
data. The WARC files contain several record types, such as metadata,
request, and response, but we only load the response record types. If
the HBase table schema does not specify compression (the default), there
is roughly a 10x expansion. Loading the full crawl archive results in a
table approximately 640 TB in size.
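If you want to avoid that expansion, one option is to pre-create the table with compression enabled on the column family. The sketch below uses the standard HBase 2.x client API; the table name and family are placeholders, so check the test source for the names it actually uses.

import java.io.IOException;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.TableName;
import org.apache.hadoop.hbase.client.Admin;
import org.apache.hadoop.hbase.client.ColumnFamilyDescriptorBuilder;
import org.apache.hadoop.hbase.client.Connection;
import org.apache.hadoop.hbase.client.ConnectionFactory;
import org.apache.hadoop.hbase.client.TableDescriptorBuilder;
import org.apache.hadoop.hbase.io.compress.Compression;
import org.apache.hadoop.hbase.util.Bytes;

// Sketch: pre-create the test table with a compressed column family.
// "IntegrationTestLoadCommonCrawl" and "c" are placeholder names.
public class CreateCompressedTable {
  public static void main(String[] args) throws IOException {
    try (Connection conn = ConnectionFactory.createConnection(HBaseConfiguration.create());
         Admin admin = conn.getAdmin()) {
      admin.createTable(TableDescriptorBuilder
          .newBuilder(TableName.valueOf("IntegrationTestLoadCommonCrawl"))
          .setColumnFamily(ColumnFamilyDescriptorBuilder.newBuilder(Bytes.toBytes("c"))
              .setCompressionType(Compression.Algorithm.SNAPPY)
              .build())
          .build());
    }
  }
}

Any codec available on the cluster works here; SNAPPY is just an example.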
The hadoop-aws jar will be needed at runtime to instantiate the S3N
filesystem. Use the -files ToolRunner argument to add it.
You can also split the Loader and Verify stages:
Load with:

./bin/hbase 'org.apache.hadoop.hbase.test.IntegrationTestLoadCommonCrawl$Loader' \
  -files /path/to/hadoop-aws.jar \
  -Dfs.s3n.awsAccessKeyId=<AWS access key> \
  -Dfs.s3n.awsSecretAccessKey=<AWS secret key> \
  /path/to/test-CC-MAIN-2021-10-warc.paths.gz \
  /path/to/tmp/warc-loader-output
Verify with:

./bin/hbase 'org.apache.hadoop.hbase.test.IntegrationTestLoadCommonCrawl$Verify' \
  /path/to/tmp/warc-loader-output