
[ADAM-752] Build for many combos of Spark/Hadoop versions. #765

Merged
massie merged 3 commits into bigdatagenomics:master from fnothaft:update-jenkins-script on Aug 12, 2015

Conversation

fnothaft
Member

Resolves build comments on #752. I have set Jenkins up as a 3D!!!!!! matrix:

[screenshot: Jenkins build matrix, 2015-08-10]

@fnothaft
Member Author

OK, so a few interesting things.

  • We can't build for Spark 1.1.1, which I had forgotten (repartitionAndSortWithinPartitions doesn't exist there)
  • It seems like the Spark-based unit tests will only run in parallel on Spark 1.2.0; I'm not sure what is broken in Spark 1.3+. I've told Jenkins to just run the builds sequentially for now.

@@ -26,16 +25,24 @@ export SPARK_DRIVER_MEMORY=8g

pushd "$ADAM_TMP_DIR"


if [[ $HADOOP_VERSION == "1.0.4" ]]; then
Member

Maybe echo $HADOOP_VERSION | egrep '^1.0.' would be more general, in case we use e.g. 1.0.4 as the hadoop test version?

Member

or possibly [[ $HADOOP_VERSION =~ ^1\.0 ]], likewise below, if that seems easier

Member

What @ryan-williams said. :) Much cleaner.
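For reference, the two suggested checks sketched side by side. The function names and version strings here are illustrative, not from the PR; note the dots are escaped so they match literally rather than as regex wildcards:

```shell
#!/usr/bin/env bash
# Sketch of the two version-check styles suggested above.

# egrep style: match any 1.0.x Hadoop version
is_hadoop1_egrep() {
    echo "$1" | egrep -q '^1\.0\.'
}

# bash [[ =~ ]] style: no subshell or pipe needed
is_hadoop1_regex() {
    [[ $1 =~ ^1\.0 ]]
}

if is_hadoop1_regex "1.0.4"; then
    echo "hadoop1"
else
    echo "hadoop2"
fi
```

The bash regex form avoids spawning `echo` and `egrep` for every check, which is presumably why it reads as cleaner here.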

@ryan-williams
Member

That looks pretty neat. I'm guessing there are some config changes you're making in Jenkins that aren't depicted here, e.g. to populate $SPARK_VERSION?

tar xzvf spark-1.1.0-bin-hadoop1.tgz
export SPARK_HOME="${ADAM_TMP_DIR}/spark-1.1.0-bin-hadoop1"
HADOOP=hadoop1
elif [[ $HADOOP_VERSION == "2.6.0" ]]; then
Member

Maybe echo $HADOOP_VERSION | egrep '^2.6.' would be more general, in case we use e.g. 2.6.3 as the hadoop test version?

@massie
Member

massie commented Aug 10, 2015

This is good stuff, Frank.

@fnothaft
Member Author

Jenkins, retest this please.

@fnothaft
Member Author

Fun times! Guess what doesn't work?

If your guess was that running:

mvn package -Dspark.version=1.4.1

would lead to Destruction, Terror, and Mayhem, you win $5! At least, it does for me locally.

I will sort this out later, but if anyone has seen this before or has any clues, I would love your thoughts.

@heuermh
Member

heuermh commented Aug 11, 2015

Destruction, Terror, and Mayhem and a 45 minute wait to see if your pull request turns green. :)

Maybe a middle ground, or set up a quick build in Travis and leave Jenkins 3D Awesome for a nightly integration test?

Would a profile work for where the -Dspark-version is failing?

@fnothaft
Member Author

OK, so the problem seems to have been that we had a completely unused (???) dependency on com.amazonaws:aws-java-sdk, which was pulling in a version of com.fasterxml.jackson.core:jackson-core that is incompatible with Spark 1.3.0 and later. This produced a red-herring test failure message implying that the unit tests were crashing because we were running multiple SparkContexts in parallel. That unused dependency is gone now, so the build should pass (and we no longer need to run all the builds sequentially, which is good for obvious reasons).
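For anyone chasing a similar conflict: `mvn dependency:tree -Dincludes=com.fasterxml.jackson.core` will show which artifact drags in the offending jackson-core. The fix described above amounts to deleting the unused dependency from the pom; roughly (a sketch — the version element and any other coordinates aren't shown in this thread):

```xml
<!-- Removed: unused dependency whose transitive jackson-core
     conflicted with Spark 1.3.0+ -->
<dependency>
  <groupId>com.amazonaws</groupId>
  <artifactId>aws-java-sdk</artifactId>
</dependency>
```

Had the dependency actually been needed, an `<exclusions>` block on it would have been the alternative to outright removal.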

@fnothaft
Member Author

And the Spark 1.4.1/Hadoop 2.6.0 touchstone builds have passed! Huzzah! Now, for the rest of the builds.

@fnothaft
Member Author

Rebased and added a commit to clean up log junk when running 1.4.1 unit tests.

@fnothaft fnothaft force-pushed the update-jenkins-script branch 2 times, most recently from 23b9e41 to bbdc3ce Compare August 11, 2015 17:52
@fnothaft
Member Author

Cleaned up RE: the comments above around version checking.

@fnothaft
Member Author

Jenkins, retest this please.

@@ -345,12 +345,36 @@
<type>test-jar</type>
<scope>test</scope>
</dependency>
<!--
Member

any reason for this block?

Member Author

I was testing something; this can be removed.

@fnothaft
Member Author

Jenkins, retest this please.

@fnothaft
Member Author

IT WORKS! IT WORKS! IT REALLY DOES!!!!!!!
;)

@heuermh
Member

heuermh commented Aug 12, 2015

Are all the red "Failed - skipped" entries in build 842 supposed to be "Not run"? Looks like they might be combinations that aren't supported, such as Hadoop 1 with Spark 1.4.1. Or maybe I just need my 3D glasses?

@fnothaft
Member Author

@heuermh correct; Spark 1.4.1/Hadoop 1.x is skipped, as it won't build with the way Spark is currently packaged as Maven artifacts (there was a long discussion of this on a previous ADAM PR).

@fnothaft
Member Author

They show up as greyed out red because the last build of that combo failed.

@heuermh
Member

heuermh commented Aug 12, 2015

From what I can see, it looks like a 15-minute build time, up from 10 minutes; not too bad. +1

@massie massie merged commit 6cade81 into bigdatagenomics:master Aug 12, 2015
@massie
Member

massie commented Aug 12, 2015

Nice addition, Frank!
