-
Notifications
You must be signed in to change notification settings - Fork 64
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
spark-yarn package doesn't exist in CDH repo #18
Comments
Is there a separate 'spark-yarn'? I think the regular 'spark-core' package should work (did a quick glance and it seemed like it was build with yarn support) |
Sorry maybe I wasn't clear enough. I'm talking about the maven artifacts that are defined in the mrgeo pom file. https://repository.cloudera.com/artifactory/cloudera-repos/org/apache/spark/spark-yarn_2.10/ |
For some reason, Cloudera has left out the spark_yarn_2.10 jar from its CDH5.3.x repos. They do exist in previous and more recent versions. To get around the problem, I had to download the spark release from Cloudera (located at http://archive.cloudera.com/cdh5/cdh/5/, specifically http://archive.cloudera.com/cdh5/cdh/5/spark-1.2.0-cdh5.3.2.tar.gz for CDH5.3.2). Unpacking the tar contains all the spark jars. Then take the spark_yarn_2.10 and load it into your local maven repo using a typical mvn install:install-file command. Let me know if this works for you. |
I will try soon. In the meantime I spoke to Cloudera support and they recommended to use spark-network-yarn instead of spark-yarn. |
I can confirm that your trick worked. The workaround suggested by Cloudera doesn'work at all, since spark-network-yarn depends on spark-yarn. |
this is the definition for the cdh532 build:
<spark.version>1.2.0-cdh5.3.2</spark.version>
but inside the cloudera artifactory repository it doesn't exist.
The same applies for the cdh530 or cdh520 builds.
The text was updated successfully, but these errors were encountered: