[SPARK-6207] [YARN] [SQL] Adds delegation tokens for metastore to conf. #5031
Adds hive2-metastore delegation token to conf when running in secure mode. Without this change, running on YARN in cluster mode fails with a GSS exception. This contribution is my original work and I license the work to the Apache Spark project under the project's open source licenses. Author: Doug Balog <doug.balog@target.com>
|
jenkins test this please |
|
Thanks for providing the patch. Which versions of Hive have you tested this with? I'm trying to build with a Hive 13 version (not the official 0.13.1 though) and see a compile error: We will have to find a way to do the Hive stuff conditionally, as it's optional to compile it in. I'll have to look a bit more at how the Hive stuff is done now to figure out what makes sense. Adding @marmbrus to see if he has ideas, as I think he's familiar with how the Hive stuff is done now. |
|
Thanks for looking at my patch. |
So, I think the /* And Hive is enabled */ part is kinda important, and I'm not talking just about Hive being compiled in. What would happen if a user doesn't have any Hive services running?
I'm also a little worried about precedent here... couldn't we make the same argument for acquiring HBase tokens? Solr? Something else? Unfortunately, I can't think of any alternative off the bat, given the way cluster mode works... :-/
|
I have tested on secure HBase, and it didn't work. On the executor process we got the error:
|
|
@dougb I'm just curious, are you still working on this? I think a version of this could go in to at least allow it to work for the 7 days, then Hari's change could improve on it. |
|
@tgravescs yes, sorry, I'll update the pr today. I have a new version using reflection that works in my environment with Hive 0.14. |
|
@XuTingjun Sorry, this patch is for Hive, not hbase. I'm sure something similar could be created for hbase. |
- Use reflection instead of adding dependency on hive.
- Tested on Hive 0.13 and Hadoop 2.4.1
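For readers unfamiliar with the reflection approach named in the commit notes above, here is a minimal Java sketch of the general idea. It is not the code from this patch. The class `OptionalDependency`, the helper `callStaticIfPresent`, and the class name `org.example.hypothetical.MissingClient` are all hypothetical, and the example resolves a JDK class instead of a Hive class so that it runs without Hive on the classpath.

```java
import java.lang.reflect.Method;
import java.util.Optional;

// Hypothetical helper: reach an optional library only through reflection,
// so the calling module needs no compile-time dependency on it.
final class OptionalDependency {

    // Calls a public static no-argument method by name.
    // Returns Optional.empty() when the class is not on the classpath,
    // which is the "library not compiled in" case.
    static Optional<Object> callStaticIfPresent(String className, String methodName) {
        try {
            Class<?> cls = Class.forName(className);
            Method method = cls.getMethod(methodName);
            // A null receiver is valid only because the method is assumed to be static.
            return Optional.ofNullable(method.invoke(null));
        } catch (ClassNotFoundException | NoClassDefFoundError e) {
            // The optional dependency is absent: skip the work instead of failing.
            return Optional.empty();
        } catch (ReflectiveOperationException e) {
            // The class exists but the call failed: surface it as a real error.
            throw new IllegalStateException(
                "Found " + className + " but could not call " + methodName, e);
        }
    }

    public static void main(String[] args) {
        // A class that is always present in the JDK.
        System.out.println(
            callStaticIfPresent("java.lang.System", "lineSeparator").isPresent());
        // A hypothetical class that is not on the classpath.
        System.out.println(
            callStaticIfPresent("org.example.hypothetical.MissingClient", "get").isPresent());
    }
}
```

Treating a missing class as "nothing to do" and a failed call as an error keeps builds without the optional library working, while still surfacing real failures when the library is present.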
|
ok to test |
add space between if and (
|
A few nits, but other than that it looks fine. I'm going to try it out. Did you test this with Hive compiled in and then without it also? |
|
Updated code per @tgravescs's comments. |
|
Jenkins, this is ok to test. |
|
Test build #30100 has finished for PR 5031 at commit
|
|
The Scalastyle checks that failed had nothing to do with this pr.
=========================================================================
Running Scala style checks
=========================================================================
Scalastyle checks failed at following occurrences:
[error] /home/jenkins/workspace/SparkPullRequestBuilder/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala:172:8: Public method must have explicit type
[error] /home/jenkins/workspace/SparkPullRequestBuilder/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/basicOperators.scala:159:15: Public method must have explicit type
[error] (catalyst/compile:scalastyle) errors exist
[error] Total time: 7 s, completed Apr 11, 2015 7:34:10 PM
[error] /home/jenkins/workspace/SparkPullRequestBuilder/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala:172:8: Public method must have explicit type
[error] /home/jenkins/workspace/SparkPullRequestBuilder/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/basicOperators.scala:159:15: Public method must have explicit type
[error] (catalyst/compile:scalastyle) errors exist
[error] Total time: 6 s, completed Apr 11, 2015 7:34:27 PM
[error] Got a return code of 1 on line 125 of the run-tests script.
Archiving unit tests logs... |
|
I've pushed a hotfix to fix those style errors, so this should hopefully be able to test now. Let's try again... |
|
Jenkins, retest this please. |
|
Test build #30101 has finished for PR 5031 at commit
|
|
lgtm. |
Adds hive2-metastore delegation token to conf when running in secure mode. Without this change, running on YARN in cluster mode fails with a GSS exception.
This is a rough patch that adds a dependency to spark/yarn on hive-exec. I'm looking for suggestions on how to make this patch better.
This contribution is my original work and that I licenses the work to the Apache Spark project under the project's open source licenses.
Author: Doug Balog <doug.balogtarget.com>
Author: Doug Balog <doug.balog@target.com>
Closes apache#5031 from dougb/SPARK-6207 and squashes the following commits:
3e9ac16 [Doug Balog] [SPARK-6207] Fixes minor code spacing issues.
e260765 [Doug Balog] [SPARK-6207] Second pass at adding Hive delegation token to conf. - Use reflection instead of adding dependency on hive. - Tested on Hive 0.13 and Hadoop 2.4.1
1ab1729 [Doug Balog] Merge branch 'master' of git://github.com/apache/spark into SPARK-6207
bf356d2 [Doug Balog] [SPARK-6207] [YARN] [SQL] Adds delegation tokens for metastore to conf. Adds hive2-metastore delagations token to conf when running in securemode. Without this change, runing on YARN in cluster mode fails with a GSS exception.