Skip to content

Commit

Permalink
Added Hive support for Sparklyr
Browse files Browse the repository at this point in the history
  • Loading branch information
Brian Panneton authored and chu11 committed Apr 25, 2019
1 parent 2332a8f commit f0f0c67
Show file tree
Hide file tree
Showing 2 changed files with 10 additions and 0 deletions.
4 changes: 4 additions & 0 deletions conf/spark/spark-defaults.conf
Original file line number Diff line number Diff line change
Expand Up @@ -66,4 +66,8 @@ spark.shuffle.sasl.timeout SPARKNETWORKTIMEOUTSECSs
spark.rdd.compress SPARKRDDCOMPRESS
spark.io.compression.codec SPARKIOCOMPRESSIONCODEC

# Sparklyr needs to know the catalogImplementation in order to read
# the HIVE tables
# spark.sql.catalogImplementation hive

@YARN@
6 changes: 6 additions & 0 deletions magpie/setup/magpie-setup-project-spark
Original file line number Diff line number Diff line change
Expand Up @@ -625,6 +625,12 @@ then
sed -i -e "/@YARN@/,+1d" ${post_sparkdefaultsconf}
fi

# Sparklyr needs to know the catalogImplementation
if [ "${HIVE_SETUP}" == "yes" ]
then
sed -i -e "s/# spark.sql.catalogImplementation hive/spark.sql.catalogImplementation hive/g" ${post_sparkdefaultsconf}
fi

# This is not for legitimate security as it can easily be
# calculated. It is for sanity "just in case we screwed up a
# config" security.
Expand Down

0 comments on commit f0f0c67

Please sign in to comment.