[WIP] Loading spark-defaults.conf when creating SparkConf instances #1233
Conversation
Merged build triggered.

Merged build started.

Merged build finished.

Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16159/

Merged build triggered.

Merged build started.

Merged build finished. All automated tests passed.

All automated tests passed.
I think this will end up conflicting with the spark-submit options.
spark-submit only loads spark-defaults.conf if --properties-file is not given on the command line. It seems like this code will always load spark-defaults.conf if one exists.
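For clarity, here is a minimal Scala sketch of the semantics being described: the defaults file is consulted only when no explicit --properties-file was passed, and only spark.* keys are honored. The object and method names are illustrative, not the actual spark-submit code.

```scala
import java.io.{File, FileInputStream, InputStreamReader}
import java.util.Properties
import scala.collection.JavaConverters._

// Illustrative sketch only: names and structure are hypothetical,
// not the actual spark-submit implementation.
object DefaultsLoadingSketch {

  // The defaults file is consulted only when the user has NOT passed
  // an explicit --properties-file; this is the behavior the comment
  // above says the PR would break.
  def resolvePropertiesFile(explicitFile: Option[String]): Option[File] =
    explicitFile.map(new File(_)).orElse {
      sys.env.get("SPARK_CONF_DIR")
        .orElse(sys.env.get("SPARK_HOME").map(_ + File.separator + "conf"))
        .map(dir => new File(dir, "spark-defaults.conf"))
        .filter(_.isFile)
    }

  // Load the file and keep only spark.* keys, mirroring spark-submit's
  // filtering of the properties file.
  def loadSparkProperties(file: File): Map[String, String] = {
    val props = new Properties()
    val reader = new InputStreamReader(new FileInputStream(file), "UTF-8")
    try props.load(reader) finally reader.close()
    props.stringPropertyNames().asScala
      .filter(_.startsWith("spark."))
      .map(k => k -> props.getProperty(k).trim)
      .toMap
  }
}
```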
Hi @witgo, can you elaborate, in the change summary, on why this is needed? Maybe file a bug? I think you guys don't use spark-submit, which is why you might need this; but spark-submit translates the config file into system properties, so the config is picked up by SparkConf. It seems like this change breaks the semantics of how spark-submit works. (I think it would be nice to stop using system properties like that - it gets really confusing when parts of the code use system properties and others use SparkConf - but that's a separate discussion.)
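To illustrate the hand-off described here: spark-submit turns the entries from the properties file into JVM system properties, and a SparkConf created with its default constructor (loadDefaults = true) picks up every system property whose key starts with "spark.". A small sketch, with an illustrative property value:

```scala
import org.apache.spark.SparkConf

object SysPropsHandOff {
  def main(args: Array[String]): Unit = {
    // Simulate what spark-submit does after reading the properties file:
    // each entry becomes a JVM system property.
    sys.props("spark.executor.memory") = "2g"

    // new SparkConf() defaults to loadDefaults = true, which copies every
    // system property whose key starts with "spark." into the conf.
    val conf = new SparkConf()
    println(conf.get("spark.executor.memory")) // prints "2g"
  }
}
```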
@vanzin The situation is, e.g.:
Ah, so it's SPARK-2098. I think it's a nice feature to have (I filed the bug, after all), but we can't break the existing semantics. For daemons, the command line parsers could handle this (by taking a "--properties-file" argument similar to spark-submit's). But if you want arbitrary SparkConf instances to read these conf files, it gets trickier, since you now need to propagate that command line information somehow.
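A rough sketch of what such a daemon-side "--properties-file" argument could look like, following the spark-submit convention of only honoring spark.* keys. parseDaemonArgs, applyPropertiesFile, and the setIfMissing-based merge policy are all hypothetical choices, not existing Spark code:

```scala
import java.io.{File, FileInputStream, InputStreamReader}
import java.util.Properties
import scala.collection.JavaConverters._
import org.apache.spark.SparkConf

// Hypothetical daemon-side argument parsing; none of this is existing Spark code.
object DaemonArgsSketch {

  // Merge a properties file into the conf without clobbering values that
  // were already set explicitly (one possible precedence policy).
  def applyPropertiesFile(conf: SparkConf, path: String): SparkConf = {
    val props = new Properties()
    val reader = new InputStreamReader(new FileInputStream(new File(path)), "UTF-8")
    try props.load(reader) finally reader.close()
    props.stringPropertyNames().asScala
      .filter(_.startsWith("spark."))
      .foreach(k => conf.setIfMissing(k, props.getProperty(k).trim))
    conf
  }

  // A daemon launcher would call this with its raw argv.
  def parseDaemonArgs(args: Array[String]): SparkConf = {
    val conf = new SparkConf()
    args.sliding(2, 2).foreach {
      case Array("--properties-file", path) => applyPropertiesFile(conf, path)
      case other => sys.error(s"Unrecognized arguments: ${other.mkString(" ")}")
    }
    conf
  }
}
```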
You're right; the corresponding code should be submitted over the weekend.