Modify defaults for repartitioning #1138

RobinL · 2023-03-22T16:45:14Z

Once merged, this should close #1136. See the issue for comments on why this is required.

github-actions · 2023-03-22T16:46:41Z

Test: test_2_rounds_1k_duckdb

Percentage change: 3.0%

	date	time	stats_mean	stats_min	commit_info_branch	commit_info_id	machine_info_cpu_brand_raw	machine_info_cpu_hz_actual_friendly	commit_hash
849	2022-07-12	18:40:05	1.89098	1.87463	splink3	`c334bb9`	Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz	2.7934 GHz	`c334bb9`
1494	2023-03-22	16:59:10	1.95273	1.93026	(detached head)	`5a590cf`	Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz	2.2947 GHz	`5a590cf`

Test: test_2_rounds_1k_sqlite

Percentage change: 1.0%

	date	time	stats_mean	stats_min	commit_info_branch	commit_info_id	machine_info_cpu_brand_raw	machine_info_cpu_hz_actual_friendly	commit_hash
851	2022-07-12	18:40:05	4.32179	4.25898	splink3	`c334bb9`	Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz	2.7934 GHz	`c334bb9`
1496	2023-03-22	16:59:10	4.31679	4.30103	(detached head)	`5a590cf`	Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz	2.2947 GHz	`5a590cf`


RobinL · 2023-03-22T16:50:27Z

splink/spark/spark_linker.py

+        try:
+            parallelism_value = self.spark.conf.get("spark.default.parallelism")
+            parallelism_value = int(parallelism_value)
+        except Exception:


I didn't want to catch py4j.protocol.Py4JJavaError explicitly, in case the library changes between versions of PySpark.

Ultimately, if this fails for whatever reason, we can just fall back on the default.
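For illustration, the fallback pattern described in this comment can be sketched in isolation. This is a minimal sketch, not the code in splink/spark/spark_linker.py: the helper name `read_parallelism`, the stand-in conf classes, and the default of 200 used in the usage lines are all hypothetical and chosen only so the example runs without a Spark session.

```python
def read_parallelism(conf, default):
    """Return spark.default.parallelism as an int, or `default` on any failure.

    `conf` is any object with a `get(key)` method (for example `spark.conf`).
    """
    try:
        value = conf.get("spark.default.parallelism")
        return int(value)
    except Exception:
        # Deliberately broad: the exception raised for an unset key may differ
        # between PySpark versions, so any failure falls back to the default.
        return default


class DictConf:
    """Hypothetical stand-in for spark.conf, backed by a dict."""

    def __init__(self, values):
        self._values = values

    def get(self, key):
        # Raises KeyError when the key is unset, mimicking a failed lookup.
        return self._values[key]


# Usage: 200 is a placeholder default for this sketch only.
configured = read_parallelism(DictConf({"spark.default.parallelism": "8"}), 200)
unset = read_parallelism(DictConf({}), 200)
```

Catching `Exception` rather than a specific py4j error type trades precision for robustness: an unset key and a non-numeric value both end up on the same fallback path.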

RobinL added 2 commits March 22, 2023 16:44

modify defaults for repartitioning

b8ee657

bump version

f1e8329


fix typo

cb77088

RobinL merged commit cb9c0e7 into master Mar 22, 2023

RobinL deleted the issue_1136_repartitioning_fix branch August 12, 2024 10:06
