[SPARK-54615][PYTHON] Always pass runner_conf to python worker #53353

gaogaotiantian · 2025-12-05T22:40:05Z

What changes were proposed in this pull request?

Always pass runnerConf to the Python worker, even if it is not used.

Why are the changes needed?

This is part of the effort to consolidate the protocol between the JVM and the Python worker. We currently pass the runner conf in several different ways, and sometimes we don't pass it at all. That makes the worker-side code messy: the worker has to decide whether to read the conf based on the eval type. Reading an empty conf is very cheap, so we can just read it unconditionally.

With this infrastructure in place, vanilla Python UDFs can also pass runner conf in the future, and we can refactor our JVM worker code later.
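The idea of "always write the conf, always read the conf" can be illustrated with a minimal, self-contained sketch. It does not reproduce the actual Spark code: the wire format (a 4-byte big-endian entry count followed by length-prefixed UTF-8 key/value pairs), the helper names, and the config key are all assumptions made for illustration.

```python
import io
import struct


def write_int(value, stream):
    # 4-byte big-endian signed int (assumed framing, for illustration only).
    stream.write(struct.pack("!i", value))


def write_utf8(text, stream):
    data = text.encode("utf-8")
    write_int(len(data), stream)
    stream.write(data)


def read_int(stream):
    return struct.unpack("!i", stream.read(4))[0]


def read_utf8(stream):
    length = read_int(stream)
    return stream.read(length).decode("utf-8")


def write_runner_conf(conf, stream):
    """Sender side: always emit the conf, even when it is empty."""
    write_int(len(conf), stream)
    for key, value in conf.items():
        write_utf8(key, stream)
        write_utf8(value, stream)


def read_runner_conf(stream):
    """Worker side: always read the conf, with no eval-type branching."""
    conf = {}
    for _ in range(read_int(stream)):
        key = read_utf8(stream)
        conf[key] = read_utf8(stream)
    return conf


# An empty conf costs only the 4-byte entry count in this sketch.
empty = io.BytesIO()
write_runner_conf({}, empty)
empty.seek(0)
print(len(empty.getvalue()), read_runner_conf(empty))

# A non-empty conf goes through the same code path ("spark.example.key" is hypothetical).
full = io.BytesIO()
write_runner_conf({"spark.example.key": "value"}, full)
full.seek(0)
print(read_runner_conf(full))
```

Because the reader never needs to know whether the sender had anything to say, the per-eval-type check on the worker side disappears, at the cost of a few bytes for the empty case.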

Does this PR introduce any user-facing change?

No

How was this patch tested?

pyspark-sql passed locally. Running the rest on CI.

Was this patch authored or co-authored using generative AI tooling?

No

HyukjinKwon · 2025-12-07T22:49:42Z

Merged to master.

### What changes were proposed in this pull request?

Always pass runnerConf to python worker, even if it's not used.

### Why are the changes needed?

This is part of the effort to consolidate our protocol from JVM to the worker. We have different ways to pass the runner conf now and sometimes we just don't pass it. It makes the worker side code a bit messy - we need to determine whether to read the conf based on eval type. However reading an empty conf is super cheap and we can just do it regardless.

With this infra, vanilla python udfs can also pass some runner conf in the future. We can do some refactoring on our JVM worker code in the future.

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

`pyspark-sql` passed locally. Running the rest on CI.

### Was this patch authored or co-authored using generative AI tooling?

No

Closes apache#53353 from gaogaotiantian/always-pass-runnerconf.

Authored-by: Tian Gao <gaogaotiantian@hotmail.com>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>

Always pass runner conf to python worker

17e9eb2

github-actions bot added the SQL, STRUCTURED STREAMING, CORE, and PYTHON labels Dec 5, 2025

reformat

75df6e6

gaogaotiantian marked this pull request as ready for review December 6, 2025 04:00

HyukjinKwon approved these changes Dec 7, 2025

HyukjinKwon changed the title ~~[SPARK-54615] Always pass runner_conf to python worker~~ [SPARK-54615][PYTHON] Always pass runner_conf to python worker Dec 7, 2025

HyukjinKwon closed this in d4de913 Dec 7, 2025

gaogaotiantian deleted the always-pass-runnerconf branch December 19, 2025 00:13
