[SPARK-54555][PYTHON][TESTS] Set `spark.sql.execution.pandas.structHandlingMode` in pyspark pandas doctest by asl3 · Pull Request #53301 · apache/spark

asl3 · 2025-12-03T04:26:34Z

What changes were proposed in this pull request?

Following #53299, this PR explicitly sets the conf `spark.sql.execution.pandas.structHandlingMode` to `row`. This is needed because, when Arrow optimization was disabled, struct values were converted to `Row` objects by default, whereas with Arrow optimization enabled they are converted to `dict`, or an exception is raised if there are duplicated nested field names.

To match the behavior shown in the docs now that Arrow is enabled by default, we explicitly set this conf to `row`.
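As a rough illustration of the change described above, the sketch below pins the conf when a session is built and then checks which Python type a struct cell takes in `toPandas()`. The helper names `build_doctest_session` and `struct_cell_type`, the `local[2]` master, and the app name are hypothetical and are not taken from the patch; only the conf key and the `row` value come from this PR. The actual change in `python/pyspark/pandas/base.py` may set the conf differently.

```python
# Conf key and the "row" value come from the PR description.
STRUCT_HANDLING_CONF = "spark.sql.execution.pandas.structHandlingMode"


def build_doctest_session(mode="row"):
    """Hypothetical helper: create a local session with struct handling pinned."""
    # Imported lazily so this module can be loaded without a Spark installation.
    from pyspark.sql import SparkSession

    return (
        SparkSession.builder.master("local[2]")  # assumed master, for illustration
        .appName("struct-handling-sketch")       # assumed app name
        .config(STRUCT_HANDLING_CONF, mode)
        .getOrCreate()
    )


def struct_cell_type(spark):
    """Return the Python type that toPandas() yields for a struct column."""
    df = spark.sql("SELECT named_struct('a', 1, 'b', 'x') AS s")
    return type(df.toPandas()["s"][0])
```

Calling `struct_cell_type(build_doctest_session())` should report the `Row` type when the mode is `row`, per the description above; passing a different mode to the helper is one way to compare the behaviors locally.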

Why are the changes needed?

To fix the pyspark-pandas doctests and remove the skips that had disabled them.

Does this PR introduce any user-facing change?

No

How was this patch tested?

CI running pyspark-pandas doctest

Was this patch authored or co-authored using generative AI tooling?

No

ueshin

LGTM, pending tests.

python/pyspark/pandas/base.py

zhengruifeng · 2025-12-04T01:51:34Z

merged to master

github-actions bot added PYTHON PANDAS API ON SPARK labels Dec 3, 2025

asl3 changed the title ~~[SPARK-54555][PYTHON][TESTS][FOLLOW-UP] Fix pyspark-pandas doctest~~ [SPARK-54555][PYTHON][TESTS] Set spark.sql.execution.pandas.structHandlingMode in pyspark pandas doctest Dec 3, 2025

ueshin approved these changes Dec 3, 2025

zhengruifeng reviewed Dec 3, 2025

python/pyspark/pandas/base.py (outdated, resolved)

asl3 added 2 commits December 3, 2025 07:16

structHandlingMode

09713d2

conf

b5b9675

asl3 force-pushed the pysparkpandasdoctest branch from c776daf to b5b9675 Compare December 3, 2025 15:17

asl3 requested a review from zhengruifeng December 3, 2025 15:35

allisonwang-db approved these changes Dec 3, 2025

fmt

67938d9

HyukjinKwon approved these changes Dec 4, 2025

zhengruifeng closed this in 74ff452 Dec 4, 2025

asl3 wants to merge 3 commits into apache:master from asl3:pysparkpandasdoctest

5 participants
