[SPARK-48610][SQL] refactor: use auxiliary idMap instead of OP_ID_TAG #46965

liuzqt · 2024-06-13T02:08:11Z

What changes were proposed in this pull request?

refactor: In ExplainUtils.processPlan, use auxiliary idMap instead of OP_ID_TAG

Why are the changes needed?

#45282 added synchronization to ExplainUtils.processPlan to avoid a race condition when multiple queries refer to the same cached plan.

The lock granularity is too coarse. We can fix the root cause of this concurrency issue by removing the use of the mutable OP_ID_TAG, which is at odds with the immutable nature of SparkPlan.

Instead, we can use an auxiliary ID map keyed by object identity. OP_ID_TAG is used only within ExplainUtils.processPlan, so this is safe. The map is held in a thread local so that the other classes involved can access it.
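
To illustrate why the map is keyed by object identity rather than by `equals`, here is a minimal sketch. It is not the code from this PR: `Node`, `IdMapSketch`, and `assignIds` are hypothetical names, and `Node` is only a stand-in for a plan node with structural (case-class) equality.

```scala
import java.util.{IdentityHashMap, Map => JMap}

// Hypothetical stand-in for a plan node; case-class equality is structural.
final case class Node(name: String, children: Seq[Node] = Nil)

object IdMapSketch {
  // Assign IDs in post-order, keyed by reference (eq) rather than by equals().
  // Nothing is written onto the nodes themselves, so they stay immutable.
  def assignIds(root: Node): JMap[Node, Int] = {
    val ids = new IdentityHashMap[Node, Int]()
    var next = 0
    def visit(n: Node): Unit = {
      n.children.foreach(visit)
      // A node object shared by several parents is numbered only once.
      if (!ids.containsKey(n)) {
        next += 1
        ids.put(n, next)
      }
    }
    visit(root)
    ids
  }
}
```

With an ordinary `HashMap`, two structurally equal nodes would collapse into one entry. With `IdentityHashMap`, each node object gets its own ID, and each caller can build its own map without mutating the shared plan.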

Does this PR introduce any user-facing change?

NO

How was this patch tested?

Existing UTs.

Was this patch authored or co-authored using generative AI tooling?

NO

liuzqt · 2024-06-13T17:27:49Z

@cloud-fan

cloud-fan · 2024-06-13T18:55:34Z

sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/QueryPlan.scala

-  val OP_ID_TAG = TreeNodeTag[Int]("operatorId")
  val CODEGEN_ID_TAG = new TreeNodeTag[Int]("wholeStageCodegenId")

+  val localIdMap: ThreadLocal[java.util.Map[QueryPlan[_], Int]] = ThreadLocal.withInitial(() =>


Can we define the scope of this thread local, i.e. when it is set and when it is cleared?

The scope is ExplainUtils.processPlan, but I defined it here because QueryPlan also needs it, and catalyst does not have access to the execution package. Added comments to clarify.
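
One way to make the set/clear scope of such a thread local explicit is sketched below. This is an assumption-laden illustration, not the PR's implementation: `ScopedIdMap`, `withIdMap`, `register`, and `lookup` are hypothetical names. The map exists only while the body runs, and any previous value is restored afterwards so nested calls do not clobber each other.

```scala
import java.util.{IdentityHashMap, Map => JMap}

object ScopedIdMap {
  // Holds the map for the current thread only while `withIdMap` is running.
  private val current = new ThreadLocal[JMap[AnyRef, Int]]()

  // Install a fresh identity-keyed map, run `body`, then restore the old state.
  def withIdMap[T](body: => T): T = {
    val previous = current.get()
    current.set(new IdentityHashMap[AnyRef, Int]())
    try body
    finally {
      if (previous == null) current.remove() else current.set(previous)
    }
  }

  // No-op when called outside of `withIdMap`.
  def register(node: AnyRef, id: Int): Unit =
    Option(current.get()).foreach(_.put(node, id))

  // Returns None outside of `withIdMap` or when the node has no ID.
  def lookup(node: AnyRef): Option[Int] =
    Option(current.get()).flatMap { m =>
      if (m.containsKey(node)) Some(m.get(node)) else None
    }
}
```

Because each thread sees only its own map, concurrent callers never observe each other's IDs, and the `finally` block ensures no stale map is left on the thread after the call.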

cloud-fan · 2024-06-17T06:34:11Z

thanks, merging to master/3.5!

### What changes were proposed in this pull request?

refactor: In `ExplainUtils.processPlan`, use auxiliary idMap instead of OP_ID_TAG

### Why are the changes needed?

#45282 introduced synchronize to `ExplainUtils.processPlan` to avoid race condition when multiple queries refers to same cached plan.

The granularity of lock is too large. We can try to fix the root cause of this concurrency issue by refactoring the usage of mutable `OP_ID_TAG`, which is not a good practice in terms of immutable nature of SparkPlan.

Instead, we can use an auxiliary id map, with object identity as the key. The entire scope of `OP_ID_TAG` usage is within `ExplainUtils.processPlan`, therefore it's safe to do so, with thread local to make it available in other involved classes.

### Does this PR introduce _any_ user-facing change?

NO

### How was this patch tested?

existing UTs.

### Was this patch authored or co-authored using generative AI tooling?

NO

Closes #46965 from liuzqt/SPARK-48610.

Authored-by: Ziqi Liu <ziqi.liu@databricks.com>
Signed-off-by: Wenchen Fan <wenchen@databricks.com>
(cherry picked from commit d3da240)
Signed-off-by: Wenchen Fan <wenchen@databricks.com>


github-actions bot added the SQL label Jun 13, 2024

refactor: use auxiliary idMap instead of OP_ID_TAG

c31a66f

liuzqt force-pushed the SPARK-48610 branch from 9b5d490 to c31a66f on June 13, 2024 02:09

cloud-fan reviewed Jun 13, 2024

add comments

2c028b7

liuzqt requested a review from cloud-fan June 14, 2024 21:58

cloud-fan approved these changes Jun 17, 2024

cloud-fan closed this in d3da240 Jun 17, 2024
