
Conversation

@pan3793 (Member) commented Jun 13, 2024

What changes were proposed in this pull request?

Log TID for "input split" in HadoopRDD and NewHadoopRDD

Why are the changes needed?

This change benefits both cases: structured logging enabled and disabled.

When structured logging is disabled and executor cores > 1, the logs of concurrent tasks are interleaved in stdout, for example:

24/06/12 21:40:10 INFO Executor: Running task 26.0 in stage 2.0 (TID 10)
24/06/12 21:40:10 INFO Executor: Running task 27.0 in stage 2.0 (TID 11)
24/06/12 21:40:11 INFO HadoopRDD: Input split: hdfs://.../part-00025-53bc40ae-399f-4291-b5ac-617c980deb86-c000:0+124138257
24/06/12 21:40:11 INFO HadoopRDD: Input split: hdfs://.../part-00045-53bc40ae-399f-4291-b5ac-617c980deb86-c000:0+121726684

it is hard to tell which file is read by which task, because the tasks run in parallel.

If something goes wrong, the log prints the TID and the exception stack trace. The error may be related to the input data; sometimes the exception message is clear enough to show which file the input came from, but sometimes it is not, and in the latter case the current logs do not let us identify the bad file quickly.

24/06/12 21:40:18 ERROR Executor: Exception in task 27.0 in stage 2.0 (TID 11)
(... exception message)
(... stacktraces)

When structured logging is enabled, exposing the TID as a LogKey makes the logs easier to filter selectively (a self-contained sketch of this pattern follows this description).

Does this PR introduce any user-facing change?

Yes, it adds additional information to the logs.

How was this patch tested?

Manual review, as the change only touches log contents.

Was this patch authored or co-authored using generative AI tooling?

No
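
As a rough illustration of the log"..."/MDC pattern referenced above, here is a minimal, self-contained sketch. It only mimics the mechanism: the object, trait, and interpolator below (StructuredLogSketch, LogKey, the tuple-returning log method) are illustrative stand-ins, not Spark's internal logging API, although the key names TASK_ID and INPUT_SPLIT match the ones used in the actual change.

```scala
// Minimal, self-contained sketch of the idea behind the log"..." / MDC pattern.
// Illustration only; this is NOT Spark's internal logging implementation.
object StructuredLogSketch {

  // Illustrative log keys (Spark keeps its real keys in an internal registry).
  sealed trait LogKey
  case object TASK_ID extends LogKey
  case object INPUT_SPLIT extends LogKey

  // A value tagged with the key it should be exposed under in structured output.
  final case class MDC(key: LogKey, value: Any)

  // A tiny log"..." interpolator: renders the plain-text message and also
  // collects the tagged values as key/value pairs.
  implicit class LogInterpolator(sc: StringContext) {
    def log(args: MDC*): (String, Map[LogKey, Any]) = {
      val rendered = sc.parts.zipAll(args.map(_.value.toString), "", "")
        .map { case (part, value) => part + value }
        .mkString
      (rendered, args.map(m => m.key -> m.value).toMap)
    }
  }

  def main(argv: Array[String]): Unit = {
    val tid = 11L
    val split = "hdfs://.../part-00045-53bc40ae-399f-4291-b5ac-617c980deb86-c000:0+121726684"

    val (message, fields) =
      log"Task (TID ${MDC(TASK_ID, tid)}) input split: ${MDC(INPUT_SPLIT, split)}"

    println(message) // plain-text logs: the TID is embedded in the message
    println(fields)  // structured logs: filter/search by the TASK_ID field
  }
}
```

Both outputs carry the TID: the plain message covers the structured-logging-disabled case, and the key/value map shows why exposing TASK_ID as a LogKey makes structured logs selectable by task.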

@github-actions github-actions bot added the CORE label Jun 13, 2024
@pan3793 (Member, Author) commented Jun 13, 2024

@LuciferYang (Contributor) left a comment


+1, LGTM


  private val split = theSplit.asInstanceOf[HadoopPartition]
- logInfo(log"Input split: ${MDC(INPUT_SPLIT, split.inputSplit)}")
+ logInfo(log"Task (TID ${MDC(TASK_ID, context.taskAttemptId())}) input split: " +
Member

Use TASK_ATTEMPT_ID?

Member Author

context.taskAttemptId() is the task ID; this was clarified in #45834 (comment).
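
For reference, a small sketch of the IDs discussed in this thread, using TaskContext's public accessors; the object and method names below (TaskIdsSketch, describe) are illustrative, only the TaskContext calls are real API:

```scala
import org.apache.spark.TaskContext

object TaskIdsSketch {
  // Call this from inside a running task (e.g. within rdd.mapPartitions { ... }),
  // where TaskContext.get() returns the current task's context.
  // As discussed above, taskAttemptId() is the SparkContext-wide unique ID that
  // executor logs print as "TID n", while attemptNumber() is the per-task retry
  // counter (0 on the first attempt).
  def describe(ctx: TaskContext): String =
    s"stage ${ctx.stageId()}, partition ${ctx.partitionId()}, " +
      s"TID ${ctx.taskAttemptId()}, attempt #${ctx.attemptNumber()}"
}
```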

@gengliangwang (Member) commented

Thanks, merging to master
