-
Notifications
You must be signed in to change notification settings - Fork 9.2k
MAPREDUCE-7369. Fixed MapReduce tasks timing out when spends more time on MultipleOutputs#close #4247
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
…e on MultipleOutputs#close
|
🎊 +1 overall
This message was automatically generated. |
|
🎊 +1 overall
This message was automatically generated. |
|
|
||
| <property> | ||
| <name>mapreduce.task.enable.ping-for-liveliness-check</name> | ||
| <value>true</value> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The value must be same with the DEFAULT_MR_TASK_ENABLE_PING_FOR_LIVELINESS_CHECK.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @iwasakims for review. I have updated the DEFAULT_MR_TASK_ENABLE_PING_FOR_LIVELINESS_CHECK to true.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
When should we turn this off? If it should be always true, we don't need to make it configurable. We usually needs new configuration knob to add new feature disabled by default for compatibility and safety.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah I think this makes sense, I will remove configurable part.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @iwasakims. I saw the comment from @cnauroth on JIRA about his suggestion to put it behind the configuration and I think we can keep it configurable and let the end user decide on it. By default, lets keep it
mapreduce.task.enable.ping-for-liveliness-check : false which will keep the current behaviour intact
and for use cases that we saw in JIRA, user can set it true when required
Thoughts?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@ashutoshcipher Making this feature disabled by default sounds right direction as @cnauroth mentioned. While no unexpected side effect was observed in the test results of current patch, there should be additional test case covering the code path of new feature enabled.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @iwasakims , let me make the required changes.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@iwasakims @cnauroth - I have made the required changes, Can you please review the PR? Thanks
|
🎊 +1 overall
This message was automatically generated. |
|
It would be nice if we can add a test case reproducing the issue covered by this fix. |
|
🎊 +1 overall
This message was automatically generated. |
… ping check feature
Reproducing this issue seems a little tricky, But I have added a test case for this feature. Thanks. |
iwasakims
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks, @ashutoshcipher. LGTM.
Since majority of boolean configuration properties are suffixed with .enabled, mapreduce.task.ping-for-liveliness-check.enabled might be better than mapreduce.task.enable.ping-for-liveliness-check. You can update before merging if you like.
…ask.ping-for-liveliness-check.enabled
|
Thanks @iwasakims for your review and approval. I have update it to |
|
🎊 +1 overall
This message was automatically generated. |
|
🎊 +1 overall
This message was automatically generated. |
…e on MultipleOutputs#close (#4247) Contributed by Ravuri Sushma sree. Co-authored-by: Ashutosh Gupta <ashugpt@amazon.com> (cherry picked from commit 36c4be8) Conflicts: hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapred/TaskAttemptListenerImpl.java
|
merged this to trunk and branch-3.3. |
…e on MultipleOutputs#close (apache#4247) Contributed by Ravuri Sushma sree. Co-authored-by: Ashutosh Gupta <ashugpt@amazon.com>
Description of PR
Fixed MapReduce tasks timing out when spends more time on MultipleOutputs#close