
Truncates long streaming job confs #1773

Merged 1 commit on Jul 29, 2016


daveFNbuck
Contributor

Description

Adds stream.jobconf.truncate.limit=20000 to the jobconfs of streaming jobs. The value 20000 is the default and can be changed via the jobconf_truncate task property.
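
As a rough illustration of the behavior described above, the sketch below appends the truncation setting to a list of jobconfs and renders them as `-D` options for a streaming command. The function name `streaming_jobconf_args` and the use of `None` to skip the setting are assumptions made for this example; this is not the code from the commit.

```python
def streaming_jobconf_args(jobconfs, jobconf_truncate=20000):
    """Return '-D key=value' arguments for a streaming job.

    Hypothetical helper for illustration only. `jobconf_truncate` mirrors
    the task property named in this PR; passing None (an assumption of
    this sketch) leaves the truncation setting out entirely.
    """
    confs = list(jobconfs)
    if jobconf_truncate is not None:
        # The setting this PR adds to every streaming job by default.
        confs.append("stream.jobconf.truncate.limit=%d" % jobconf_truncate)

    args = []
    for conf in confs:
        # Hadoop's generic options take each property as "-D key=value".
        args += ["-D", conf]
    return args


if __name__ == "__main__":
    print(streaming_jobconf_args(["mapred.job.name=wordcount"]))
```

With the default, the truncation limit is appended after any jobconfs the task already defines, so existing settings are left untouched.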

Motivation and Context

When running a streaming job with lots of inputs, the job conf gets passed to
each mapper. This can cause an exception from passing too many arguments to the
mapper. The full job conf is not actually needed, so it's safe to truncate. 20000
is recommended as a safe value for this truncation in
http://aajisaka.github.io/hadoop-project/hadoop-streaming/HadoopStreaming.html#What_do_I_do_if_I_get_a_error7_Argument_list_too_long
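
To make the failure mode concrete, here is a small model of what happens when a job conf with a very long input list is handed to a mapper process, with and without a truncation limit. It is a simplified sketch, not Hadoop's implementation: the function `to_streaming_env`, the variable-name mangling, the conf key, and the generated paths are all assumptions chosen for illustration.

```python
def to_streaming_env(jobconf, truncate_limit=None):
    """Model exporting job conf entries to a mapper's environment.

    Simplified, hypothetical model: each value longer than
    `truncate_limit` characters is cut to that length. None means
    "do not truncate" in this sketch.
    """
    env = {}
    for key, value in jobconf.items():
        # Assumed mangling: non-alphanumeric characters become underscores.
        name = "".join(c if c.isalnum() else "_" for c in key)
        if truncate_limit is not None and len(value) > truncate_limit:
            value = value[:truncate_limit]
        env[name] = value
    return env


if __name__ == "__main__":
    # A made-up job with many input paths joined into one conf value.
    inputs = ",".join("/data/part-%05d" % i for i in range(50000))
    jobconf = {"example.input.dirs": inputs, "example.job.name": "wordcount"}

    full = to_streaming_env(jobconf)
    cut = to_streaming_env(jobconf, truncate_limit=20000)
    print(sum(len(v) for v in full.values()))  # grows with the number of inputs
    print(sum(len(v) for v in cut.values()))   # bounded per value by the limit
```

The point of the model is that, with a limit in place, the size of what is passed to each mapper no longer grows with the number of inputs.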

Have you tested this? If so, how?

Ran a MR job that failed without this. It now runs fine.

@mention-bot

@daveFNbuck, thanks for your PR! By analyzing the annotation information on this pull request, we identified @Tarrasch, @mvj3 and @geowurster to be potential reviewers.

@Tarrasch
Contributor

LGTM

@dchentech
Contributor

LGTM 2

@Tarrasch Tarrasch merged commit 0532453 into spotify:master Jul 29, 2016
@daveFNbuck daveFNbuck deleted the streaming_truncate branch June 2, 2017 00:03