Skip to content

Conversation

@brennonyork
Copy link

Removed sleep() from the stop() method of the TaskSchedulerImpl class which, from the JIRA ticket, is believed to be a legacy artifact slowing down testing originally introduced in the ClusterScheduler class.

@AmplabJenkins
Copy link

Can one of the admins verify this patch?

@rxin
Copy link
Contributor

rxin commented Dec 31, 2014

Jenkins, test this please.

@SparkQA
Copy link

SparkQA commented Dec 31, 2014

Test build #24921 has started for PR 3851 at commit 04c3e64.

  • This patch merges cleanly.

@SparkQA
Copy link

SparkQA commented Dec 31, 2014

Test build #24921 has finished for PR 3851 at commit 04c3e64.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins
Copy link

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24921/
Test FAILed.

@JoshRosen
Copy link
Contributor

Jenkins, retest this please.

@SparkQA
Copy link

SparkQA commented Dec 31, 2014

Test build #24926 has started for PR 3851 at commit 04c3e64.

  • This patch merges cleanly.

@SparkQA
Copy link

SparkQA commented Dec 31, 2014

Test build #24926 has finished for PR 3851 at commit 04c3e64.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • class GaussianMixtureModel(
    • case class Sort(
    • case class BroadcastLeftSemiJoinHash(

@AmplabJenkins
Copy link

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24926/
Test FAILed.

@JoshRosen
Copy link
Contributor

Jenkins, retest this please.

@SparkQA
Copy link

SparkQA commented Dec 31, 2014

Test build #24968 has started for PR 3851 at commit 04c3e64.

  • This patch merges cleanly.

@SparkQA
Copy link

SparkQA commented Dec 31, 2014

Test build #24968 has finished for PR 3851 at commit 04c3e64.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins
Copy link

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24968/
Test PASSed.

@JoshRosen
Copy link
Contributor

Ping @mridulm, since it looks like you might have introduced the first version of this sleep call back in 2013: d90d2af#diff-4e188f32951dc989d97fa7577858bc7cR397

I don't see a good reason to keep this sleep; the tests pass without it and its removal seems to have given us a big test performance boost.

@JoshRosen
Copy link
Contributor

@tdas, what do you think about merging this? It looks like you were the last one to touch this line in 27311b1. This fixed sleep() seems race-prone enough that I suppose we would have noticed if it was necessary for anything because it would have caused test flakiness.

@JoshRosen
Copy link
Contributor

I'm inclined to merge this into master now and not perform any backports right away (maybe it's still serving some purpose in older branches?).

@tdas
Copy link
Contributor

tdas commented Jan 4, 2015

@JoshRosen I dont remember much of the context around this. From the commit comment, I think I had introduced it for testing out metadata cleaning in Spark 1.0, so that all messages gets sent if the scheduler and everything is shutdown. Off the top of my head, I am not sure if removing this would cause flakiness due to unsent messages. If tests pass without it without flakiness, I am good with it.

@JoshRosen
Copy link
Contributor

Alright, I'm going to merge this into master (1.3.0); after I do that, let's wait a bit and see if it adds any flakiness. If this does serve some purpose, then we'll be able to implement a better solution that doesn't rely on fixed timeouts.

@asfgit asfgit closed this in b96008d Jan 4, 2015
srowen added a commit to srowen/spark that referenced this pull request Feb 26, 2015
asfgit pushed a commit that referenced this pull request Feb 26, 2015
Backport #3851 to branch 1.2: remove Thread.sleep(1000) in TaskSchedulerImpl.
Teeing this up for Jenkins per discussion in the JIRA / PR.

Author: Sean Owen <sowen@cloudera.com>

Closes #4793 from srowen/SPARK-795.2 and squashes the following commits:

5f5db4a [Sean Owen] Backport #3851 to branch 1.2: remove Thread.sleep(1000) in TaskSchedulerImpl
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants