
Conversation

@tooptoop4 (Contributor) commented Feb 25, 2020

What changes were proposed in this pull request?

Don't allow drivers to use all the cores in the Spark standalone scheduler; reserve some cores for applications.

Why are the changes needed?

When Airflow triggers many Spark jobs in parallel, the standalone cluster fills up with drivers and can never execute the applications, because the drivers have taken all the cores. The cluster essentially becomes stuck: it never completes any driver (no cores are available to run the associated applications) and therefore never frees up cores. Manual intervention is needed to either kill some drivers (which Airflow may simply retry) or force a scale-up of the cluster by registering more workers (beyond the ASG MaxSize).

Does this PR introduce any user-facing change?

No by default; users must opt in by explicitly setting the config value to something greater than 0.
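
For illustration, spark.deploy.* settings are read by the standalone Master rather than by individual applications, so opting in would look roughly like the snippet below in the Master host's conf/spark-defaults.conf (or the equivalent -D flag in SPARK_MASTER_OPTS). The value 8 is just an example; only the config key comes from this PR's diff.

# conf/spark-defaults.conf on the host running the standalone Master:
# keep 8 worker cores free for application executors (hypothetical example value)
spark.deploy.coresReservedForApps   8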

How was this patch tested?

It has been running in production for 3 months.

@AmplabJenkins:

Can one of the admins verify this patch?

@Ngone51 (Member) left a comment:

How about limiting the number of concurrently running drivers instead? Personally, I feel it's more convenient for users to set a number of drivers rather than a number of cores. WDYT?

@tooptoop4 (Contributor, Author):

@Ngone51 but each driver can have a different number of cores.

@Ngone51 (Member) commented Feb 26, 2020

@Ngone51 but each driver can have a different number of cores.

Yeah, but in that case many drivers race for the reserved cores concurrently; how can we know which drivers win the race, and how do we decide the number of cores for them?

@tooptoop4 (Contributor, Author):

@Ngone51 As long as some cores are not used by any driver, I don't mind which drivers get in first. I just want some cores that can only be consumed by apps.

@Ngone51 (Member) commented Feb 26, 2020

I just want some cores that can only be consumed by apps

When we limit the number of running drivers, we'll definitely have reserved cores for the apps.

@tooptoop4 (Contributor, Author) commented Feb 26, 2020

@Ngone51 That is worse. One day you could have 10 drivers each taking 1 core; another day you could have 5 drivers each taking 4 cores. If you limited both days to 10 drivers, all cores would be used by drivers on the second day, so a driver limit does not work well with differently sized drivers; cores are the simple, uniform measure. Secondly, if I want a scheduled scale-up of the Spark cluster, specifying a portion of cores to leave unused still works, whereas fixing a maximum number of drivers does not, because it limits the ability to take advantage of the larger cluster.

@jiangxb1987 (Contributor):

What's blocking you from choosing the client deploy mode? Using client deploy mode would let you launch the driver program locally, so you would not need to worry about drivers being involved in cluster resource contention.

@jiangxb1987 (Contributor):

Speaking of the proposed improvement, it does somewhat mitigate the issue by allowing you to launch at least a few executors, but it would probably still be the case that not enough cluster resources go to executors, so many drivers would still need to wait for free slots to launch their pending tasks. Without other considerations, I think you should give client deploy mode a try (or do you already have other issues that prevent you from choosing client mode)?

@tooptoop4 (Contributor, Author):

@jiangxb1987 Client deploy mode is subject to the limits of a single machine: if I want to run 200+ spark-submits in parallel, that machine must have enough memory to support all of those drivers. With my change, a few executors is all I need, and I can even reserve hundreds of cores just for apps with this new config.

@jiangxb1987 (Contributor) commented Feb 26, 2020

Sorry, I don't have enough context to understand your use case, but submitting 200+ applications to a Spark cluster at the same time is not something I would expect. Basically, I would expect far fewer applications, with each application submitting a few jobs, so we don't really need to launch that many drivers.

@tooptoop4 (Contributor, Author) commented Feb 27, 2020

I am building a data lake where tens of thousands of files with different schemas are ingested daily; client mode does not handle the concurrency/scale I need. Also, I use the Spark REST API (spark.master.rest.port), which does not support client mode. What is the reluctance to merging this?

@tooptoop4 (Contributor, Author):

Can you please merge this, @dongjoon-hyun?

.createWithDefault(Int.MaxValue)

val CORES_RESERVED_FOR_APPS = ConfigBuilder("spark.deploy.coresReservedForApps")
.version("2.4.6")
Review comment (Member) on the diff excerpt above:

Hi, @tooptoop4. We don't backport features; the feasible next version is 3.1.0.
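
For context, here is a minimal sketch of how such an entry is typically declared among Spark's other spark.deploy.* definitions. Only the key and the Int type come from the diff above; the enclosing object name, doc string, validation, and the 0 default (implied by the opt-in note in the PR description) are assumptions, and 3.1.0 follows the reviewer's remark rather than the 2.4.6 shown in the diff.

package org.apache.spark.internal.config

// Hypothetical sketch, not merged code (the PR was ultimately closed). It sits in the
// internal config package so the private[spark] ConfigBuilder is in scope.
private[spark] object DeployCoreReservationSketch {
  val CORES_RESERVED_FOR_APPS = ConfigBuilder("spark.deploy.coresReservedForApps")
    .doc("Number of worker cores the standalone Master keeps free for application " +
      "executors; drivers are never scheduled onto these cores. 0 disables the reservation.")
    .version("3.1.0") // features are not backported, so 3.1.0 rather than 2.4.6
    .intConf
    .checkValue(_ >= 0, "spark.deploy.coresReservedForApps must be non-negative.")
    .createWithDefault(0)
}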

@jiangxb1987 (Contributor):

The proposed improvement is just not necessary. I don't see any need to submit 200+ applications; you could start one application and submit multiple jobs.

launched = true
val allFreeCores = shuffledAliveWorkers.map(_.coresFree).sum
val forDriversFreeCores = math.max(allFreeCores - coresReservedForApps, 0)
if (forDriversFreeCores > 0) {
Review comment (Member) on the diff excerpt above:

Could you make a test case for this?
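
Since the change shipped without a test, here is a minimal sketch of the kind of check being asked for, covering only the reservation arithmetic from the diff above rather than the full Master scheduling path; the suite name and the local helper are hypothetical.

import org.scalatest.funsuite.AnyFunSuite

class CoresReservedForAppsSuite extends AnyFunSuite {
  // Mirrors the expression in the diff: math.max(allFreeCores - coresReservedForApps, 0)
  private def coresAvailableForDrivers(allFreeCores: Int, coresReservedForApps: Int): Int =
    math.max(allFreeCores - coresReservedForApps, 0)

  test("drivers never see the reserved cores") {
    // Part of the free cores is held back for executors.
    assert(coresAvailableForDrivers(allFreeCores = 16, coresReservedForApps = 4) === 12)
    // A reservation larger than the free cores yields 0 for drivers, never a negative value.
    assert(coresAvailableForDrivers(allFreeCores = 2, coresReservedForApps = 8) === 0)
    // With the default of 0, behaviour is unchanged: every free core is eligible for drivers.
    assert(coresAvailableForDrivers(allFreeCores = 16, coresReservedForApps = 0) === 16)
  }
}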

@dongjoon-hyun changed the title from "[SPARK-27750] Standalone scheduler - ability to prioritize applications over drivers, many drivers act like Denial of Service" to "[SPARK-27750][CORE] Standalone scheduler - ability to prioritize applications over drivers, many drivers act like Denial of Service" on Feb 28, 2020.
@tooptoop4 (Contributor, Author):

@jiangxb1987 So your suggestion is: don't scale?

@jiangxb1987 (Contributor):

What do you mean by scale? You don't need 200 drivers; you can still launch one application and submit your jobs to it, and that way your workload should work.

@tooptoop4 (Contributor, Author):

1. One app is going to be slower.
2. What if 7 out of 200 jobs fail? The whole app fails, meaning poor resumability.
3. Files arrive at different times, so I can't wait for all of them to arrive and then run a single app.

This PR fixes a clear bug: Spark standalone gets itself stuck. If you don't like the number 200, think of submitting 6 apps in cluster mode on a 4-core cluster; Spark gets full of drivers, no apps can ever run, and it stays stuck forever. With this fix, I can guarantee some cores for apps so Spark never gets stuck running only drivers (see the sketch below for the 4-core scenario worked through).
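
To make that 4-core scenario concrete, here is a purely illustrative Scala sketch (not code from the PR); the 1-core-per-driver figure and the reservation of 1 core are example values.

object StandaloneDeadlockSketch {
  def main(args: Array[String]): Unit = {
    val totalCores = 4     // cores in the example cluster
    val driverCores = 1    // example core request of each submitted driver
    val queuedDrivers = 6  // cluster-mode submissions waiting at the Master

    // Without a reservation the Master keeps placing drivers while free cores remain:
    // 4 drivers start, 0 cores are left for executors, and nothing ever finishes.
    val driversNoReservation = math.min(queuedDrivers, totalCores / driverCores)
    println(s"no reservation: $driversNoReservation drivers running, " +
      s"${totalCores - driversNoReservation * driverCores} cores left for executors")

    // Reserving even 1 core caps drivers at 3, so 1 core can run an executor, one app can
    // finish, its driver's core is freed, and the queue eventually drains.
    val reserved = 1
    val driversWithReservation =
      math.min(queuedDrivers, math.max(totalCores - reserved, 0) / driverCores)
    println(s"reserved=$reserved: $driversWithReservation drivers running, " +
      s"${totalCores - driversWithReservation * driverCores} cores left for executors")
  }
}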

@jiangxb1987 (Contributor):

I'm -1 on this change, because it's trying to resolve an issue that doesn't even exist.
Please read https://spark.apache.org/docs/latest/cluster-overview.html before you ask. Thanks!

@dongjoon-hyun (Member):

Thank you for your proposal, @tooptoop4. However, given the discussion above, I'll close this PR. Thank you, @tooptoop4, @Ngone51, @jiangxb1987.
