
Conversation


fhan688 commented Dec 26, 2018

What changes were proposed in this pull request?

Spark on Kubernetes currently creates the driver as a bare Pod by default, so the entire job fails when the host machine crashes; a driver running as a plain Pod cannot fail over in that situation. This change adds a configuration for the driver resource kind, supporting Pod, Deployment, and Job. For example, in streaming jobs, running the driver under a Deployment keeps the driver service highly available even when a host machine crashes; in batch jobs, a Job offers a configurable backoffLimit for retries.
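
For illustration only (not code from this patch), a minimal sketch of how such a setting might be supplied through SparkConf. The key name `spark.kubernetes.driver.resource.kind` is an assumption for this sketch; the real key and allowed values would be defined by the patch, and a submission-time property like this would normally be passed to spark-submit via `--conf` rather than set in application code.

```scala
import org.apache.spark.SparkConf

// Sketch only: "spark.kubernetes.driver.resource.kind" is a hypothetical key
// name for the setting proposed in this PR, not an upstream Spark config.
val conf = new SparkConf()
  .setAppName("streaming-driver-ha-example")
  // Ask the submission client to create the driver as a Deployment so that
  // Kubernetes reschedules it if the node hosting the driver pod crashes.
  .set("spark.kubernetes.driver.resource.kind", "Deployment")
```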

How was this patch tested?

Tested in a production environment. Starting the driver as a Deployment or Job keeps it highly available when the host machine crashes.

AmplabJenkins commented

Can one of the admins verify this patch?

liyinan926 (Contributor) commented

Using a Job to run the driver was discussed before in #21067, and we decided not to adopt that approach for the reasons given there. A Deployment has the same problem as a Job, i.e., lack of exactly-once semantics. If you need high availability and automatic restart/retry support for the driver, the K8s Spark Operator is worth taking a look at.
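
As a rough editorial illustration of the at-least-once behavior such restart-based controllers give (not Spark or Kubernetes code): if the driver crashes after performing an external side effect but before completing, the controller's retry reruns that side effect.

```scala
import scala.collection.mutable.ArrayBuffer

// Records external, non-idempotent side effects (e.g., writes to an output sink).
val sideEffects = ArrayBuffer.empty[String]

// Stand-in for a driver run; crashes after writing when asked to.
def runDriver(crashAfterWrite: Boolean): Unit = {
  sideEffects += "write output partition"
  if (crashAfterWrite) throw new RuntimeException("node crashed")
  // Completion would be recorded here on success.
}

// A Job/Deployment-style controller: restart until success (bounded here, like backoffLimit).
var attempt = 0
var done = false
while (!done && attempt < 3) {
  attempt += 1
  try {
    runDriver(crashAfterWrite = attempt == 1) // first attempt dies mid-run
    done = true
  } catch {
    case _: RuntimeException => // the controller simply reschedules the driver
  }
}

assert(sideEffects.size == 2) // the external write ran twice: at-least-once, not exactly-once
```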

vanzin (Contributor) commented Jan 2, 2019

@liyinan926 looks like you're saying this PR should be closed and the bug marked "won't fix" (or maybe duplicate)?

liyinan926 (Contributor) commented

@vanzin Yes.

vanzin (Contributor) commented Jan 2, 2019

Alright then, closing on your suggestion.

vanzin closed this Jan 2, 2019