Using remote shuffle service with Spark operator #67

Open
roligupt opened this issue Apr 26, 2022 · 2 comments

Comments

@roligupt

@hiboyang Have you tried using the remote shuffle service with the Spark operator (Spark on K8s operator)?

I tested it with the client jar in my 'SparkApplication' image and it works as expected.

However, I want to include the client jar in my Spark operator image, so that every job I submit to the Spark operator uses the client jar from the operator image and I don't have to include it in every job image.

I'm pretty sure this can be done, but it would probably need code changes in the remote shuffle service?

@datapunchorg

Hi @roligupt, if you use Spark on Kubernetes (Spark operator), the remote shuffle service client jar must be inside the Spark application image, because the jar is loaded when the Spark driver starts. Otherwise, there will be an error.
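
Since the jar has to be in the application image before the driver starts, one way to catch a missing jar early is a small pre-flight check run inside the image (for example as a build-time test). The following is only a minimal sketch, not part of the remote shuffle service or the Spark operator: the directory `/opt/spark/jars` is assumed to be where the image keeps its Spark jars, and the file-name pattern `remote-shuffle-service-client*.jar` and both helper functions are hypothetical names chosen for illustration.

```python
import glob
import os

# Assumption: the image keeps its Spark jars in this directory.
DEFAULT_JARS_DIR = "/opt/spark/jars"
# Hypothetical file-name pattern; adjust it to the real client jar name.
CLIENT_JAR_PATTERN = "remote-shuffle-service-client*.jar"


def find_client_jars(jars_dir=DEFAULT_JARS_DIR, pattern=CLIENT_JAR_PATTERN):
    """Return the sorted paths of all jars in jars_dir that match pattern."""
    return sorted(glob.glob(os.path.join(jars_dir, pattern)))


def check_image(jars_dir=DEFAULT_JARS_DIR, pattern=CLIENT_JAR_PATTERN):
    """Return True when at least one matching client jar is present."""
    return bool(find_client_jars(jars_dir, pattern))
```

Calling `check_image()` inside the application image returns `False` when no matching jar is found, which is the situation in which the driver would be expected to fail at start time as described above.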

@hiboyang
Contributor

Oops, I just realized I replied using my other GitHub account. That @datapunchorg is still me.
