
Support rolling spark.kubernetes.file.upload.path #6876

Open · pan3793 wants to merge 6 commits into master
Conversation

@pan3793 (Member) commented Dec 30, 2024

Why are the changes needed?

Vanilla Spark supports neither a rolling nor an expiration mechanism for spark.kubernetes.file.upload.path. If you use a file system that does not support TTL, e.g. HDFS, additional cleanup mechanisms are needed to prevent the files in this directory from growing indefinitely.

This PR proposes to let spark.kubernetes.file.upload.path support the placeholders {{YEAR}}, {{MONTH}} and {{DAY}}, and introduces a switch, kyuubi.kubernetes.spark.autoCreateFileUploadPath.enabled, to let the Kyuubi server create the directory with 777 permission automatically before submitting the Spark application.

For example, a user can set the following in kyuubi-defaults.conf to enable monthly rolling of spark.kubernetes.file.upload.path:

kyuubi.kubernetes.spark.autoCreateFileUploadPath.enabled=true
spark.kubernetes.file.upload.path=hdfs://hadoop-cluster/spark-upload-{{YEAR}}{{MONTH}}

Note that Spark creates a subdirectory s"spark-upload-${UUID.randomUUID()}" under spark.kubernetes.file.upload.path for each upload, so the administrator still needs to clean up the staging directory periodically.

For example:

hdfs://hadoop-cluster/spark-upload-202412/spark-upload-f2b71340-dc1d-4940-89e2-c5fc31614eb4
hdfs://hadoop-cluster/spark-upload-202412/spark-upload-173a8653-4d3e-48c0-b8ab-b7f92ae582d6
hdfs://hadoop-cluster/spark-upload-202501/spark-upload-3b22710f-a4a0-40bb-a3a8-16e481038a63

The administrator can safely delete hdfs://hadoop-cluster/spark-upload-202412 after 2025-01-01.
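
To make the mechanism concrete, here is a minimal Scala sketch of how placeholder resolution and directory pre-creation could work. UploadPathHelper and its methods are hypothetical names for illustration, not the PR's actual code:

```scala
import java.time.LocalDate
import java.time.format.DateTimeFormatter

import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.fs.Path
import org.apache.hadoop.fs.permission.FsPermission

// Hypothetical helper, for illustration only.
object UploadPathHelper {

  // Substitute {{YEAR}}, {{MONTH}} and {{DAY}} with the given date.
  def resolve(template: String, date: LocalDate = LocalDate.now()): String =
    template
      .replace("{{YEAR}}", date.format(DateTimeFormatter.ofPattern("yyyy")))
      .replace("{{MONTH}}", date.format(DateTimeFormatter.ofPattern("MM")))
      .replace("{{DAY}}", date.format(DateTimeFormatter.ofPattern("dd")))

  // Pre-create the resolved directory with 777 permission before spark-submit.
  def ensureDir(resolved: String, hadoopConf: Configuration): Unit = {
    val path = new Path(resolved)
    val fs = path.getFileSystem(hadoopConf)
    if (!fs.exists(path)) {
      fs.mkdirs(path)
      // mkdirs is subject to the configured umask, so set 777 explicitly.
      fs.setPermission(path, new FsPermission("777"))
    }
  }
}
```

With spark.kubernetes.file.upload.path=hdfs://hadoop-cluster/spark-upload-{{YEAR}}{{MONTH}}, resolve would return hdfs://hadoop-cluster/spark-upload-202412 for any date in December 2024.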

How was this patch tested?

New UTs are added.

Was this patch authored or co-authored using generative AI tooling?

No.

@codecov-commenter commented Dec 30, 2024

Codecov Report

Attention: Patch coverage is 0% with 40 lines in your changes missing coverage. Please review.

Project coverage is 0.00%. Comparing base (e8cbff3) to head (6614bf2).
Report is 4 commits behind head on master.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| ...ache/kyuubi/engine/spark/SparkProcessBuilder.scala | 0.00% | 33 Missing ⚠️ |
| ...in/scala/org/apache/kyuubi/config/KyuubiConf.scala | 0.00% | 5 Missing ⚠️ |
| ...kyuubi/engine/spark/SparkBatchProcessBuilder.scala | 0.00% | 2 Missing ⚠️ |
Additional details and impacted files
@@          Coverage Diff           @@
##           master   #6876   +/-   ##
======================================
  Coverage    0.00%   0.00%           
======================================
  Files         688     688           
  Lines       42545   42589   +44     
  Branches     5800    5805    +5     
======================================
- Misses      42545   42589   +44     


@turboFei (Member) commented:

will check it today.

@turboFei (Member) commented Dec 31, 2024

Note that:

kyuubi.kubernetes.spark.autoCreateFileUploadPath.enabled=true
spark.kubernetes.file.upload.path=hdfs://hadoop-cluster/spark-upload-{{YEAR}}{{MONTH}}

Spark creates a subdirectory s"spark-upload-${UUID.randomUUID()}" under spark.kubernetes.file.upload.path for each upload, so the administrator still needs to clean up the staging directory periodically.

For example:

hdfs://hadoop-cluster/spark-upload-202412/spark-upload-f2b71340-dc1d-4940-89e2-c5fc31614eb4
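
For illustration, the full staging path is the configured (resolved) upload path plus a random per-upload subdirectory, roughly like this sketch (not Spark's actual code):

```scala
import java.util.UUID

// Roughly how the per-upload staging directory is derived (illustrative).
val uploadPath = "hdfs://hadoop-cluster/spark-upload-202412"
val stagingDir = s"$uploadPath/spark-upload-${UUID.randomUUID()}"
```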

Comment on lines +51 to +56
Vanilla Spark supports neither a rolling nor an expiration mechanism for `spark.kubernetes.file.upload.path`. If you use
a file system that does not support TTL, e.g. HDFS, additional cleanup mechanisms are needed to prevent the files in this
directory from growing indefinitely. Since Kyuubi v1.11.0, you can configure `spark.kubernetes.file.upload.path` with the
placeholders `{{YEAR}}`, `{{MONTH}}` and `{{DAY}}`, and enable `kyuubi.kubernetes.spark.autoCreateFileUploadPath.enabled`
to let the Kyuubi server create the directory with 777 permission automatically before submitting the Spark application.

Contributor commented:

It seems that our current implementation does not solve the problem of file growth. Will this issue be solved in subsequent PRs?

@pan3793 (Member Author) replied:

It adds rolling support for spark.kubernetes.file.upload.path, for example:

spark.kubernetes.file.upload.path=hdfs://hadoop-testing/spark-upload-{{YEAR}}{{MONTH}}
hdfs://hadoop-testing/spark-upload-202412
hdfs://hadoop-testing/spark-upload-202501

The admin can safely delete hdfs://hadoop-testing/spark-upload-202412 after 2025-01-01.

@pan3793 (Member Author) commented Dec 31, 2024

Spark creates a subdirectory s"spark-upload-${UUID.randomUUID()}" under spark.kubernetes.file.upload.path for each upload, so the administrator still needs to clean up the staging directory periodically.

Yes, exactly. Previously, it was unsafe to delete the whole spark.kubernetes.file.upload.path directory because in-flight Spark app submissions might still be using it.
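
With monthly rolling in place, periodic cleanup reduces to deleting fully-elapsed month directories. A sketch of such an admin job in Scala, assuming the spark-upload-yyyyMM layout described in this PR (UploadPathCleaner is a hypothetical name, not part of Kyuubi):

```scala
import java.time.YearMonth
import java.time.format.DateTimeFormatter

import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.fs.Path

// Hypothetical admin cleanup job, for illustration only.
object UploadPathCleaner {
  private val MonthFmt = DateTimeFormatter.ofPattern("yyyyMM")

  // Delete spark-upload-yyyyMM directories for months that have fully elapsed.
  def clean(baseDir: String, hadoopConf: Configuration): Unit = {
    val base = new Path(baseDir) // e.g. the parent dir of spark-upload-202412
    val fs = base.getFileSystem(hadoopConf)
    val current = YearMonth.now()
    fs.listStatus(base).filter(_.isDirectory).map(_.getPath).foreach { dir =>
      val suffix = dir.getName.stripPrefix("spark-upload-")
      if (dir.getName.startsWith("spark-upload-") && suffix.matches("\\d{6}") &&
        YearMonth.parse(suffix, MonthFmt).isBefore(current)) {
        fs.delete(dir, true) // recursive: removes all UUID subdirectories too
      }
    }
  }
}
```

Keeping the current month's directory untouched is what makes the deletion safe: only directories that no new submission can write into are removed.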

@turboFei (Member) left a comment:

LGTM, thanks

@zwangsheng (Contributor) left a comment:

Thanks, LGTM
