-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-53233][SQL][FOLLOWUP] Add compatibility class/object for org.apache.spark.sql.execution.streaming #52387
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
dongjoon-hyun
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you, @cloud-fan . Could you fix the compilation failure?
[error] /home/runner/work/spark/spark/sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/legacy.scala:30:36: type mismatch;
[error] found : org.apache.hadoop.conf.Configuration
[error] required: org.apache.spark.sql.SparkSession
[error] extends ActualHDFSMetadataLog[T](conf, path) {
anishshri-db
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for working on this !
HeartSaVioR
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1 assuming CI passes. Thanks for the work!
dongjoon-hyun
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1, LGTM.
|
thanks for the review, merging to master! |
|
late LGTM, thank you @cloud-fan |
…pache.spark.sql.execution.streaming ### What changes were proposed in this pull request? This is a followup of apache#51959 . Although internal APIs are allowed to be changed, it's still better to keep compatibility if possible to avoid breaking existing Spark plugins. This PR brings back `HDFSMetadataLog` and `SerializedOffset` to the original package, to avoid breaking the pulsar data source: https://github.com/streamnative/pulsar-spark/blob/master/src/main/scala/org/apache/spark/sql/pulsar/PulsarSources.scala#L27 ### Why are the changes needed? Avoid breaking Spark plugins ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? manual test ### Was this patch authored or co-authored using generative AI tooling? no Closes apache#52387 from cloud-fan/compat. Lead-authored-by: Wenchen Fan <wenchen@databricks.com> Co-authored-by: Wenchen Fan <cloud0fan@gmail.com> Signed-off-by: Wenchen Fan <wenchen@databricks.com>
What changes were proposed in this pull request?
This is a followup of #51959 . Although internal APIs are allowed to be changed, it's still better to keep compatibility if possible to avoid breaking existing Spark plugins.
This PR brings back
HDFSMetadataLogandSerializedOffsetto the original package, to avoid breaking the pulsar data source: https://github.com/streamnative/pulsar-spark/blob/master/src/main/scala/org/apache/spark/sql/pulsar/PulsarSources.scala#L27Why are the changes needed?
Avoid breaking Spark plugins
Does this PR introduce any user-facing change?
No
How was this patch tested?
manual test
Was this patch authored or co-authored using generative AI tooling?
no