-
Notifications
You must be signed in to change notification settings - Fork 2.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
chore(ingest/s3) Bump Deequ and Pyspark version #8638
Conversation
@@ -259,13 +259,14 @@ def init_spark(self): | |||
import pydeequ | |||
|
|||
conf = SparkConf() | |||
|
|||
spark_version = os.getenv("SPARK_VERSION", "3.0") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
3.0? Don’t we want 3.3?
metadata-ingestion/setup.py
Outdated
"pydeequ>=1.0.1, <1.1", | ||
"pyspark==3.0.3", | ||
"pydeequ>=1.0.1", | ||
"pyspark==3.3.2", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It might still be good to keep this strictly pinned instead of >=
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Spark 3.3.3 has already been released. Would it be better to fix the version to the 3.3x branch. i.e. compatible release
"pyspark~=3.3.0", so can upgrade Spark with any future micro releases in 3.3?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
make sense, thanks
@treff7es looks like CI is failing on this one |
Closes #6852
Checklist