[SPARK-49911][SQL] Fix semantic of support binary equality #1

Open
Wants to merge 251 commits into base: master

251 commits
ed3a9b1
[SPARK-49691][PYTHON][CONNECT] Function `substring` should accept col…
zhengruifeng Sep 18, 2024
fbf81eb
[SPARK-47263][SQL] Assign names to the legacy conditions _LEGACY_ERRO…
Sep 18, 2024
a6f6e07
[SPARK-48939][AVRO] Support reading Avro with recursive schema reference
eason-yuchen-liu Sep 18, 2024
25d6b7a
[SPARK-49692][PYTHON][CONNECT] Refine the string representation of li…
zhengruifeng Sep 18, 2024
669e63a
[SPARK-49673][CONNECT] Increase CONNECT_GRPC_ARROW_MAX_BATCH_SIZE to …
dillitz Sep 18, 2024
5c48806
[SPARK-49688][CONNECT][TESTS] Fix a sporadic `SparkConnectServiceSuit…
changgyoopark-db Sep 19, 2024
db8010b
[SPARK-49568][CONNECT][SQL] Remove self type from Dataset
hvanhovell Sep 19, 2024
3b34891
[SPARK-49684][CONNECT] Remove global locks from session and execution…
changgyoopark-db Sep 19, 2024
af45902
[SPARK-49422][CONNECT][SQL] Add groupByKey to sql/api
hvanhovell Sep 19, 2024
58d73fe
Revert "[SPARK-49495][DOCS][FOLLOWUP] Enable GitHub Pages settings vi…
HyukjinKwon Sep 19, 2024
3763827
Revert "[SPARK-49495][DOCS][FOLLOWUP] Fix Pandoc installation for Git…
HyukjinKwon Sep 19, 2024
8861f0f
Revert "[SPARK-49495][DOCS] Document and Feature Preview on the maste…
HyukjinKwon Sep 19, 2024
f3c8d26
Revert "[SPARK-49422][CONNECT][SQL] Add groupByKey to sql/api"
HyukjinKwon Sep 19, 2024
3bdf146
[SPARK-49611][SQL][FOLLOW-UP] Fix wrong results of collations() TVF
mihailom-db Sep 19, 2024
492d1b1
[SPARK-48782][SQL] Add support for executing procedures in catalogs
aokolnychyi Sep 19, 2024
ac34f1d
[SPARK-48280][SQL][FOLLOW-UP] Add expressions that are built via expr…
mihailom-db Sep 19, 2024
a060c23
[SPARK-49667][SQL] Disallowed CS_AI collators with expressions that u…
vladanvasi-db Sep 19, 2024
4068fbc
[SPARK-49717][SQL][TESTS] Function parity test ignore private[xxx] fu…
zhengruifeng Sep 19, 2024
398457a
[SPARK-49422][CONNECT][SQL] Add groupByKey to sql/api
hvanhovell Sep 19, 2024
94dca78
[SPARK-49693][PYTHON][CONNECT] Refine the string representation of `t…
zhengruifeng Sep 19, 2024
f0fb0c8
[SPARK-49719][SQL] Make `UUID` and `SHUFFLE` accept integer `seed`
zhengruifeng Sep 19, 2024
92cad2a
[SPARK-49716][PS][DOCS][TESTS] Fix documentation and add test of barh…
xinrong-meng Sep 19, 2024
6d1815e
[SPARK-49718][PS] Switch `Scatter` plot to sampled data
zhengruifeng Sep 19, 2024
0445579
[SPARK-49720][PYTHON][INFRA] Add a script to clean up PySpark temp files
zhengruifeng Sep 19, 2024
ca726c1
[SPARK-49721][BUILD] Upgrade `protobuf-java` to 3.25.5
dongjoon-hyun Sep 20, 2024
a5ac80a
[SPARK-49713][PYTHON][CONNECT] Make function `count_min_sketch` accep…
zhengruifeng Sep 20, 2024
d4665fa
[SPARK-49677][SS] Ensure that changelog files are written on commit a…
anishshri-db Sep 20, 2024
6352c12
[MINOR][INFRA] Disable 'pages build and deployment' action
yaooqinn Sep 20, 2024
c009cd0
[SPARK-49392][SQL][FOLLOWUP] Catch errors when failing to write to ex…
uros-db Sep 20, 2024
b37863d
[MINOR][FOLLOWUP] Fix rat check for .nojekyll
yaooqinn Sep 20, 2024
46b0210
[SPARK-49699][SS] Disable PruneFilters for streaming workloads
n-young-db Sep 20, 2024
4d97574
[SPARK-49733][SQL][DOCS] Delete `ExpressionInfo[between]` from `gen-s…
panbingkun Sep 20, 2024
bb8294c
[SPARK-49729][SQL][DOCS] Forcefully check `usage` and correct the non…
panbingkun Sep 20, 2024
3d8c078
[SPARK-49704][BUILD] Upgrade `commons-io` to 2.17.0
panbingkun Sep 20, 2024
22a7edc
[SPARK-49531][PYTHON][CONNECT] Support line plot with plotly backend
xinrong-meng Sep 20, 2024
f3785fa
[SPARK-49737][SQL] Disable bucketing on collated columns in complex t…
stefankandic Sep 20, 2024
f76a9b1
[SPARK-49738][SQL] Endswith bug fix
viktorluc-db Sep 20, 2024
bdea091
[SPARK-49557][SQL] Add SQL pipe syntax for the WHERE operator
dtenedor Sep 20, 2024
70bd606
[SPARK-49641][DOCS] Include `table_funcs` and `variant_funcs` in the …
panbingkun Sep 21, 2024
f235bab
[SPARK-49451][FOLLOW-UP] Add support for duplicate keys in from_json(…
harshmotw-db Sep 21, 2024
fc8b945
[SPARK-49460][SQL] Followup: fix potential NPE risk
liuzqt Sep 21, 2024
0b05b1a
[SPARK-48782][SQL][TESTS][FOLLOW-UP] Enable ANSI for malformed input …
HyukjinKwon Sep 21, 2024
bbbc05c
[SPARK-49495][DOCS] Document and Feature Preview on the master branch…
yaooqinn Sep 21, 2024
1990646
Revert "[SPARK-49495][DOCS] Document and Feature Preview on the maste…
yaooqinn Sep 21, 2024
4f640e2
[SPARK-49495][DOCS] Document and Feature Preview on the master branch…
yaooqinn Sep 21, 2024
b642096
[SPARK-49741][DOCS] Add `spark.shuffle.accurateBlockSkewedFactor` to …
timlee0119 Sep 22, 2024
067f8f1
[SPARK-48355][SQL][TESTS][FOLLOWUP] Enable a SQL Scripting test in AN…
MaxGekk Sep 22, 2024
719b57a
[SPARK-49752][YARN] Remove workaround for YARN-3350
pan3793 Sep 22, 2024
0eeb61f
[SPARK-49734][PYTHON] Add `seed` argument for function `shuffle`
zhengruifeng Sep 23, 2024
3c81f07
[SPARK-49653][SQL] Single join for correlated scalar subqueries
agubichev Sep 23, 2024
d2e8c1c
[SPARK-48195][CORE] Save and reuse RDD/Broadcast created by SparkPlan
Sep 23, 2024
44ec70f
[SPARK-49626][PYTHON][CONNECT] Support horizontal and vertical bar plots
xinrong-meng Sep 23, 2024
e1637e3
[SPARK-48712][SQL][FOLLOWUP] Check whether input is valid utf-8 strin…
yaooqinn Sep 23, 2024
fec1562
[SPARK-49755][CONNECT] Remove special casing for avro functions in Co…
zhengruifeng Sep 23, 2024
3b5c1d6
[SPARK-49747][CONNECT] Migrate connect/ files to structured logging
nemanja-boric-databricks Sep 23, 2024
1086256
[SPARK-49415][CONNECT][SQL] Move SQLImplicits to sql/api
hvanhovell Sep 23, 2024
94d288e
[MINOR][PYTHON][DOCS] Fix the docstring of `to_timestamp`
zhengruifeng Sep 23, 2024
742265e
[SPARK-49429][CONNECT][SQL] Add Shared DataStreamWriter interface
hvanhovell Sep 24, 2024
35e5d29
[SPARK-49760][YARN] Correct handling of `SPARK_USER` env variable ove…
cnauroth Sep 24, 2024
64ea50e
[SPARK-49607][PYTHON] Update the sampling approach for sampled based …
zhengruifeng Sep 24, 2024
438a6e7
[SPARK-49753][BUILD] Upgrade ZSTD-JNI to 1.5.6-6
LuciferYang Sep 24, 2024
6bdd151
[SPARK-49694][PYTHON][CONNECT] Support scatter plots
xinrong-meng Sep 24, 2024
982028e
[SPARK-49609][PYTHON][CONNECT] Add API compatibility check between Cl…
itholic Sep 24, 2024
73d6bd7
[SPARK-49630][SS] Add flatten option to process collection types with…
anishshri-db Sep 24, 2024
dedf5aa
[SPARK-49750][DOC] Mention delegation token support in K8s mode
pan3793 Sep 24, 2024
55d0233
[SPARK-49713][PYTHON][FOLLOWUP] Make function `count_min_sketch` acce…
zhengruifeng Sep 24, 2024
afe8bf9
[SPARK-49771][PYTHON] Improve Pandas Scalar Iter UDF error when outpu…
allisonwang-db Sep 24, 2024
0a7b985
[SPARK-49585][CONNECT] Replace executions map in SessionHolder with o…
changgyoopark-db Sep 24, 2024
29ed272
[SPARK-49688][CONNECT] Fix a data race between interrupt and execute …
changgyoopark-db Sep 24, 2024
5fb0ff9
[SPARK-49282][CONNECT][SQL] Create a shared SparkSessionBuilder inter…
hvanhovell Sep 25, 2024
0c234bb
[SPARK-49369][CONNECT][SQL] Add implicit Column conversions
hvanhovell Sep 25, 2024
828b1f9
[SPARK-49463] Support ListState for TransformWithStateInPandas
bogao007 Sep 25, 2024
a4fb6cb
[SPARK-49743][SQL] OptimizeCsvJsonExpr should not change schema field…
nikhilsheoran-db Sep 25, 2024
e2d2ab5
[SPARK-49552][PYTHON] Add DataFrame API support for new 'randstr' and…
dtenedor Sep 25, 2024
9aa11d1
[SPARK-49772][SS] Remove ColumnFamilyOptions and add configs directly…
liviazhu-db Sep 25, 2024
5134c68
[SPARK-49765][DOCS][PYTHON] Adjust documentation of "spark.sql.pyspar…
xinrong-meng Sep 25, 2024
46c5acc
[SPARK-49609][PYTHON][TESTS][FOLLOW-UP] Skip Spark Connect tests if d…
HyukjinKwon Sep 25, 2024
7f0ecd4
[SPARK-49764][PYTHON][CONNECT] Support area plots
xinrong-meng Sep 25, 2024
e1b2ac5
[SPARK-49767][PS][CONNECT] Refactor the internal function invocation
zhengruifeng Sep 25, 2024
c362d50
[SPARK-49775][SQL][TESTS] Make tests of `INVALID_PARAMETER_VALUE.CHAR…
zhengruifeng Sep 25, 2024
0ccf53a
[SPARK-49609][PYTHON][FOLLOWUP] Correct the typehint for `filter` and…
zhengruifeng Sep 25, 2024
d230232
[SPARK-49745][SS] Add change to read registered timers through state …
anishshri-db Sep 25, 2024
983f6f4
[SPARK-49746][BUILD] Upgrade Scala to 2.13.15
panbingkun Sep 25, 2024
1f2e7b8
[SPARK-49731][K8S] Support K8s volume `mount.subPathExpr` and `hostPa…
EnricoMi Sep 25, 2024
09209f0
[SPARK-49775][SQL][FOLLOW-UP] Use SortedSet instead of Array with sor…
HyukjinKwon Sep 25, 2024
80d6651
[SPARK-48195][FOLLOWUP] Accumulator reset() no longer needed in Colle…
Sep 25, 2024
c0984e7
[SPARK-49609][PYTHON][TESTS][FOLLOW-UP] Avoid import connect modules …
HyukjinKwon Sep 26, 2024
5629779
[SPARK-49786][K8S] Lower `KubernetesClusterSchedulerBackend.onDisconn…
dongjoon-hyun Sep 26, 2024
913a0f7
[SPARK-49784][PYTHON][TESTS] Add more test for `spark.sql`
zhengruifeng Sep 26, 2024
fe1cf32
[SPARK-49656][SS] Add support for state variables with value state co…
anishshri-db Sep 26, 2024
a116a5b
[SPARK-49416][CONNECT][SQL] Add Shared DataStreamReader interface
hvanhovell Sep 26, 2024
7b20e58
[SPARK-49286][CONNECT][SQL] Move Avro/Protobuf functions to sql/api
hvanhovell Sep 26, 2024
218051a
[MINOR][SQL][TESTS] Use `formatString.format(value)` instead of `valu…
panbingkun Sep 26, 2024
87b5ffb
[SPARK-49797][INFRA] Align the running OS image of `maven_test.yml` t…
panbingkun Sep 26, 2024
624eda5
[SPARK-49444][SQL] Modified UnivocityParser to throw runtime exceptio…
vladanvasi-db Sep 26, 2024
54e62a1
[SPARK-49800][BUILD][K8S] Upgrade `kubernetes-client` to 6.13.4
bjornjorgensen Sep 26, 2024
339dd5b
[SPARK-49791][SQL] Make DelegatingCatalogExtension more extendable
cloud-fan Sep 26, 2024
fc9d421
[SPARK-49211][SQL][FOLLOW-UP] Support catalog in QualifiedTableName
amaliujia Sep 27, 2024
09b7aa6
[SPARK-49803][SQL][TESTS] Increase `spark.test.docker.connectionTimeo…
dongjoon-hyun Sep 27, 2024
488c3f6
[SPARK-49776][PYTHON][CONNECT] Support pie plots
xinrong-meng Sep 27, 2024
27d4a77
[SPARK-49801][PYTHON][PS][BUILD] Update `pandas` to 2.2.3
bjornjorgensen Sep 27, 2024
5d701f2
[SPARK-49804][K8S] Fix to use the exit code of executor container always
Sep 27, 2024
f18c4e7
[SPARK-49805][SQL][ML] Remove private[xxx] functions from `function.s…
zhengruifeng Sep 27, 2024
9b739d4
[SPARK-49757][SQL] Support IDENTIFIER expression in SET CATALOG state…
mikhailnik-db Sep 27, 2024
d7abddc
[SPARK-49808][SQL] Fix a deadlock in subquery execution due to lazy vals
zhengruifeng Sep 27, 2024
dd692e9
[SPARK-49801][FOLLOWUP][INFRA] Update `pandas` to 2.2.3 in `pages.yml…
bjornjorgensen Sep 27, 2024
6dc628c
[SPARK-49809][BUILD] Use `sbt.IO` in `SparkBuild.scala` to avoid nami…
LuciferYang Sep 27, 2024
b6681fb
[SPARK-49787][SQL] Cast between UDT and other types
viirya Sep 27, 2024
4d70954
[SPARK-49817][BUILD] Upgrade `gcs-connector` to `2.2.25`
dongjoon-hyun Sep 27, 2024
d813f54
[SPARK-49417][CONNECT][SQL] Add Shared StreamingQueryManager interface
hvanhovell Sep 27, 2024
0c19059
[SPARK-49820][PYTHON] Change `raise IOError` to `raise OSError`
bjornjorgensen Sep 27, 2024
f9a2077
[SPARK-49810][PYTHON] Extract the preparation of `DataFrame.sort` to …
zhengruifeng Sep 28, 2024
4c12c78
[SPARK-49814][CONNECT] When Spark Connect Client starts, show the `sp…
panbingkun Sep 28, 2024
550c207
[SPARK-49822][SQL][TESTS] Update postgres docker image to 17.0
panbingkun Sep 28, 2024
47d2c9c
[SPARK-49712][SQL] Remove encoderFor from connect-client-jvm
hvanhovell Sep 28, 2024
8dfecc1
[SPARK-49434][SPARK-49435][CONNECT][SQL] Move aggregators to sql/api
hvanhovell Sep 28, 2024
039fd13
[SPARK-49749][CORE] Change log level to debug in BlockManagerInfo
wangyum Sep 29, 2024
885c3fa
[SPARK-49823][SS] Avoid flush during shutdown in rocksdb close path
anishshri-db Sep 30, 2024
d85e7bc
[SPARK-49811][SQL] Rename StringTypeAnyCollation
jovanpavl-db Sep 30, 2024
c54c017
[SPARK-49666][SQL] Add feature flag for trim collation feature
jovanpavl-db Sep 30, 2024
97ae372
[SPARK-49819] Disable CollapseProject for correlated subqueries in pr…
n-young-db Sep 30, 2024
dbfa909
[SPARK-49816][SQL] Should only update out-going-ref-count for referen…
cloud-fan Sep 30, 2024
3065dd9
[SPARK-49561][SQL] Add SQL pipe syntax for the PIVOT and UNPIVOT oper…
dtenedor Sep 30, 2024
a7fa270
[SPARK-48196][SQL] Turn QueryExecution lazy val plans into LazyTry
Sep 30, 2024
d68048b
[SPARK-49833][K8S] Support user-defined annotations for OnDemand PVCs
prathit06 Sep 30, 2024
1233611
[SPARK-49732][CORE][K8S] Spark deamons should respect `spark.log.stru…
pan3793 Sep 30, 2024
da106f8
[SPARK-49840][INFRA] Use `MacOS 15` in `build_maven_java21_macos14.yml`
dongjoon-hyun Oct 1, 2024
8d0f6fb
[SPARK-49826][BUILD] Upgrade jackson to 2.18.0
panbingkun Oct 1, 2024
c0a1ea2
[SPARK-49795][CORE][SQL][SS][DSTREAM][ML][MLLIB][K8S][YARN][EXAMPLES]…
LuciferYang Oct 1, 2024
97e9bb3
[SPARK-48700][SQL] Mode expression for complex types (all collations)
GideonPotok Oct 1, 2024
3093ad6
[MINOR] Fix a typo in First aggregate expression
exmy Oct 1, 2024
3551a9e
[SPARK-49845][CORE] Make `appArgs` and `environmentVariables` optiona…
dongjoon-hyun Oct 2, 2024
077a319
[SPARK-49843][SQL] Fix change comment on char/varchar columns
stefankandic Oct 2, 2024
18dbaa5
[SPARK-49560][SQL] Add SQL pipe syntax for the TABLESAMPLE operator
dtenedor Oct 2, 2024
d97acc1
[SPARK-49853][SQL][TESTS] Increase test timeout of `PythonForeachWrit…
dongjoon-hyun Oct 2, 2024
ce57626
[SPARK-49837][SQL][TESTS] Add more tests for NULLIF function
ivanjevtic-db Oct 3, 2024
216f761
[SPARK-48357][SQL] Support for LOOP statement
dusantism-db Oct 3, 2024
c1ecab4
[SPARK-49541][BUILD] Upgrade log4j2 to 2.24.1
panbingkun Oct 3, 2024
036db74
[SPARK-47341][SQL] Fix inaccurate documentation of RuntimeConfig#get
xi-db Oct 3, 2024
38f067d
[SPARK-49358][SQL] Mode expression for map types with collated strings
uros-db Oct 3, 2024
68fd17d
[SPARK-49861][INFRA] Add `Python 3.13` to Infra docker image
dongjoon-hyun Oct 3, 2024
901bb33
[SPARK-49860][PYTHON][INFRA] Add `Python 3.13` Daily Python Github Ac…
dongjoon-hyun Oct 3, 2024
b9a3274
[SPARK-46647][INFRA] Add `unittest-xml-reporting` into Python 3.12 image
dongjoon-hyun Oct 3, 2024
29312bc
[SPARK-49824][SS][CONNECT] Improve logging in SparkConnectStreamingQu…
WweiL Oct 3, 2024
96666d4
[SPARK-49859][CONNECT] Replace multiprocessing.ThreadPool with Thread…
nemanja-boric-databricks Oct 4, 2024
a38505c
[SPARK-49869][INFRA] Add NumPy in Python 3.13 image
HyukjinKwon Oct 4, 2024
98da5e1
[MINOR][PYTHON][TESTS] Skip test_artifact if grpc isn't installed
HyukjinKwon Oct 4, 2024
de9b9c8
[SPARK-49870][PYTHON] Add Python 3.13 support in Spark Classic
HyukjinKwon Oct 4, 2024
bd3e2eb
[SPARK-49751][CONNECT] Fix deserialization of SparkListenerConnectSer…
pan3793 Oct 4, 2024
3dfedf6
[SPARK-49842][BUILD] Add `byte-buddy` dependency for modules that dep…
LuciferYang Oct 4, 2024
e1dbeb4
[SPARK-49871][CORE][TESTS] Fix `(Ssl)CoarseGrainedExecutorBackendSuit…
dongjoon-hyun Oct 4, 2024
58f0f3b
[SPARK-49668][SQL] Implement collation key support for trim collation
jovanpavl-db Oct 4, 2024
780759b
[SPARK-48357][SQL][FOLLOW-UP] Add doc comment for LOOP statement exec…
dusantism-db Oct 4, 2024
e59483c
[SPARK-49868][K8S] Fix race condition in executor failure tracking
attilapiros Oct 4, 2024
8a51ca7
[SPARK-47702][CORE] Remove Shuffle service endpoint from the location…
Oct 4, 2024
f694ea8
[SPARK-49867][SQL] Improve the error message when index is out of bou…
mihailoale-db Oct 4, 2024
13e5504
[SPARK-49874][SQL] Remove trim and ltrim collation specifiers
jovanpavl-db Oct 4, 2024
4cf9d14
[SPARK-49806][SQL][CONNECT] Remove redundant `blank space` after `sho…
panbingkun Oct 4, 2024
fcda935
[SPARK-49864][SQL] Improve message of BINARY_ARITHMETIC_OVERFLOW
mihailom-db Oct 4, 2024
0c653db
[SPARK-49879][CORE] Move `TransportCipherUtil` to a separate file to …
LuciferYang Oct 4, 2024
d8c04cf
[SPARK-49836][SQL][SS] Fix possibly broken query when window is provi…
HeartSaVioR Oct 4, 2024
3e69b40
[SPARK-49683][SQL] Block trim collation
jovanpavl-db Oct 5, 2024
37f2966
[SPARK-49866][SQL] Improve the error message for describe table with …
mihailoale-db Oct 5, 2024
06c70ba
[MINOR][SQL] Move iterator.hasNext into try block in executeTask
viirya Oct 6, 2024
06a8f6b
[SPARK-49744][SS][PYTHON] Implement TTL support for ListState in Tran…
bogao007 Oct 6, 2024
9ce0564
[SPARK-49773][SQL] Uncaught Java exception from `make_timestamp()` wi…
itholic Oct 7, 2024
a177ed1
Revert "[SPARK-49808][SQL] Fix a deadlock in subquery execution due o…
zhengruifeng Oct 7, 2024
be546ff
[SPARK-49890][PYTHON] Extract the preparation of df.sample to parent …
zhengruifeng Oct 7, 2024
5132ab1
[SPARK-49877][SQL] Change classifyException function signature: add i…
ivanjevtic-db Oct 7, 2024
9b0e196
initial support for hashing and comparison.
jovanpavl-db Oct 7, 2024
9f4e104
deprecate uses trim collation.
jovanpavl-db Oct 7, 2024
724e1bf
[SPARK-49896][K8S][DOCS] Drop K8s v1.28 Support
dongjoon-hyun Oct 7, 2024
d8aca18
[SPARK-49852][SQL] Fix deadlock caused by explain string generation
liuzqt Oct 8, 2024
c6b09c0
[SPARK-49894][PYTHON][CONNECT] Refine the string representation of co…
zhengruifeng Oct 8, 2024
51af177
[SPARK-49564][SQL] Add SQL pipe syntax for the JOIN operator
dtenedor Oct 8, 2024
78135dc
[SPARK-49889][PYTHON] Add argument `trim` for functions`trim/ltrim/rt…
zhengruifeng Oct 8, 2024
ef142c4
[SPARK-49898][CORE] Fix documentation and default for event log task …
JoshRosen Oct 8, 2024
2038975
add tests.
jovanpavl-db Oct 8, 2024
c513451
add more tests.
jovanpavl-db Oct 8, 2024
2f6705c
nit fix.
jovanpavl-db Oct 8, 2024
83f732c
[SPARK-49903][BUILD] Upgrade `Parquet` to 1.14.3
panbingkun Oct 8, 2024
92e79e3
[SPARK-49901][BUILD] Upgrade dropwizard metrics to 4.2.28
panbingkun Oct 8, 2024
345a2be
[SPARK-49906][SQL] Introduce and use CONFLICTING_DIRECTORY_STRUCTURES…
vladimirg-db Oct 9, 2024
80ae411
[SPARK-49569][CONNECT][SQL] Add shims to support SparkContext and RDD
hvanhovell Oct 9, 2024
5f64e80
[SPARK-49895][SQL] Improve error when encountering trailing comma in …
stefankandic Oct 9, 2024
c1f18a0
[SPARK-49022][CONNECT][SQL][FOLLOW-UP] Parse unresolved identifier to…
HyukjinKwon Oct 9, 2024
5e27eec
[SPARK-49863][SQL] Fix NormalizeFloatingNumbers to preserve nullabili…
nikhilsheoran-db Oct 9, 2024
135cbc6
[SPARK-49564][SQL] Add SQL pipe syntax for set operations
dtenedor Oct 9, 2024
52538f0
[SPARK-49909][SQL] Fix the pretty name of some expressions
panbingkun Oct 9, 2024
6cdcf5b
fix scala style.
jovanpavl-db Oct 9, 2024
b565a8d
[SPARK-49418][CONNECT][SQL] Shared Session Thread Locals
hvanhovell Oct 9, 2024
b1ff767
[SPARK-49857][SQL] Add storageLevel to Dataset localCheckpoint API
Oct 9, 2024
97a5aa6
[SPARK-49873][SQL] Assign proper error class for _LEGACY_ERROR_TEMP_1325
itholic Oct 9, 2024
fef3a71
fix bug.
jovanpavl-db Oct 9, 2024
fed9a8d
[SPARK-49569][BUILD][FOLLOWUP] Adds `scala-library` maven dependency …
LuciferYang Oct 9, 2024
f69d03e
[SPARK-43838][SQL][FOLLOWUP] Replace `HashSet` with `HashMap` to impr…
mihailotim-db Oct 9, 2024
6ed4bdf
[MINOR][PYTHON][TESTS] Reduce the python worker error log of `test_to…
zhengruifeng Oct 9, 2024
7e82e29
[SPARK-49905][SQL][SS] Use different ShuffleOrigin for the shuffle re…
HeartSaVioR Oct 10, 2024
ea60e93
[SPARK-49918][CORE] Use read-only access to conf in `SparkContext` wh…
pmenon Oct 10, 2024
0912217
[SPARK-48714][PYTHON][FOLLOW-UP] Skip tests if test class is not avai…
HyukjinKwon Oct 10, 2024
d7772f2
[SPARK-49782][SQL] ResolveDataFrameDropColumns rule resolves Unresolv…
LantaoJin Oct 10, 2024
c9c33d9
fix style.
jovanpavl-db Oct 10, 2024
f0498f0
[SPARK-49549][SQL] Assign a name to the error conditions _LEGACY_ERRO…
mrk-andreev Oct 10, 2024
f003638
[SPARK-49542][SQL] Partition transform exception evaluate error
dusantism-db Oct 10, 2024
38d66fd
[SPARK-49908][SQL] Assign proper error condition for _LEGACY_ERROR_TE…
itholic Oct 10, 2024
e693af0
[SPARK-49748][CORE] Add `getCondition` and deprecate `getErrorClass` …
MaxGekk Oct 10, 2024
ab1315b
[SPARK-49756][SQL] Postgres dialect supports pushdown datetime functions
beliefer Oct 10, 2024
e589ccd
[SPARK-49920][INFRA] Install `R` for `ubuntu 24.04` when GA run `k8s-…
panbingkun Oct 10, 2024
b056e0b
[SPARK-49569][BUILD][FOLLOWUP] Exclude `spark-connect-shims` from `sq…
LuciferYang Oct 10, 2024
2af6536
[SPARK-49927][SS] pyspark.sql.tests.streaming.test_streaming_listener…
siying Oct 11, 2024
4666972
[SPARK-48567][PYTHON][TESTS][FOLLOW-UP] Make the query scope higher s…
HyukjinKwon Oct 11, 2024
cacd261
[SPARK-49927][SS][PYTHON][TESTS][FOLLOW-UP] Fixes `q.lastProgress.bat…
HyukjinKwon Oct 11, 2024
c0d396f
Revert "[SPARK-49927][SS][PYTHON][TESTS][FOLLOW-UP] Fixes `q.lastProg…
HyukjinKwon Oct 11, 2024
b93281e
Revert "[SPARK-48567][PYTHON][TESTS][FOLLOW-UP] Make the query scope …
HyukjinKwon Oct 11, 2024
ed0a63f
Revert "[SPARK-49927][SS] pyspark.sql.tests.streaming.test_streaming_…
HyukjinKwon Oct 11, 2024
5104d1d
[SPARK-49915][SQL] Handle zeros and ones in ReorderAssociativeOperator
yaooqinn Oct 11, 2024
8e1d317
[SPARK-49615] Bugfix: Make ML column schema validation conforms with …
WeichenXu123 Oct 11, 2024
6d0b838
[SPARK-49748][CORE][FOLLOWUP] Add `getCondition` and deprecate `getEr…
panbingkun Oct 11, 2024
c79e2d6
[SPARK-49925][SQL] Add tests for order by with collated strings
ilicmarkodb Oct 11, 2024
04ec55e
[MINOR][PYTHON] Minor refine `LiteralExpression`
zhengruifeng Oct 11, 2024
3ecfe8e
[SPARK-49930][SS] Ensure that socket updates are flushed on exception…
anishshri-db Oct 12, 2024
cf657e5
[SPARK-49927][SS][PYTHON][TESTS] pyspark.sql.tests.streaming.test_str…
siying Oct 12, 2024
1fb3d57
[SPARK-49935][BUILD] Exclude `spark-connect-shims` from `assembly` mo…
LuciferYang Oct 12, 2024
6734d48
[SPARK-49932][CORE] Use `tryWithResource` release `JsonUtils#toJsonSt…
panbingkun Oct 12, 2024
1244c5a
[SPARK-49766][SQL] Codegen Support for `json_array_length` (by `Invok…
panbingkun Oct 12, 2024
ed4847f
[SPARK-49937][INFRA] Ban call the method `SparkThrowable#getErrorClass`
panbingkun Oct 12, 2024
62ade5f
[SPARK-49924][SQL] Keep `containsNull` after `ArrayCompact` replacement
zhengruifeng Oct 12, 2024
083f44d
[MINOR][SQL] Improved broadcast timeout message prompt
huangxiaopingRD Oct 13, 2024
54fd408
[SPARK-49939][SQL] Codegen Support for json_object_keys (by Invoke & …
panbingkun Oct 13, 2024
1abfd49
[SPARK-49943][PS] Remove `timestamp_ntz_to_long` from `PythonSQLUtils`
zhengruifeng Oct 13, 2024
1aae160
[SPARK-49928][PYTHON][TESTS] Refactor plot-related unit tests
xinrong-meng Oct 14, 2024
36b2a4e
[SPARK-49891][SQL] Assign proper error condition for _LEGACY_ERROR_TE…
itholic Oct 14, 2024
5b9b8da
[SPARK-49904][SQL] Assign proper error condition for _LEGACY_ERROR_TE…
itholic Oct 14, 2024
a2ad4d4
[MINOR][CORE] Fix the regenerate command in `SparkThrowableSuite`
MaxGekk Oct 14, 2024
560748c
[SPARK-49892][SQL] Assign proper error class for _LEGACY_ERROR_TEMP_1136
itholic Oct 14, 2024
eeb044e
[SPARK-49949][PS] Avoid unnecessary analyze task in `attach_sequence_…
zhengruifeng Oct 14, 2024
d77b293
add more tests.
jovanpavl-db Oct 14, 2024
7e78177
nit fixes.
jovanpavl-db Oct 14, 2024
af27d43
nit fixes.
jovanpavl-db Oct 14, 2024
d53fa23
[SPARK-49952][SQL] Assign proper error condition for _LEGACY_ERROR_TE…
itholic Oct 14, 2024
0606512
[SPARK-49864][SQL][FOLLOW-UP] Fix default suggestion for binary arith…
mihailom-db Oct 14, 2024
96c4953
[SPARK-49955][SQL] null value does not mean corrupted file when parsi…
cloud-fan Oct 14, 2024
1d73ad6
init commit.
jovanpavl-db Oct 14, 2024
74aed77
[SPARK-49661][SQL] Implement trim collation hashing and comparison
jovanpavl-db Oct 14, 2024
488f680
[SPARK-49929][PYTHON][CONNECT] Support box plots
xinrong-meng Oct 15, 2024
217e0da
[SPARK-49965][BUILD] Upgrade ASM to 9.7.1
panbingkun Oct 15, 2024
c3176a7
[SPARK-49451][SQL][FOLLOW-UP] Improve duplicate key exception test
harshmotw-db Oct 15, 2024
6078e51
Merge branch 'master' of https://github.com/apache/spark into fix-sem…
jovanpavl-db Oct 15, 2024
5715d5c
nit fixes.
jovanpavl-db Oct 15, 2024
8ead603
revert unnecessary changes.
jovanpavl-db Oct 15, 2024
2 changes: 2 additions & 0 deletions .asf.yaml
@@ -31,6 +31,8 @@ github:
merge: false
squash: true
rebase: true
+ ghp_branch: master
+ ghp_path: /docs

notifications:
pullrequests: reviews@spark.apache.org
4 changes: 4 additions & 0 deletions .github/workflows/build_and_test.yml
@@ -1112,6 +1112,10 @@ jobs:
with:
distribution: zulu
java-version: ${{ inputs.java }}
+ - name: Install R
+ run: |
+ sudo apt update
+ sudo apt-get install r-base
- name: Start Minikube
uses: medyagh/setup-minikube@v0.0.18
with:

@@ -17,7 +17,7 @@
# under the License.
#

- name: "Build / Maven (master, Scala 2.13, Hadoop 3, JDK 21, macos-14)"
+ name: "Build / Maven (master, Scala 2.13, Hadoop 3, JDK 21, MacOS-15)"

on:
schedule:
@@ -32,7 +32,7 @@ jobs:
if: github.repository == 'apache/spark'
with:
java: 21
- os: macos-14
+ os: macos-15
envs: >-
{
"OBJC_DISABLE_INITIALIZE_FORK_SAFETY": "YES"
45 changes: 45 additions & 0 deletions .github/workflows/build_python_3.13.yml
@@ -0,0 +1,45 @@
#
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.
#

name: "Build / Python-only (master, Python 3.13)"

on:
schedule:
- cron: '0 20 * * *'

jobs:
run-build:
permissions:
packages: write
name: Run
uses: ./.github/workflows/build_and_test.yml
if: github.repository == 'apache/spark'
with:
java: 17
branch: master
hadoop: hadoop3
envs: >-
{
"PYTHON_TO_TEST": "python3.13"
}
jobs: >-
{
"pyspark": "true",
"pyspark-pandas": "true"
}
2 changes: 1 addition & 1 deletion .github/workflows/build_python_connect.yml
@@ -71,7 +71,7 @@ jobs:
python packaging/connect/setup.py sdist
cd dist
pip install pyspark*connect-*.tar.gz
- pip install 'six==1.16.0' 'pandas<=2.2.2' scipy 'plotly>=4.8' 'mlflow>=2.8.1' coverage matplotlib openpyxl 'memory-profiler>=0.61.0' 'scikit-learn>=1.3.2' 'graphviz==0.20.3' torch torchvision torcheval deepspeed unittest-xml-reporting
+ pip install 'six==1.16.0' 'pandas<=2.2.2' scipy 'plotly>=4.8' 'mlflow>=2.8.1' coverage matplotlib openpyxl 'memory-profiler>=0.61.0' 'scikit-learn>=1.3.2' 'graphviz==0.20.3' torch torchvision torcheval deepspeed unittest-xml-reporting 'plotly>=4.8'
- name: Run tests
env:
SPARK_TESTING: 1
2 changes: 1 addition & 1 deletion .github/workflows/maven_test.yml
@@ -40,7 +40,7 @@ on:
description: OS to run this build.
required: false
type: string
- default: ubuntu-22.04
+ default: ubuntu-latest
envs:
description: Additional environment variables to set when running the tests. Should be in JSON format.
required: false
11 changes: 9 additions & 2 deletions .github/workflows/pages.yml
@@ -26,7 +26,7 @@ on:

concurrency:
group: 'docs preview'
- cancel-in-progress: true
+ cancel-in-progress: false

jobs:
docs:
@@ -35,6 +35,8 @@ jobs:
permissions:
id-token: write
pages: write
+ environment:
+ name: github-pages # https://github.com/actions/deploy-pages/issues/271
env:
SPARK_TESTING: 1 # Reduce some noise in the logs
RELEASE_VERSION: 'In-Progress'
@@ -56,7 +58,12 @@ jobs:
architecture: x64
cache: 'pip'
- name: Install Python dependencies
- run: pip install --upgrade -r dev/requirements.txt
+ run: |
+ pip install 'sphinx==4.5.0' mkdocs 'pydata_sphinx_theme>=0.13' sphinx-copybutton nbsphinx numpydoc jinja2 markupsafe 'pyzmq<24.0.0' \
+ ipython ipython_genutils sphinx_plotly_directive 'numpy>=1.20.0' pyarrow 'pandas==2.2.3' 'plotly>=4.8' 'docutils<0.18.0' \
+ 'flake8==3.9.0' 'mypy==1.8.0' 'pytest==7.1.3' 'pytest-mypy-plugins==1.9.3' 'black==23.9.1' \
+ 'pandas-stubs==1.2.0.53' 'grpcio==1.62.0' 'grpcio-status==1.62.0' 'grpc-stubs==1.24.11' 'googleapis-common-protos-stubs==2.2.0' \
+ 'sphinxcontrib-applehelp==1.0.4' 'sphinxcontrib-devhelp==1.0.2' 'sphinxcontrib-htmlhelp==2.0.1' 'sphinxcontrib-qthelp==1.0.3' 'sphinxcontrib-serializinghtml==1.1.5'
- name: Install Ruby for documentation generation
uses: ruby/setup-ruby@v1
with:
Empty file added .nojekyll
6 changes: 6 additions & 0 deletions assembly/pom.xml
@@ -117,6 +117,12 @@
<groupId>org.apache.spark</groupId>
<artifactId>spark-connect-client-jvm_${scala.binary.version}</artifactId>
<version>${project.version}</version>
+ <exclusions>
+ <exclusion>
+ <groupId>org.apache.spark</groupId>
+ <artifactId>spark-connect-shims_${scala.binary.version}</artifactId>
+ </exclusion>
+ </exclusions>
<scope>provided</scope>
</dependency>


@@ -255,7 +255,8 @@ public Iterator<T> iterator() {
iteratorTracker.add(new WeakReference<>(it));
return it;
} catch (Exception e) {
- throw Throwables.propagate(e);
+ Throwables.throwIfUnchecked(e);
+ throw new RuntimeException(e);
}
}
};
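
The hunk above swaps Guava's deprecated `Throwables.propagate(e)` for `Throwables.throwIfUnchecked(e)` followed by `throw new RuntimeException(e)`. A minimal sketch of that pattern in plain Java follows. It assumes a hypothetical local `throwIfUnchecked` that stands in for the Guava method (so the example has no external dependency) and a hypothetical `rethrow` helper; neither name comes from this PR.

```java
import java.io.IOException;

public class PropagateSketch {
    // Hypothetical stand-in for Guava's Throwables.throwIfUnchecked:
    // rethrows the argument unchanged when it is a RuntimeException or an Error,
    // and returns normally for checked exceptions.
    public static void throwIfUnchecked(Throwable t) {
        if (t instanceof RuntimeException) {
            throw (RuntimeException) t;
        }
        if (t instanceof Error) {
            throw (Error) t;
        }
    }

    // Hypothetical helper mirroring the two-line replacement in the diff:
    // unchecked exceptions pass through as-is, checked ones are wrapped.
    public static RuntimeException rethrow(Exception e) {
        throwIfUnchecked(e);
        throw new RuntimeException(e);
    }

    public static void main(String[] args) {
        try {
            rethrow(new IllegalStateException("unchecked"));
        } catch (IllegalStateException e) {
            System.out.println("passed through: " + e.getMessage());
        }
        try {
            rethrow(new IOException("checked"));
        } catch (RuntimeException e) {
            System.out.println("wrapped cause: " + e.getCause().getClass().getSimpleName());
        }
    }
}
```

Where the caught type is always checked, as with the `IOException` handlers elsewhere in this diff, the pass-through step is unnecessary and a plain `throw new RuntimeException(ioe)` behaves the same.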

@@ -127,7 +127,7 @@ public boolean hasNext() {
try {
close();
} catch (IOException ioe) {
- throw Throwables.propagate(ioe);
+ throw new RuntimeException(ioe);
}
}
return next != null;
@@ -151,7 +151,8 @@ public T next() {
next = null;
return ret;
} catch (Exception e) {
- throw Throwables.propagate(e);
+ Throwables.throwIfUnchecked(e);
+ throw new RuntimeException(e);
}
}


@@ -287,7 +287,8 @@ public Iterator<T> iterator() {
iteratorTracker.add(new WeakReference<>(it));
return it;
} catch (Exception e) {
- throw Throwables.propagate(e);
+ Throwables.throwIfUnchecked(e);
+ throw new RuntimeException(e);
}
}
};

@@ -113,7 +113,7 @@ public boolean hasNext() {
try {
close();
} catch (IOException ioe) {
- throw Throwables.propagate(ioe);
+ throw new RuntimeException(ioe);
}
}
return next != null;
@@ -137,7 +137,8 @@ public T next() {
next = null;
return ret;
} catch (Exception e) {
- throw Throwables.propagate(e);
+ Throwables.throwIfUnchecked(e);
+ throw new RuntimeException(e);
}
}

10 changes: 10 additions & 0 deletions common/network-common/pom.xml
@@ -194,6 +194,16 @@
<artifactId>mockito-core</artifactId>
<scope>test</scope>
</dependency>
+ <dependency>
+ <groupId>net.bytebuddy</groupId>
+ <artifactId>byte-buddy</artifactId>
+ <scope>test</scope>
+ </dependency>
+ <dependency>
+ <groupId>net.bytebuddy</groupId>
+ <artifactId>byte-buddy-agent</artifactId>
+ <scope>test</scope>
+ </dependency>
</dependencies>

<build>
@@ -290,9 +290,11 @@ public void onFailure(Throwable e) {
try {
return result.get(timeoutMs, TimeUnit.MILLISECONDS);
} catch (ExecutionException e) {
- throw Throwables.propagate(e.getCause());
+ Throwables.throwIfUnchecked(e.getCause());
+ throw new RuntimeException(e.getCause());
} catch (Exception e) {
- throw Throwables.propagate(e);
+ Throwables.throwIfUnchecked(e);
+ throw new RuntimeException(e);
}
}
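The hunk above treats `ExecutionException` separately so that callers see the underlying failure rather than the wrapper added by the future. A minimal sketch of that shape, using only the JDK, is shown below. `SyncCallSketch` and `await` are hypothetical names, and the inline `instanceof` checks stand in for Guava's `Throwables.throwIfUnchecked` so the example has no external dependencies.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;

class SyncCallSketch {
  // Blocks for up to timeoutMs and converts every failure into an unchecked exception.
  static <T> T await(Future<T> result, long timeoutMs) {
    try {
      return result.get(timeoutMs, TimeUnit.MILLISECONDS);
    } catch (ExecutionException e) {
      // Unwrap: surface the task's own failure, not the ExecutionException wrapper.
      Throwable cause = e.getCause();
      if (cause instanceof RuntimeException) {
        throw (RuntimeException) cause;
      }
      if (cause instanceof Error) {
        throw (Error) cause;
      }
      throw new RuntimeException(cause);
    } catch (Exception e) {
      // Timeouts, interruption, and anything else thrown by get().
      if (e instanceof RuntimeException) {
        throw (RuntimeException) e;
      }
      throw new RuntimeException(e);
    }
  }

  public static void main(String[] args) {
    CompletableFuture<String> failed = new CompletableFuture<>();
    failed.completeExceptionally(new IllegalStateException("remote side rejected the request"));
    try {
      await(failed, 100);
    } catch (IllegalStateException e) {
      System.out.println("caller sees the original exception: " + e.getMessage());
    }
  }
}
```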

@@ -342,7 +342,8 @@ public void operationComplete(final Future<Channel> handshakeFuture) {
logger.error("Exception while bootstrapping client after {} ms", e,
MDC.of(LogKeys.BOOTSTRAP_TIME$.MODULE$, bootstrapTimeMs));
client.close();
- throw Throwables.propagate(e);
+ Throwables.throwIfUnchecked(e);
+ throw new RuntimeException(e);
}
long postBootstrap = System.nanoTime();

@@ -22,7 +22,6 @@
import java.security.GeneralSecurityException;
import java.util.concurrent.TimeoutException;

- import com.google.common.base.Throwables;
import io.netty.buffer.ByteBuf;
import io.netty.buffer.Unpooled;
import io.netty.channel.Channel;
@@ -80,7 +79,7 @@ public void doBootstrap(TransportClient client, Channel channel) {
doSparkAuth(client, channel);
client.setClientId(appId);
} catch (GeneralSecurityException | IOException e) {
- throw Throwables.propagate(e);
+ throw new RuntimeException(e);
} catch (RuntimeException e) {
// There isn't a good exception that can be caught here to know whether it's really
// OK to switch back to SASL (because the server doesn't speak the new protocol). So
@@ -132,7 +132,8 @@ protected boolean doAuthChallenge(
try {
engine.close();
} catch (Exception e) {
- throw Throwables.propagate(e);
+ Throwables.throwIfUnchecked(e);
+ throw new RuntimeException(e);
}
}
}
@@ -17,32 +17,12 @@

package org.apache.spark.network.crypto;

- import com.google.common.annotations.VisibleForTesting;
- import com.google.crypto.tink.subtle.Hex;
- import com.google.crypto.tink.subtle.Hkdf;
import io.netty.channel.Channel;

- import javax.crypto.spec.SecretKeySpec;
import java.io.IOException;
- import java.nio.charset.StandardCharsets;
import java.security.GeneralSecurityException;

interface TransportCipher {
String getKeyId() throws GeneralSecurityException;
void addToChannel(Channel channel) throws IOException, GeneralSecurityException;
}
-
- class TransportCipherUtil {
- /*
- * This method is used for testing to verify key derivation.
- */
- @VisibleForTesting
- static String getKeyId(SecretKeySpec key) throws GeneralSecurityException {
- byte[] keyIdBytes = Hkdf.computeHkdf("HmacSha256",
- key.getEncoded(),
- null,
- "keyID".getBytes(StandardCharsets.UTF_8),
- 32);
- return Hex.encode(keyIdBytes);
- }
- }
@@ -0,0 +1,41 @@
/*
* Licensed to the Apache Software Foundation (ASF) under one or more
* contributor license agreements. See the NOTICE file distributed with
* this work for additional information regarding copyright ownership.
* The ASF licenses this file to You under the Apache License, Version 2.0
* (the "License"); you may not use this file except in compliance with
* the License. You may obtain a copy of the License at
*
* http://www.apache.org/licenses/LICENSE-2.0
*
* Unless required by applicable law or agreed to in writing, software
* distributed under the License is distributed on an "AS IS" BASIS,
* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
* See the License for the specific language governing permissions and
* limitations under the License.
*/

package org.apache.spark.network.crypto;

import java.nio.charset.StandardCharsets;
import java.security.GeneralSecurityException;
import javax.crypto.spec.SecretKeySpec;

import com.google.common.annotations.VisibleForTesting;
import com.google.crypto.tink.subtle.Hex;
import com.google.crypto.tink.subtle.Hkdf;

class TransportCipherUtil {
/**
* This method is used for testing to verify key derivation.
*/
@VisibleForTesting
static String getKeyId(SecretKeySpec key) throws GeneralSecurityException {
byte[] keyIdBytes = Hkdf.computeHkdf("HmacSha256",
key.getEncoded(),
null,
"keyID".getBytes(StandardCharsets.UTF_8),
32);
return Hex.encode(keyIdBytes);
}
}
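`getKeyId` derives a 32-byte identifier from the key with HKDF (HMAC-SHA256, no salt, info string `"keyID"`) and hex-encodes it. As a rough illustration of what that derivation computes, here is a JDK-only sketch of HKDF as specified in RFC 5869, restricted to a single output block. It swaps in `javax.crypto.Mac` for Tink's `Hkdf.computeHkdf`; `KeyIdSketch`, `hmac`, `hkdf32`, and `keyId` are hypothetical names, treating a missing salt as 32 zero bytes follows the RFC's default, and byte-for-byte agreement with Tink's output is an assumption that has not been verified here.

```java
import java.nio.charset.StandardCharsets;
import java.security.GeneralSecurityException;
import javax.crypto.Mac;
import javax.crypto.spec.SecretKeySpec;

class KeyIdSketch {
  private static final String HMAC = "HmacSHA256";
  private static final int HASH_LEN = 32; // SHA-256 output size in bytes

  // HMAC-SHA256 over the concatenation of the given parts.
  static byte[] hmac(byte[] key, byte[]... parts) throws GeneralSecurityException {
    Mac mac = Mac.getInstance(HMAC);
    mac.init(new SecretKeySpec(key, HMAC));
    for (byte[] part : parts) {
      mac.update(part);
    }
    return mac.doFinal();
  }

  // HKDF limited to one 32-byte output block.
  static byte[] hkdf32(byte[] inputKey, byte[] info) throws GeneralSecurityException {
    // Extract: PRK = HMAC(salt, IKM), with an all-zero salt when none is supplied.
    byte[] prk = hmac(new byte[HASH_LEN], inputKey);
    // Expand: T(1) = HMAC(PRK, info || 0x01); one block covers 32 bytes of output.
    return hmac(prk, info, new byte[] {1});
  }

  // Lowercase hex encoding of the derived identifier.
  static String keyId(byte[] rawKey) throws GeneralSecurityException {
    byte[] id = hkdf32(rawKey, "keyID".getBytes(StandardCharsets.UTF_8));
    StringBuilder hex = new StringBuilder(2 * id.length);
    for (byte b : id) {
      hex.append(String.format("%02x", b));
    }
    return hex.toString();
  }

  public static void main(String[] args) throws GeneralSecurityException {
    byte[] demoKey = new byte[16]; // placeholder key material for illustration only
    System.out.println(keyId(demoKey));
  }
}
```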
@@ -29,7 +29,6 @@
import javax.security.sasl.SaslClient;
import javax.security.sasl.SaslException;

- import com.google.common.base.Throwables;
import com.google.common.collect.ImmutableMap;

import org.apache.spark.internal.SparkLogger;
@@ -62,7 +61,7 @@ public SparkSaslClient(String secretKeyId, SecretKeyHolder secretKeyHolder, bool
this.saslClient = Sasl.createSaslClient(new String[] { DIGEST }, null, null, DEFAULT_REALM,
saslProps, new ClientCallbackHandler());
} catch (SaslException e) {
- throw Throwables.propagate(e);
+ throw new RuntimeException(e);
}
}

@@ -72,7 +71,7 @@ public synchronized byte[] firstToken() {
try {
return saslClient.evaluateChallenge(new byte[0]);
} catch (SaslException e) {
- throw Throwables.propagate(e);
+ throw new RuntimeException(e);
}
} else {
return new byte[0];
@@ -98,7 +97,7 @@ public synchronized byte[] response(byte[] token) {
try {
return saslClient != null ? saslClient.evaluateChallenge(token) : new byte[0];
} catch (SaslException e) {
- throw Throwables.propagate(e);
+ throw new RuntimeException(e);
}
}
