[SPARK-41197][BUILD] Upgrade Kafka to 3.3.1#38715
[SPARK-41197][BUILD] Upgrade Kafka to 3.3.1#38715tedyu wants to merge 3 commits intoapache:masterfrom
Conversation
|
@HeartSaVioR |
|
Hi, 3.3.0 "A significant bug was found in the 3.3.0 release after artifacts were pushed to Apache and Maven central but prior to the release announcement. As a result, the decision was made to not announce 3.3.0 and instead release 3.3.1 with the fix. It is recommended that 3.3.0 not be used." Can you try with 3.3.1 |
|
@bjornjorgensen |
dongjoon-hyun
left a comment
There was a problem hiding this comment.
Hi, @tedyu . Could you fix UT failures?
[info] *** 90 TESTS FAILED ***
[error] Failed tests:
[error] org.apache.spark.sql.kafka010.KafkaMicroBatchV2SourceSuite
[error] org.apache.spark.sql.kafka010.KafkaMicroBatchV1SourceWithAdminSuite
[error] org.apache.spark.sql.kafka010.KafkaMicroBatchV1SourceSuite
[error] org.apache.spark.sql.kafka010.KafkaContinuousSourceSuite
[error] org.apache.spark.sql.kafka010.KafkaMicroBatchV2SourceWithAdminSuite
|
I noticed the test failures. |
|
Can it have something to do with apache/kafka#12794 ? |
|
These failures comes from apache/kafka#12049 and is described here: https://kafka.apache.org/documentation/#upgrade_33_notable |
|
cc @dongjoon-hyun and @HeartSaVioR FYI |
|
Thanks @dengziming for the information. |
|
@HyukjinKwon @dongjoon-hyun @HeartSaVioR |
dongjoon-hyun
left a comment
There was a problem hiding this comment.
+1, LGTM. Thank you, @tedyu , @bjornjorgensen , @dengziming, @HyukjinKwon .
Merged to master for Apache Spark 3.4.
This PR upgrades Kafka to 3.3.1 release. The new default partitioner keeps track of how many bytes are produced per-partition and once the amount exceeds `batch.size`, it switches to the next partition. For spark kafka tests, this will result in records being sent to only one partition in some tests. `KafkaTestUtils.producerConfiguration` is modified to use `DefaultPartitioner`. Kafka 3.3.1 release has new features along with bug fixes: https://www.confluent.io/blog/apache-kafka-3-3-0-new-features-and-updates/ No Existing test suite Closes #38715 from tedyu/k-33. Authored-by: Ted Yu <yuzhihong@gmail.com> Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
|
This is merged via 0ff201c . |
### What changes were proposed in this pull request? This PR upgrades Kafka to 3.3.1 release. The new default partitioner keeps track of how many bytes are produced per-partition and once the amount exceeds `batch.size`, it switches to the next partition. For spark kafka tests, this will result in records being sent to only one partition in some tests. `KafkaTestUtils.producerConfiguration` is modified to use `DefaultPartitioner`. ### Why are the changes needed? Kafka 3.3.1 release has new features along with bug fixes: https://www.confluent.io/blog/apache-kafka-3-3-0-new-features-and-updates/ ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Existing test suite Closes apache#38715 from tedyu/k-33. Lead-authored-by: Dongjoon Hyun <dongjoon@apache.org> Co-authored-by: Ted Yu <yuzhihong@gmail.com> Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
This PR upgrades Kafka to 3.3.1 release. The new default partitioner keeps track of how many bytes are produced per-partition and once the amount exceeds `batch.size`, it switches to the next partition. For spark kafka tests, this will result in records being sent to only one partition in some tests. `KafkaTestUtils.producerConfiguration` is modified to use `DefaultPartitioner`. Kafka 3.3.1 release has new features along with bug fixes: https://www.confluent.io/blog/apache-kafka-3-3-0-new-features-and-updates/ No Existing test suite Closes apache#38715 from tedyu/k-33. Authored-by: Ted Yu <yuzhihong@gmail.com> Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
### What changes were proposed in this pull request? This PR upgrades Kafka to 3.3.1 release. The new default partitioner keeps track of how many bytes are produced per-partition and once the amount exceeds `batch.size`, it switches to the next partition. For spark kafka tests, this will result in records being sent to only one partition in some tests. `KafkaTestUtils.producerConfiguration` is modified to use `DefaultPartitioner`. ### Why are the changes needed? Kafka 3.3.1 release has new features along with bug fixes: https://www.confluent.io/blog/apache-kafka-3-3-0-new-features-and-updates/ ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Existing test suite Closes apache#38715 from tedyu/k-33. Lead-authored-by: Dongjoon Hyun <dongjoon@apache.org> Co-authored-by: Ted Yu <yuzhihong@gmail.com> Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
This PR upgrades Kafka to 3.3.1 release. The new default partitioner keeps track of how many bytes are produced per-partition and once the amount exceeds `batch.size`, it switches to the next partition. For spark kafka tests, this will result in records being sent to only one partition in some tests. `KafkaTestUtils.producerConfiguration` is modified to use `DefaultPartitioner`. Kafka 3.3.1 release has new features along with bug fixes: https://www.confluent.io/blog/apache-kafka-3-3-0-new-features-and-updates/ No Existing test suite Closes apache#38715 from tedyu/k-33. Authored-by: Ted Yu <yuzhihong@gmail.com> Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
What changes were proposed in this pull request?
This PR upgrades Kafka to 3.3.1 release.
The new default partitioner keeps track of how many bytes are produced per-partition and once the amount exceeds
batch.size, it switches to the next partition. For spark kafka tests, this will result in records being sent to only one partition in some tests.KafkaTestUtils.producerConfigurationis modified to useDefaultPartitioner.Why are the changes needed?
Kafka 3.3.1 release has new features along with bug fixes: https://www.confluent.io/blog/apache-kafka-3-3-0-new-features-and-updates/
Does this PR introduce any user-facing change?
No
How was this patch tested?
Existing test suite