Kafka-15126: Range queries to accept null lower and upper bounds by Cerchie · Pull Request #13987 · apache/kafka

Cerchie · 2023-07-10T18:48:43Z

Change in response to KIP-941.

Changes line 57 in the RangeQuery class file from:

public static <K, V> RangeQuery<K, V> withRange(final K lower, final K upper) {
    return new RangeQuery<>(Optional.of(lower), Optional.of(upper));
}

to

public static <K, V> RangeQuery<K, V> withRange(final K lower, final K upper) {
     return new RangeQuery<>(Optional.ofNullable(lower), Optional.ofNullable(upper));
 }

Testing strategy:

Since null values can now be entered in RangeQuerys in order to receive full scans, I changed the logic defining query starting at line 1085 in IQv2StoreIntegrationTest.java from:

        final RangeQuery<Integer, V> query;
        if (lower.isPresent() && upper.isPresent()) {
            query = RangeQuery.withRange(lower.get(), upper.get());
        } else if (lower.isPresent()) {
            query = RangeQuery.withLowerBound(lower.get());
        } else if (upper.isPresent()) {
            query = RangeQuery.withUpperBound(upper.get());
        } else {
            query = RangeQuery.withNoBounds();
        }

to

query = RangeQuery.withRange(lower.orElse(null), upper.orElse(null));

because different combinations of isPresent() in the bounds is no longer necessary.

Committer Checklist (excluded from commit message)

Verify design and implementation
Verify test coverage and CI build status
Verify documentation (including upgrade notes)

removing errant comment

bbejeck

Thanks for the PR @Cerchie - overall, this looks good to me.
I think it would be good to update the Javadoc on the withRange(lower, upper) method to briefly describe the behavior if one or both of the parameters are null

bbejeck

LGTM

bbejeck · 2023-07-25T19:42:40Z

@Cerchie since it's been a couple of weeks for this PR can you rebase this with trunk?
Pending the related tests passing we should be able to merge

ping @vvcephei for a quick second look

Reviewers: Divij Vaidya <diviv@amazon.com> --------- Co-authored-by: Deqi Hu <deqi.hu@shopee.com>

…pache#13661) Various formatting fixes in the config docs. Reviewers: Bill Bejeck <bbejeck@apache.org>

…che#13284)

…pache#13383) Reviewers: Chris Egerton <chrise@aiven.io>

…lient retriable exceptions in AbstractWorkerSourceTask (apache#13955) Reviewers: Sagar Rao <sagarmeansocean@gmail.com>, Chris Egerton <chrise@aiven.io>

…tor (apache#13946) Reviewed-by: Greg Harris <greg.harris@aiven.io>

Reviewers: Divij Vaidya <diviv@amazon.com> --------- Co-authored-by: Damon Xie <damon.xie@zoom.us>

…nnelManager (apache#13988) Reviewers: David Arthur <mumrah@gmail.com>

…iner is empty (apache#13974) Reviewers: Divij Vaidya <diviv@amazon.com>

… status topic (apache#13453) During fast consecutive rebalances where a task is revoked from one worker and assigned to another one, it has been observed that there is a small time window and thus a race condition during which a RUNNING status record in the new generation is produced and is immediately followed by a delayed UNASSIGNED status record belonging to the same or a previous generation before the worker that sends this message reads the RUNNING status record that corresponds to the latest generation. Although this doesn't inhibit the actual execution of tasks, it reports an incorrect status for those tasks(i.e UNASSIGNED). If the users have setup some kind of monitoring on tasks status then this could lead to false alarms for example. This fix addresses this problem by checking if a status message is stale after reading it and updates it's status only when it is safe to. Reviewers: Lucent-Wong <manchesterfans@live.cn>, Chris Egerton <chrise@aiven.io>, Yash Mayya <yash.mayya@gmail.com>, Konstantine Karantasis <k.karantasis@gmail.com>

…he#13275) KAFKA-14522 Rewrite and Move of RemoteIndexCache to storage module. Cleanedup index file suffix usages and other minor cleanups Reviewers: Jun Rao <junrao@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Luke Chen <showuon@gmail.com>, Divij Vaidya <diviv@amazon.com>, Kamal Chandraprakash<kamal.chandraprakash@gmail.com>, Jorge Esteban Quilcate Otoya <quilcate.jorge@gmail.com>

Reviewers: Matthias J. Sax <matthias@confluent.io>

Reviewers: Luke Chen <showuon@gmail.com>

…stem tests (apache#13859) Reviewers: Luke Chen <showuon@gmail.com>, Christo Lolov <christololov@gmail.com>

…d insights (apache#13676) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Divij Vaidya <diviv@amazon.com>

…#13963) This patch adds the session timeout and the revocation timeout to the new consumer group protocol. Reviewers: Calvin Liu <caliu@confluent.io>, Jeff Kim <jeff.kim@confluent.io>, Justine Olshan <jolshan@confluent.io>

…le in MBean server (apache#13995) In JmxTool.scala, we will wait till all the object names are available from MBean server. But in the newer version, we only wait for subset of object names. Due to this, we may not enforce wait option and prematurely return the result if the objects are not yet registered in MBean sever. Reviewers: Luke Chen <showuon@gmail.com>, Federico Valeri <fvaleri@redhat.com>

…ning as unit tests (apache#13973) Reviewers: Divij Vaidya <diviv@amazon.com>

Reviewers: Sagar Rao <sagarmeansocean@gmail.com>, Yash Mayya <yash.mayya@gmail.com>, Sudesh Wasnik <swasnik@confluent.io>, Chris Egerton <chrise@aiven.io>

…e#13992) Reviewed-by: Greg Harris <greg.harris@aiven.io>

Catch any exceptions that escape the processing logic inside TaskExecutors and record them in the TaskManager. Make sure the TaskExecutor survives, but the task is unassigned. Add a method to TaskManager to drain the exceptions. The aim here is that the polling thread will drain the exceptions to be able to execute the uncaught exception handler, abort transactions, etc. Reviewer: Bruno Cadonna <cadonna@apache.org>

…che#13703) Standardize controller log4j output for replaying important records. The log message should include word "replayed" to make it clear that this is a record replay. Log the replay of records for ACLs, client quotas, and producer IDs, which were previously not logged. Also fix a case where we weren't logging changes to broker registrations. AclControlManager, ClientQuotaControlManager, and ProducerIdControlManager didn't previously have a log4j logger object, so this PR adds one. It also converts them to using Builder objects. This makes junit tests more readable because we don't need to specify paramaters where the test can use the default (like LogContexts). Throw an exception in replay if we get another TopicRecord for a topic which already exists. Example log messages: INFO [QuorumController id=3000] Replayed a FeatureLevelRecord setting metadata version to 3.6-IV0 DEBUG [QuorumController id=3000] Replayed a ZkMigrationStateRecord which did not alter the state from NONE. INFO [QuorumController id=3000] Replayed BrokerRegistrationChangeRecord modifying the registration for broker 0: BrokerRegistrationChangeRecord(brokerId=0, brokerEpoch=3, fenced=-1, inControlledShutdown=0) INFO [QuorumController id=3000] Replayed ClientQuotaRecord for ClientQuotaEntity(entries={user=testkit}) setting request_percentage to 0.99. Reviewers: Divij Vaidya <diviv@amazon.com>, Ron Dagostino <rndgstn@gmail.com>, David Arthur <mumrah@gmail.com>

…ers (apache#14044) Reviewers: Chris Egerton <chrise@aiven.io>

apache#14041) Reviewers: Mickael Maison <mickael.maison@gmail.com>

…che#13874) Reviewers: Divij Vaidya <diviv@amazon.com>

…ools dependency from connect-runtime (apache#13313) Reviewers: Ismael Juma <ismael@juma.me.uk>

Part of KIP-925. Reviewers: Matthias J. Sax <matthias@confluent.io>

Added tests for metrics: 1. RemoteLogReaderTaskQueueSize 2. RemoteLogReaderAvgIdlePercent 3. RemoteLogManagerTasksAvgIdlePercent Also, added tests for OffsetOutOfRangeException will be thrown while reading logs Reviewers: Christo Lolov <christololov@gmail.com>, Satish Duggana <satishd@apache.org>

Fixing the build failure caused by the earlier commit apache@27ea025 ``` [Error] /Users/satishd/repos/apache-kafka/core/src/test/scala/unit/kafka/server/ReplicaManagerTest.scala:3526:77: the result type of an implicit conversion must be more specific than Object [Error] /Users/satishd/repos/apache-kafka/core/src/test/scala/unit/kafka/server/ReplicaManagerTest.scala:3530:70: the result type of an implicit conversion must be more specific than Object [Warn] /Users/satishd/repos/apache-kafka/core/src/test/scala/unit/kafka/server/ServerGenerateBrokerIdTest.scala:23:21: imported `QuorumTestHarness` is permanently hidden by definition of object QuorumTestHarness in package server [Warn] /Users/satishd/repos/apache-kafka/core/src/test/scala/unit/kafka/server/ServerGenerateClusterIdTest.scala:29:21: imported `QuorumTestHarness` is permanently hidden by definition of object QuorumTestHarness in package server [Error] /Users/satishd/repos/apache-kafka/core/src/test/scala/unit/kafka/utils/TestUtils.scala:1438:15: ambiguous reference to overloaded definition, both method doReturn in class Mockito of type (x$1: Any, x$2: Object*)org.mockito.stubbing.Stubber and method doReturn in class Mockito of type (x$1: Any)org.mockito.stubbing.Stubber match argument types (kafka.log.UnifiedLog) ``` Reviewers: Luke Chen <showuon@gmail.com>

Reviewers: Divij Vaidya <diviv@amazon.com>

Adding the value 20 to the JDK version that can build Apache Kafka into README.md Reviewers: Divij Vaidya <diviv@amazon.com>

Reviewers: Mickael Maison <mickael.maison@gmail.com>, Federico Valeri <fedevaleri@gmail.com>

…ion and offset in sink records (apache#14024) Reviewers: Greg Harris <greg.harris@aiven.io>, Chris Egerton <chrise@aiven.io>

…taleMemberEpochException error (apache#14046) This patch does a few things: 1) It introduces version 9 of the OffsetCommit API. This new version has no schema changes but it can return a StaleMemberEpochException if the new consumer group protocol is used. Note the use of `"latestVersionUnstable": true` in the request schema. This means that this new version is not available yet unless activated. 2) It renames the `generationId` field in the request to `GenerationIdOrMemberEpoch`. This is backward compatible change. 3) It introduces the new StaleMemberEpochException error. 4) It does a minor refactoring in OffsetCommitRequest class. Reviewers: Jeff Kim <jeff.kim@confluent.io>, David Arthur <mumrah@gmail.com>, Justine Olshan <jolshan@confluent.io>

This patch does a few things: 1) It introduces the `OffsetAndMetadata` class which hold the committed offsets in the group coordinator. 2) It adds methods to deal with OffsetCommit records to `RecordHelpers`. 3) It adds `MetadataVersion#offsetCommitValueVersion` to get the version of the OffsetCommit value record that should be used. Reviewers: Jeff Kim <jeff.kim@confluent.io>, David Arthur <mumrah@gmail.com>, Justine Olshan <jolshan@confluent.io>

Reviewers: Mickael Maison <mickael.maison@gmail.com>

We will explicitly send an assignment change event to the background thread to invoke auto-commit if the group.id is configured. After updating the subscription state, a NewTopicsMetadataUpdateRequestEvent will also be sent to the background thread to update the metadata. Co-authored-by: Kirk True <kirk@kirktrue.pro> Reviewers: Jun Rao <junrao@gmail.com>

… consumer-manager/task (apache#14045) Improved logging and docs on consumer manager/task call paths. Reviewers: Luke Chen <showuon@gmail.com>, Satish Duggana <satishd@apache.org>

…ge (apache#14057) Reviewers: Divij Vaidya <diviv@amazon.com>

…pache#13773) Fix the confusing error message in ImageWriterOptions Reviewers: Luke Chen <showuon@gmail.com>, David Arthur <mumrah@gmail.com>

…ata cache (apache#14004) KAFKA-15168: Handle overlapping remote log segments in RemoteLogMetadata cache Reviewers: Satish Duggana <satishd@apache.org>, Viktor Nikitash <nikitashvictor@pdffiller.com>, Jorge Esteban Quilcate Otoya <quilcate.jorge@gmail.com>, Abhijeet Kumar <abhijeet.cse.kgp@gmail.com>

When creating a verification state entry, we also store sequence and epoch. On subsequent requests, we will take the latest epoch seen and the earliest sequence seen. That way, if we try to append a sequence after the earliest seen sequence, we can block that and retry. This addresses potential OutOfOrderSequence loops caused by errors during verification (coordinator loading, timeouts, etc). Reviewers: David Jacot <david.jacot@gmail.com>, Artem Livshits <alivshits@confluent.io>

…oker restart (apache#13707) Dynamic overrides for the producer ID expiration config are not picked up on broker restart in Zookeeper mode. Based on the integration test, this does not apply to KRaft mode. Adds a broker restart that fails without the corresponding KafkaConfig change. Reviewers: Justine Olshan <jolshan@confluent.io>

…pache#14010) Implement some of the metrics from KIP-938: Add more metrics for measuring KRaft performance. Add these metrics to QuorumControllerMetrics: kafka.controller:type=KafkaController,name=TimedOutBrokerHeartbeatCount kafka.controller:type=KafkaController,name=EventQueueOperationsStartedCount kafka.controller:type=KafkaController,name=EventQueueOperationsTimedOutCount kafka.controller:type=KafkaController,name=NewActiveControllersCount Create LoaderMetrics with these new metrics: kafka.server:type=MetadataLoader,name=CurrentMetadataVersion kafka.server:type=MetadataLoader,name=HandleLoadSnapshotCount Create SnapshotEmitterMetrics with these new metrics: kafka.server:type=SnapshotEmitter,name=LatestSnapshotGeneratedBytes kafka.server:type=SnapshotEmitter,name=LatestSnapshotGeneratedAgeMs Reviewers: Ron Dagostino <rndgstn@gmail.com>

…ache#14081) Reviewers: Greg Harris <greg.harris@aiven.io>

…k thread to the sink task thread (apache#14079) Reviewers: Chris Egerton <chrise@aiven.io>

Reviewers: Mickael Maison <mickael.maison@gmail.com>

Cerchie · 2023-08-01T16:17:35Z

closing due to merge issue, refer to #14137

Cerchie and others added 3 commits July 10, 2023 11:33

add .ofNullable to line 57

03fbf2d

change test to cover null values in RangeQuery

f8f83ea

Update IQv2StoreIntegrationTest.java

a511ae6

removing errant comment

bbejeck reviewed Jul 10, 2023

View reviewed changes

divijvaidya added the streams label Jul 11, 2023

enhance javadoc

fcc8ced

bbejeck approved these changes Jul 25, 2023

View reviewed changes

hudeqi and others added 22 commits July 25, 2023 13:06

MINOR: Move TROGDOR.md to trogdor module (apache#13979)

8f5a4c5

Reviewers: Divij Vaidya <diviv@amazon.com> --------- Co-authored-by: Deqi Hu <deqi.hu@shopee.com>

Doc fixes: Fix format and other small errors in config documentation (a…

e28c0a0

…pache#13661) Various formatting fixes in the config docs. Reviewers: Bill Bejeck <bbejeck@apache.org>

KAFKA-14718: Wait for MirrorMaker to start before executing test (apa…

fa5653d

…che#13284)

KAFKA-14059 Replace PowerMock with Mockito in WorkerSourceTaskTest (a…

39b2794

…pache#13383) Reviewers: Chris Egerton <chrise@aiven.io>

KAFKA-15145: Don't re-process records filtered out by SMTs on Kafka c…

3fac756

…lient retriable exceptions in AbstractWorkerSourceTask (apache#13955) Reviewers: Sagar Rao <sagarmeansocean@gmail.com>, Chris Egerton <chrise@aiven.io>

KAFKA-15139: Avoid slow Set.removeAll(List) in MirrorCheckpointConnec…

3c0fad1

…tor (apache#13946) Reviewed-by: Greg Harris <greg.harris@aiven.io>

KAFKA-15159: upgrade minor dependencies (apache#13982)

28be64f

Reviewers: Divij Vaidya <diviv@amazon.com> --------- Co-authored-by: Damon Xie <damon.xie@zoom.us>

[KAFKA-15137] Do not log entire request payload in KRaftControllerCha…

d1e266f

…nnelManager (apache#13988) Reviewers: David Arthur <mumrah@gmail.com>

KAFKA-15155: Follow PEP 8 best practice in Python to check if a conta…

63ea3c6

…iner is empty (apache#13974) Reviewers: Divij Vaidya <diviv@amazon.com>

MINOR: Add Streams API broker compatibility table (apache#13937)

13b4b20

Reviewers: Matthias J. Sax <matthias@confluent.io>

MINOR: Add 3.5.0 and 3.4.1 to system tests (apache#13849)

72f774b

Reviewers: Luke Chen <showuon@gmail.com>

KAFKA-15093: Add 3.4.0 and 3.5.0 to core upgrade and compatibility sy…

546d848

…stem tests (apache#13859) Reviewers: Luke Chen <showuon@gmail.com>, Christo Lolov <christololov@gmail.com>

MINOR: Capture build scans on ge.apache.org to benefit from deep buil…

27f0a71

…d insights (apache#13676) Reviewers: Mickael Maison <mickael.maison@gmail.com>, Ismael Juma <ismael@juma.me.uk>, Divij Vaidya <diviv@amazon.com>

KAFKA-15148: Mark tests correctly as integration tests where they run…

ba78c0a

…ning as unit tests (apache#13973) Reviewers: Divij Vaidya <diviv@amazon.com>

KAFKA-14938: Fixing flaky test testConnectorBoundary (apache#13646)

355799c

Reviewers: Sagar Rao <sagarmeansocean@gmail.com>, Yash Mayya <yash.mayya@gmail.com>, Sudesh Wasnik <swasnik@confluent.io>, Chris Egerton <chrise@aiven.io>

MINOR: Avoid slow Set.removeAll(List) in MirrorSourceConnector (apach…

5349dfb

…e#13992) Reviewed-by: Greg Harris <greg.harris@aiven.io>

yashmayya and others added 28 commits July 25, 2023 13:06

KAFKA-15216: InternalSinkRecord::newRecord should not ignore new head…

3518861

…ers (apache#14044) Reviewers: Chris Egerton <chrise@aiven.io>

KAFKA-14469: Add MirrorMaker configs to table of contents in docs page (

c07bfd5

apache#14041) Reviewers: Mickael Maison <mickael.maison@gmail.com>

KAFKA-14133: Migrate various mocks in TaskManagerTest to Mockito (apa…

303eb7e

…che#13874) Reviewers: Divij Vaidya <diviv@amazon.com>

KAFKA-14760: Move ThroughputThrottler from tools to clients, remove t…

383a005

…ools dependency from connect-runtime (apache#13313) Reviewers: Ismael Juma <ismael@juma.me.uk>

KAFKA-15022: [2/N] introduce graph to compute min cost (apache#13996)

5445062

Part of KIP-925. Reviewers: Matthias J. Sax <matthias@confluent.io>

KAFKA-15222: upgrade zinc scala plugin to 1.9.2 (apache#14060)

6316b2b

Reviewers: Divij Vaidya <diviv@amazon.com>

MINOR: add JDK 20 build support to README (apache#14061)

53f2c19

Adding the value 20 to the JDK version that can build Apache Kafka into README.md Reviewers: Divij Vaidya <diviv@amazon.com>

KAFKA-14591: Move DeleteRecordsCommand to tools (apache#13278)

7302324

Reviewers: Mickael Maison <mickael.maison@gmail.com>, Federico Valeri <fedevaleri@gmail.com>

KAFKA-13431 (KIP-793): Expose the original pre-transform topic partit…

bdf7b90

…ion and offset in sink records (apache#14024) Reviewers: Greg Harris <greg.harris@aiven.io>, Chris Egerton <chrise@aiven.io>

KAFKA-15232: Move ToolsUtils to tools (apache#14066)

eee7026

Reviewers: Mickael Maison <mickael.maison@gmail.com>

MINOR: Minor logging and doc related improvements in topic-based RLMM…

4599b2b

… consumer-manager/task (apache#14045) Improved logging and docs on consumer manager/task call paths. Reviewers: Luke Chen <showuon@gmail.com>, Satish Duggana <satishd@apache.org>

KAFKA-15194: Prepend offset in the filenames used by LocalTieredStora…

2b70811

…ge (apache#14057) Reviewers: Divij Vaidya <diviv@amazon.com>

KAFKA-14712: Produce correct error msg with correct metadataversion (a…

0956cb0

…pache#13773) Fix the confusing error message in ImageWriterOptions Reviewers: Luke Chen <showuon@gmail.com>, David Arthur <mumrah@gmail.com>

MINOR: Downgrade log level for conflicting Connect plugin aliases (ap…

2242de9

…ache#14081) Reviewers: Greg Harris <greg.harris@aiven.io>

KAFKA-15238: Move DLQ reporter setup from the DistributedHerder's tic…

5cab2f8

…k thread to the sink task thread (apache#14079) Reviewers: Chris Egerton <chrise@aiven.io>

MINOR: Fix typo in ProduceRequest.json (apache#14070)

3be0dc5

Reviewers: Mickael Maison <mickael.maison@gmail.com>

rebase jul 25

85f209b

Merge branch 'trunk' into KAFKA-15126

856ee7f

Merge branch 'apache:trunk' into KAFKA-15126

7f9696c

Cerchie closed this Aug 1, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Kafka-15126: Range queries to accept null lower and upper bounds#13987

Kafka-15126: Range queries to accept null lower and upper bounds#13987
Cerchie wants to merge 88 commits intoapache:trunkfrom
Cerchie:KAFKA-15126

Cerchie commented Jul 10, 2023

Uh oh!

bbejeck left a comment

Uh oh!

bbejeck left a comment

Uh oh!

bbejeck commented Jul 25, 2023

Uh oh!

Cerchie commented Aug 1, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

Conversation

Cerchie commented Jul 10, 2023

Committer Checklist (excluded from commit message)

Uh oh!

bbejeck left a comment

Choose a reason for hiding this comment

Uh oh!

bbejeck left a comment

Choose a reason for hiding this comment

Uh oh!

bbejeck commented Jul 25, 2023

Uh oh!

Cerchie commented Aug 1, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants