feat(keys, scan): Support arbitrary glob patterns #2608

nathanlo-hrt · 2024-10-17T15:46:11Z

Kvrocks currently only supports prefix matching (glob patterns like ab*).
This change implements arbitrary glob patterns for KEYS and SCAN MATCH [pattern].
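
For illustration only, here is a minimal sketch of what matching an arbitrary glob against a key can look like. It is not the code added in this PR: the names `GlobMatchSketch` and `MatchOne` are hypothetical, and the supported syntax (`*`, `?`, `[...]` with `^` negation and `a-z` ranges, `\` escapes outside classes) is an assumption about typical Redis-style globs.

```cpp
#include <cassert>
#include <cstddef>
#include <string_view>

constexpr std::size_t kNoMatch = std::string_view::npos;

// Hypothetical helper: tries to match character c against the single pattern
// element starting at pat[p] (which must not be '*'). Returns the index just
// past that element on success, or kNoMatch on failure.
std::size_t MatchOne(std::string_view pat, std::size_t p, char c) {
  assert(p < pat.size());
  switch (pat[p]) {
    case '?':
      return p + 1;  // any single character
    case '\\':
      // Escaped literal; a trailing backslash is treated as a literal backslash.
      if (p + 1 < pat.size()) return pat[p + 1] == c ? p + 2 : kNoMatch;
      return c == '\\' ? p + 1 : kNoMatch;
    case '[': {
      std::size_t i = p + 1;
      bool negate = i < pat.size() && pat[i] == '^';
      if (negate) ++i;
      bool found = false;
      auto uc = static_cast<unsigned char>(c);
      for (; i < pat.size() && pat[i] != ']'; ++i) {
        if (i + 2 < pat.size() && pat[i + 1] == '-' && pat[i + 2] != ']') {
          // Range such as a-z.
          if (static_cast<unsigned char>(pat[i]) <= uc &&
              uc <= static_cast<unsigned char>(pat[i + 2])) {
            found = true;
          }
          i += 2;
        } else if (pat[i] == c) {
          found = true;
        }
      }
      if (i >= pat.size()) return kNoMatch;  // unterminated class: no match
      return found != negate ? i + 1 : kNoMatch;
    }
    default:
      return pat[p] == c ? p + 1 : kNoMatch;
  }
}

// Iterative matcher that backtracks to the most recent '*'.
bool GlobMatchSketch(std::string_view pat, std::string_view str) {
  std::size_t p = 0, s = 0;
  std::size_t star_p = kNoMatch, star_s = 0;
  while (s < str.size()) {
    if (p < pat.size() && pat[p] == '*') {
      star_p = p++;  // remember the star and first try matching zero characters
      star_s = s;
      continue;
    }
    if (p < pat.size()) {
      std::size_t next = MatchOne(pat, p, str[s]);
      if (next != kNoMatch) {
        p = next;
        ++s;
        continue;
      }
    }
    if (star_p == kNoMatch) return false;
    p = star_p + 1;  // let the last star absorb one more character
    s = ++star_s;
  }
  while (p < pat.size() && pat[p] == '*') ++p;  // trailing stars match empty
  return p == pat.size();
}
```

With this sketch, `ab*` still behaves as a prefix match, while patterns such as `*c`, `a?c` or `a[b-d]c` also work.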

tests/gocase/unit/keyspace/keyspace_test.go

src/common/glob.h

tests/cppunit/string_util_test.cc

PragmaTwice · 2024-10-19T03:18:10Z

src/storage/redis_db.cc

+
+      if (!util::StringMatch(suffix_glob, user_key.substr(prefix.size()))) {
+        continue;
+      }
      keys->emplace_back(user_key);
      cnt++;


I'm wondering: if nothing is matched within the limit (e.g. the limit is 10, none of the first 10 keys match, but the 12th key does), will a valid cursor be returned so that users can use it to continue the scan?

Also could we add a test case to ensure that?

It seems like cnt is only incremented when we actually add a key to the result, so the loop will continue until no more keys match the prefix or limit keys have been inserted into the result vector.
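
To make the behaviour described above concrete, here is a rough sketch of the loop shape: split the glob into a literal prefix and a remaining suffix glob, walk the keys that share the prefix, and count only the keys that match. Everything here is an assumption for illustration: `SplitGlobSketch`, `TinyMatch` and `KeysMatchingSketch` are hypothetical names, a `std::set` stands in for the storage iterator, and `TinyMatch` handles only `*` and `?` as a stand-in for `util::StringMatch`.

```cpp
#include <cassert>
#include <cstddef>
#include <set>
#include <string>
#include <string_view>
#include <utility>
#include <vector>

// Splits a glob into its longest literal prefix and the remaining pattern.
// Assumption: '*', '?', '[' and '\' are the only special characters.
std::pair<std::string, std::string> SplitGlobSketch(const std::string& glob) {
  std::size_t pos = glob.find_first_of("*?[\\");
  if (pos == std::string::npos) return {glob, std::string()};
  return {glob.substr(0, pos), glob.substr(pos)};
}

// Stand-in matcher supporting only '*' and '?'.
bool TinyMatch(std::string_view pat, std::string_view str) {
  if (pat.empty()) return str.empty();
  if (pat[0] == '*') {
    return TinyMatch(pat.substr(1), str) ||
           (!str.empty() && TinyMatch(pat, str.substr(1)));
  }
  if (str.empty()) return false;
  return (pat[0] == '?' || pat[0] == str[0]) &&
         TinyMatch(pat.substr(1), str.substr(1));
}

// Collects up to `limit` keys matching `glob`. Non-matching keys are skipped
// without counting toward the limit, so the loop only stops when the prefix
// range ends or `limit` matches have been found.
std::vector<std::string> KeysMatchingSketch(const std::set<std::string>& keys,
                                            const std::string& glob,
                                            std::size_t limit) {
  auto [prefix, suffix_glob] = SplitGlobSketch(glob);
  std::vector<std::string> out;
  for (auto it = keys.lower_bound(prefix);
       it != keys.end() && out.size() < limit; ++it) {
    const std::string& key = *it;
    if (key.compare(0, prefix.size(), prefix) != 0) break;  // left prefix range
    assert(key.size() >= prefix.size());
    if (!TinyMatch(suffix_glob, std::string_view(key).substr(prefix.size()))) {
      continue;  // skipped keys do not count toward the limit
    }
    out.push_back(key);
  }
  return out;
}
```

In this shape, a pattern with a short prefix and a rarely matching suffix can visit many keys before `limit` matches are found.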

Hmmm, I think we'd better refactor the logic so that SCAN can be quick instead of a long blocking scan. E.g. we can scan a fixed maximum number of keys even if the limit/cnt is not reached. It's fine to return zero matched keys to users.
Or maybe we can confirm the logic in Redis?

I'm not sure if we can just merge this first and then plan to refactor to a "quick" scan.

WDYT? cc @mapleFU @git-hulk

Sorry for missing this comment. This sounds good to me after a rough review.

> Hmmm, I think we'd better refactor the logic so that SCAN can be quick instead of a long blocking scan. E.g. we can scan a fixed maximum number of keys even if the limit/cnt is not reached. It's fine to return zero matched keys to users.
> Or maybe we can confirm the logic in Redis?

Yes, Redis sets a maximum iteration count (10 * count) to avoid blocking for too long.

Refer: https://github.com/redis/redis/blob/611c950293ae34dcef148ec62c9dd9626d7dc9e3/src/db.c#L1212
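
As a rough sketch of the "quick scan" idea discussed in this thread (not Redis's or Kvrocks' actual code), a bounded scan could visit at most 10 * count keys per call and return a cursor even when nothing matched. The names `ScanPageSketch`, `BoundedScanSketch` and `EndsWith` are hypothetical, a sorted vector with an index cursor stands in for the real storage iterator, and a plain suffix check stands in for the glob match.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

struct ScanPageSketch {
  std::vector<std::string> keys;  // matched keys, possibly empty
  std::size_t next_cursor;        // 0 means the scan is complete
};

// Stand-in for the glob match: true if s ends with suffix.
bool EndsWith(const std::string& s, const std::string& suffix) {
  return s.size() >= suffix.size() &&
         s.compare(s.size() - suffix.size(), suffix.size(), suffix) == 0;
}

// Visits at most 10 * count keys starting at `cursor`, stopping early once
// `count` matches are collected. Returning zero keys with a non-zero cursor
// is valid: the caller simply continues from next_cursor.
ScanPageSketch BoundedScanSketch(const std::vector<std::string>& sorted_keys,
                                 std::size_t cursor, std::size_t count,
                                 const std::string& suffix) {
  assert(count > 0);
  const std::size_t max_iterations = 10 * count;
  ScanPageSketch page{{}, 0};
  std::size_t i = cursor;
  std::size_t visited = 0;
  while (i < sorted_keys.size() && visited < max_iterations &&
         page.keys.size() < count) {
    if (EndsWith(sorted_keys[i], suffix)) page.keys.push_back(sorted_keys[i]);
    ++i;
    ++visited;
  }
  page.next_cursor = i < sorted_keys.size() ? i : 0;
  return page;
}
```

With twelve keys where only the last one matches and count = 1, the first call visits ten keys and returns no keys with cursor 10; the second call returns the match and cursor 0.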

nathanlo99 · 2024-10-24T19:52:34Z

Just to say it: it would be nice to have the typos script run as a pre-commit hook or lint step. It's less than ideal to get the workflow approved by a reviewer just to find out you've introduced a typo somewhere in your commit.

git-hulk · 2024-10-28T15:22:52Z

Hi @nathanlo-hrt, it seems the Go test is broken. Would you mind fixing it so that the PR can be merged?

sonarqubecloud · 2024-10-28T18:29:28Z

Quality Gate passed

Issues
15 New issues
0 Accepted issues

Measures
0 Security Hotspots
69.5% Coverage on New Code
0.0% Duplication on New Code

nathanlo-hrt and others added 6 commits October 17, 2024 11:39

Implement glob matching

c1c1e1a

Implement glob splitting

93b9502

Fix bugs, add go tests

c3829c9

Remove LOG statements

c50fa57

Ignore glob test files in typos.toml

faa10ae

Merge branch 'unstable' into globs

cbe3d2f

PragmaTwice reviewed Oct 18, 2024

tests/gocase/unit/keyspace/keyspace_test.go (resolved)

PragmaTwice reviewed Oct 18, 2024

src/common/glob.h (outdated, resolved)

PragmaTwice requested review from git-hulk and mapleFU October 18, 2024 06:11

nathanlo-hrt and others added 4 commits October 18, 2024 11:10

Code review; use StringMatch and add to tests

4cea174

Merge branch 'unstable' into globs

31a0927

Replace removed sorts and compacts from scan_test.go

8fe2147

Replace removed sorts and compacts from scan_test.go

ec7ee3f

nathanlo-hrt requested a review from PragmaTwice October 18, 2024 15:16

PragmaTwice reviewed Oct 18, 2024

tests/cppunit/string_util_test.cc (resolved)

Validate globs; fix tests

376fcf0

PragmaTwice reviewed Oct 19, 2024

nathanlo-hrt and others added 4 commits October 21, 2024 11:50

Update typos.toml

2a0160c

Merge branch 'unstable' into globs

7640515

Add a new test exercising an edge case

4da5300

Ignore the right file

866c807

nathanlo-hrt added 2 commits October 24, 2024 17:18

Merge branch 'unstable' into globs

af2ad6b

Merge branch 'unstable' into globs

7063a8b

nathanlo-hrt requested a review from PragmaTwice October 28, 2024 14:09

Fix scan_test

ad49f6f

PragmaTwice approved these changes Oct 29, 2024

git-hulk approved these changes Oct 29, 2024

PragmaTwice merged commit 4aa36ec into apache:unstable Oct 29, 2024
32 checks passed

PragmaTwice mentioned this pull request Oct 29, 2024

Allow to scan keys with the suffix #2222

Closed

nathanlo-hrt deleted the globs branch October 29, 2024 14:54
