Make client-side pagination a true recommendation #3743

Closed
awoods187 opened this issue Sep 13, 2018 · 35 comments · Fixed by #6114
Labels: O-external (Origin: Issue comes from external users.) · P-0 (Urgent; must be done in next 2 weeks) · T-missing-info
@awoods187 (Contributor) commented Sep 13, 2018

I heard from k yesterday that they thought pagination via offset/limit was slow and were looking for recommendations on what to do for large result sets.

I spoke with @jordanlewis and he recommended:
"Doing “client-side pagination” by retrieving a set of records with a limit, and then checking the index key of the last row, using that as an index constraint, and then running the query again
this is normal - not specific to cockroach"
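
A minimal sketch of that pattern, assuming a hypothetical accounts table with a primary key id and a page size of 25:

SELECT * FROM accounts ORDER BY id LIMIT 25;

-- The client remembers the index key of the last row it saw
-- (say, id = 4096) and constrains the next query with it:
SELECT * FROM accounts WHERE id > 4096 ORDER BY id LIMIT 25;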

Our current docs: https://www.cockroachlabs.com/docs/stable/selection-queries.html#limiting-row-count-and-pagination actually suggest the wrong solution for pagination.

The reason is that "offset doesn’t know anything smart - it has to get the same data as before, just skips the first n. You have to participate as a client by remembering the index key of the last result set you saw."

We should also reference that some databases offer a feature called cursors to do this. @andy-kimball mentioned that "cursors are generally not a great architecture as well b/c they force server to keep state. client side is way to go for pagination. server pagination just doesn't scale well"

Jordan also mentioned that "you run into trouble with a scale out system like cockroach - if your load balancer moves you to a different server for example the cursor will be lost"

Andy also mentioned that "sql server has cursors, and we spent a lot of time recommending customers not use them"

This way we can make it clear that client-side is the way to go and it's not just because we don't have cursors.

@awoods187 (Contributor, Author) commented Sep 14, 2018

I sent the recommendations to K and he replied:

"Hi, terrific answer Andy. Thanks for spending time on looking this up and we will add it to our general guidelines/knowledge base for CRDB.

What devs usually do (I think) is to use what Spring Data JPA or Spring Data JDBC provides. It has a simple pagination API which does basic limit/offset traversal by default, but you can create your own custom queries."

Let's make sure to recommend the right course of action here.

@awoods187 (Contributor, Author) commented:

Another customer needing updated docs on pagination:
cockroachdb/cockroach#30352 (comment)

@jseldess jseldess added this to the 2.1 milestone Sep 18, 2018
@knz (Contributor) commented Sep 18, 2018

Thanks Andy for collating things together. After reading this, I concur with Jordan's and Andy K's opinions. We could technically implement cursors, but PK values + LIMIT (and thus not OFFSET) are a better solution. This needs to be combined with AS OF SYSTEM TIME for reliable pagination.

I think the product is ready for clients to do effective pagination; however, I also think it's not actually trivial for clients to achieve, so we need ample guidance in the docs. This will be an interesting / worthy doc project.
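
A minimal sketch of combining the two, assuming the same hypothetical accounts table; every page reads from one fixed snapshot, so concurrent writes cannot shift rows between pages:

SELECT * FROM accounts
  AS OF SYSTEM TIME '2018-09-18 10:00:00'
  WHERE id > 4096
  ORDER BY id
  LIMIT 25;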

@wzrdtales (Contributor) commented:

With "client-side" pagination I assume you actually mean keyset pagination?
Since you also closed #1067, without catching important bits from it, it should be noted that with cockroachdb currently only quite ineffecient keyset pagination can be done (assuming utilizing UUIDs which pretty much everyone does today.) Should there be something new in cockroachdb I missed, please let me know.

However what could be a valuable feature for cockroach would be to have AFTER and BEFORE keyword which could be used instead of offset. Cockroachdb could optimize this way better, by using the index on the column and start retrieving records after that, than this very nasty subquery solution.
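
As a sketch, the proposal would look something like this (hypothetical syntax, not valid SQL today; names are placeholders):

SELECT * FROM t ORDER BY created_at DESC LIMIT 25 AFTER KEY id = ${item};
-- instead of:
SELECT * FROM t ORDER BY created_at DESC LIMIT 25 OFFSET ${n};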

@jordanlewis (Member) commented:

@wzrdtales, I don't understand what you mean. If you have a table with UUID primary key, which is I think what you're talking about, you should be able to select all of the rows that come after a given row with a > constraint. UUIDs still have order, after all. Are you having problems with that?
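
For instance, a minimal sketch with a hypothetical table t keyed on a UUID id column (the literal is a placeholder):

SELECT * FROM t
  WHERE id > '63616665-6630-3064-6465-616462656562'
  ORDER BY id
  LIMIT 25;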

@wzrdtales (Contributor) commented:

@jordanlewis Nope, they are ordered, but not, well, ordered in the order they're created.

@jordanlewis (Member) commented:

It's true that UUIDs aren't well-ordered by their creation order, but IMO that's entirely separate from the discussion we're having here, which is about the recommendation we give to users for how to paginate a table by the order of its primary key.

@wzrdtales (Contributor) commented:

Well, pagination is about a predictable outcome. That is basically the issue with LIMIT and OFFSET: they're not quite predictable. However, > with a UUID is even less predictable than LIMIT/OFFSET, since LIMIT/OFFSET at least have an order in that case.

@wzrdtales (Contributor) commented:

Also adding to this: it is especially unpredictable since new records can appear after and before the record you're comparing against, without any deterministic outcome. So yeah, maybe still a different discussion, but just adding this here since you closed the other issue.

@knz (Contributor) commented Sep 19, 2018 via email

@wzrdtales (Contributor) commented:

@knz I guess you meant me by Thomas? In that case: AS OF SYSTEM TIME was also discussed in the closed issue. It is not appropriate, since it is a snapshot of reality and not reality itself. Changes just disappear and new things do not appear. As said above, the best solution would be something like AFTER and BEFORE keywords that could be translated more easily by CockroachDB itself. Searching over outdated data is almost never an option, though.

@wzrdtales (Contributor) commented:

So the best option is still doing this: #1067 (comment), but with questionable performance. Which is sad, because keyset pagination usually tends to be faster than the alternatives, but not quite in this case.

@jseldess jseldess added the A-sql label Oct 3, 2018
@rmloveland rmloveland assigned rmloveland and unassigned lnhsingh Oct 22, 2018
@jseldess jseldess modified the milestones: 2.1, 2.2 Oct 29, 2018
@jseldess jseldess added O-external Origin: Issue comes from external users. O-docs-project and removed O-docs-project labels Oct 29, 2018
@rmloveland rmloveland added the P-1 High priority; must be done this release label Jan 16, 2019
@jseldess jseldess modified the milestones: 2.2, 19.1 Feb 25, 2019
@jseldess (Contributor) commented:

This isn't going to get done for 19.1. Moving to 19.2.

@jseldess (Contributor) commented Nov 7, 2019

@ericharmeling, @rmloveland, another pagination-related issue from a user, this time on our new community slack:

Hi all, nice to be here. Just a shoutout and a TIL (actually yesterday) - Ordering by timestamp and LIMIT/OFFSET may trip you up! There's probably good information about this somewhere, but we spent all day yesterday trying to find out why pagination was missing entries - inconsistently on different machines.

@nickjackson commented:

Just to leave a suggestion here again. The best case for a pagination implementation provided by CockroachDB, one that would create true value for your users, would be:

SELECT * FROM table WHERE "somethingDate" >= NOW() ORDER BY "createdAt" DESC LIMIT ${amount} AFTER KEY id = ${item}

This suggestion from @wzrdtales would be an extremely welcome addition.

@knz (Contributor) commented Nov 25, 2019

Huh, I don't understand why this can't be written as

WHERE "somethingDate" >= NOW() 
  AND id > ${item}
ORDER BY "createdAt" DESC LIMIT ${amount}

which is arguably simpler?

@wzrdtales (Contributor) commented Nov 25, 2019

@knz I don't mind explaining it again :)

Your query does not work with CockroachDB's official recommendation of using UUIDs. You can't sort by UUIDs (you can, but that would be stupid, since the result is random and unpredictable). Your suggestion would violate that recommendation, since you would be asking people to use serial integer increments (all already discussed before: #3743 (comment)). The suggested function eliminates this by stating instead:

Hey, in that sorted key createdAt (the key would consist of createdAt, id), please return me everything after the value xyz. The >= NOW() would just be an optimization, in case it yields a smaller dataset to scan (even in the indexed case).

@brendan-hall commented Dec 2, 2019

@wzrdtales What about:

SELECT * FROM table
  WHERE ((created_at, id) > (${time}, ${id}))
  ORDER BY (created_at, id)
  LIMIT ${amount}

This should result in the correct page, as tuples are ordered lexicographically, and we only care about the UUID order if the timestamps are equal. It's not important what that order is, just that it is an order.

The AFTER syntax is then syntactic sugar for WHERE ${order_tuple} {comparison} ${cursor_tuple}, with the comparison being either < or > depending on whether the ordering is ASC or DESC.
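
For example, under a descending ordering the sugar would desugar like this (the AFTER form is hypothetical; the WHERE form works today):

-- proposed: ... ORDER BY created_at DESC, id DESC LIMIT 25 AFTER KEY (created_at, id) = (${time}, ${id})
-- desugared, with DESC flipping the comparison to <:
SELECT * FROM table
  WHERE (created_at, id) < (${time}, ${id})
  ORDER BY created_at DESC, id DESC
  LIMIT 25;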

@wzrdtales (Contributor) commented Dec 2, 2019

Sorry, I am getting a bit tired of giving the same answer every time... So I will skip the explanation this time; it doesn't change the outcome and is non-deterministic.

@andy-kimball commented:

@wzrdtales, I've read your comments, and am still unclear on exactly what you're looking for. Are you trying to build a "feed", where you are guaranteed to always see the newest values, in the order they were committed?

Also, can you give a specific example where @brendan-hall's suggestion does not work? It would be very helpful to see a concrete set of values that would fail, as it would help me to understand exactly what you're looking to do.

@Bessonov commented Dec 4, 2019

@brendan-hall @andy-kimball

I see two problems.

The outcome of the query

SELECT * FROM table
  WHERE ((created_at, id) > (${time}, ${id}))
  ORDER BY (created_at, id)
  LIMIT ${amount}

is deterministic because of the ordering. But it's not useful if the id isn't monotonic, and UUIDs aren't monotonic. Imagine this situation (already ordered):

Time  | UUID
10:00 | 10
10:01 | 60
10:02 | 20
10:03 | 15

If you start with no page and ${amount} = 2, then your last page item is (10:01, 60). So, the next page would be:

SELECT * FROM table
  WHERE ((created_at, id) > (10:01, 60))
  ORDER BY (created_at, id)
  LIMIT 2

But you get nothing, because the id of the third and fourth items is less than 60.

The second problem with the query arises when you have a situation like this:

Time  | UUID
10:00 | 10
10:01 | 50
10:01 | 60
10:02 | 20
10:03 | 15

With a page size of two, you see (10:01, 50) but never (10:01, 60).

@wzrdtales (Contributor) commented:

Thank you @Bessonov, I am glad not to be arguing alone into an empty room...

@brendan-hall commented:

@Bessonov

Except that's not how < on tuples works.

Your claim is that (a, b) < (x, y) is implemented as (a < x) AND (b < y).

Instead (in pseudocode, comparing the time components first and tie-breaking on the ids), it works like so:

switch (compare(a, x)) {
  LT -> LT
  GT -> GT
  EQ -> compare(b, y)
}

Assuming tuples of the form (time, id), i.e. a and x are times, b and y are ids.


Select First Page

WITH t(time,id) as (VALUES (10.00, 10), (10.01, 60), (10.02, 20), (10.03, 15))
SELECT * FROM t
  ORDER BY (time, id)
  LIMIT 2
time id
10.00 10
10.01 60

Select Second Page (after (10.01, 60))

WITH t(time,id) as (VALUES (10.00, 10), (10.01, 60), (10.02, 20), (10.03, 15))
SELECT * FROM t
  WHERE (time, id) > (10.01, 60) 
  ORDER BY (time, id)
  LIMIT 2
time id
10.02 20
10.03 15

Select Second Page with both (10.01, 50) and (10.01, 60) (after (10.01, 50))

WITH t(time,id) as (VALUES (10.00, 10), (10.01, 60), (10.01, 50), (10.02, 20), (10.03, 15))
SELECT * FROM t
  WHERE (time, id) > (10.01, 50) 
  ORDER BY (time, id)
  LIMIT 2
time id
10.01 60
10.02 20

Re: usefulness, my argument is that if your created_at timestamps are equal, then you no longer care, nor have the ability to care, about the order in which the rows were inserted. So what does it matter that UUIDs aren't monotonic?
I'm just including ids to make sure that the sort order doesn't change in 5 minutes because a chunk of the underlying data got reversed.

If you absolutely need to know the order a row was inserted, then you're going to have to either implement your own higher resolution timestamps, or fall back on auto-incrementing ids or row_number()
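
A minimal sketch of the auto-incrementing fallback, using CockroachDB's unique_rowid() (values are roughly time-ordered but, as noted below, not in strict commit order; names are placeholders):

CREATE TABLE t2 (id INT8 PRIMARY KEY DEFAULT unique_rowid(), payload STRING);

SELECT * FROM t2
  WHERE id > ${last_id}
  ORDER BY id
  LIMIT ${amount};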

@andy-kimball commented:

Another related thing to understand is that timestamps aren't guaranteed to be in strict commit order. The now() function is evaluated at the beginning of a transaction. Imagine you have two concurrent transactions both running this query:

INSERT INTO t (id, time) VALUES (gen_random_uuid(), now())

It's possible that transaction B commits first:

id time
id2 10.02

and a split second later transaction A commits:

id time
id1 10.01

Therefore, if you're trying to use timestamps to implement a "feed", this behavior will cause you to miss items. This is not specific to CRDB. Every database I know of would behave this way for current time and auto-incrementing ids as well. Auto-incrementing ids also would not follow strict commit order in the case of concurrent transactions.

@andy-kimball commented:

Assuming that concurrently committed transactions are not a concern for your particular application, here is another trick I've used in the past to further simplify pagination:

CREATE TABLE t (created_at TIMESTAMPTZ PRIMARY KEY);
INSERT INTO t (created_at) VALUES (now());

I put a retry loop around the INSERT statement to retry cases where concurrent transactions insert the same timestamp (because they're inserted in the same microsecond). Having a guarantee of no duplicates allows this simplified pagination pattern:

SELECT * FROM t
  WHERE created_at > ${time}
  ORDER BY created_at
  LIMIT ${amount}
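
A minimal sketch of that retry loop, assuming each attempt runs as its own transaction so that now() is re-evaluated; ON CONFLICT ... DO NOTHING is one way (a variation, not necessarily Andy's exact approach) to make the duplicate case a retriable no-op instead of an error:

-- Loop in the client until the INSERT reports 1 row affected;
-- each attempt is a new transaction and gets a fresh now().
INSERT INTO t (created_at) VALUES (now())
  ON CONFLICT (created_at) DO NOTHING;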

One big caveat: using timestamp columns indexed by the current time creates "hot spots" in your database, because all INSERT transactions will write to the same machine repeatedly (i.e. whatever machine contains the page containing the highest timestamp value). This can have the effect of preventing scale-out of your database beyond a single machine if you try to do this with a high-volume INSERT transaction that is a bottleneck in your application. There are techniques to address this problem, such as using hybrid hash/range partitioning, but it does tend to get complex. We'll be adding support to upcoming versions of CRDB to make this easier.
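
A hypothetical sketch of the manual version of that technique: prefix the primary key with a small, client-chosen hash bucket so inserts fan out across ranges, at the cost of merging the buckets back together on read (all names are placeholders):

CREATE TABLE events (
  bucket INT8 NOT NULL,        -- client-chosen shard, e.g. hash of the row's id % 8
  created_at TIMESTAMPTZ NOT NULL,
  id UUID NOT NULL DEFAULT gen_random_uuid(),
  PRIMARY KEY (bucket, created_at, id)
);

-- Reads fan back in across all buckets:
SELECT * FROM events
  WHERE bucket IN (0, 1, 2, 3, 4, 5, 6, 7)
    AND created_at > ${time}
  ORDER BY created_at
  LIMIT ${amount};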

rmloveland added a commit that referenced this issue Dec 5, 2019
Summary of changes:

- Explain difference between keyset pagination and LIMIT/OFFSET
- Show examples of the former being fast and the latter being slow
- Show how to use EXPLAIN to check why the difference exists
- Add warning to LIMIT/OFFSET docs recommending keyset pagination

Fixes #3743
rmloveland added a commit that referenced this issue Dec 5, 2019
Summary of changes:

- Explain difference between keyset pagination and LIMIT/OFFSET
- Show examples of the former being fast and the latter being slow
- Show how to use EXPLAIN to check why the difference exists
- Add warning to LIMIT/OFFSET docs recommending keyset pagination
- ... all of the above for 19.1, 19.2, 20.1 docs

Fixes #3743
@awoods187 (Contributor, Author) commented:

Hey all - we have a PR in progress to address the concerns here: #6114. Let us know if you think it is missing something important!

rmloveland added a commit that referenced this issue Dec 12, 2019
rmloveland added a commit that referenced this issue Dec 18, 2019
rmloveland added a commit that referenced this issue Jan 9, 2020
@wzrdtales (Contributor) commented:

A bit late as a comment, but @awoods187, this didn't touch the topic at all. This is about the problems UUIDs create for pagination, which these docs don't touch at all and for which they don't present a solution to the user. Even the tuples mentioned here, which also don't solve it completely, were not mentioned there.

By now I have gone a different route for critical tables. The database is no longer trusted as a source of truth for such assets, since the implementation falls short at too many corners. I implemented logic that spans services for each domain that needs absolutely time-critical sorting. Everything else, less important, will if in doubt go by the timestamp and lose precision.

@andy-kimball commented:

Sorry to hear that the recommendations didn't meet your needs, @wzrdtales. Are you aware of any other database that solves this problem in a way that meets your needs? Do you have any links to blogs, doc pages, or other resources showing what you want and how some other system solves it in a way that works for you?

The only examples I've seen were outlined by @Bessonov above, and @brendan-hall pointed out why those actually work fine. I have yet to see examples of cases where @brendan-hall's method doesn't "solve it completely", except in cases where the paging is racing with newly committed values, as I explained above.

Separately, we can look into adding documentation that describes the best keyset pagination pattern to use for current timestamp columns (like created_at type columns). I agree that the docs don't seem to cover that.
