feat: use gen_random_uuid() for CockroachDB INSERTs #807

alnr · 2023-01-20T23:47:19Z

Using the gen_random_uuid() builtin can significantly speed up inserts because the primary key uniqueness constraint check is elided.

I've also fixed up code where there was a prepared statement being created for just one query, which is super inefficient.

I'm kinda having trouble running the tests locally, so I'd appreciate a workflow approval too. @aeneasr @sio4 👍

aeneasr · 2023-01-23T19:20:30Z

Approval given :)

You can run the tests locally using ./test.sh which executes tests against all DBs, or using:

go build -v -tags sqlite -o tsoda ./soda

SODA_DIALECT=cockroach

./tsoda drop -e $SODA_DIALECT -c ./database.yml -p ./testdata/migrations
./tsoda create -e $SODA_DIALECT -c ./database.yml -p ./testdata/migrations
./tsoda migrate -e $SODA_DIALECT -c ./database.yml -p ./testdata/migrations

go test -cover -race -tags sqlite -count=1 $args ./...

aeneasr

This looks very nice already! Do we have a test showing that the ID is generated correctly for Cockroach? I do believe that such tests exist, but it would be good to have proof that we don't introduce regressions :)

aeneasr · 2023-01-23T19:27:25Z

dialect_cockroach.go

 		}
 		txlog(logging.SQL, c, query, model.Value)
-		stmt, err := c.Store.PrepareNamed(query)
+		rows, err := c.Store.NamedQueryContext(model.ctx, query, model.Value)


Good point, it makes no sense to use prepared statements if they're not cached.

alnr · 2023-01-23T22:10:36Z

A couple of tests were not idempotent. I think I've fixed them all.
Fixed missing Close on sql.Rows
Added more explicit checks about UUID generation to test

ready for review :)

sio4

Hi @alnr,
Thank you for your PR.

Before reviewing the main topic of this PR, please remove changes regarding testing configurations (database.yml and docker-compose.yml). I don't think they are parts of the topic of this PR. Changing the supported database version should have more consideration around other conditions and may not be a part of this feature PR.

Please keep PR as simple as possible with a single point of interest.

database.yml

docker-compose.yml

sio4

LGTM!

sio4 · 2023-01-24T16:12:07Z

Hi @alnr,

I am just wondering. What could be the cons and pros of using gen_random_uuid() for generating a random UUID ID on the server side?

Personally, I prefer to use application-side actions (spending the application server's CPU) rather than offloading CPU tasks to the database side since the application is more flexible to spread out (scaling) than controlling database load unless the action should be taken on the database side for integrity or similar things. Also for the main topic of this PR, it is easier to maintain the code if we use the same generic method in the generic dialect.

Could you please explain your idea and/or considerations?

significantly speed up inserts because the primary key uniqueness constraint check is elided

Do you have any data regarding this? Having the real or test data here would be great!

alnr · 2023-01-24T16:27:14Z

I am just wondering. What could be the cons and pros of using gen_random_uuid() for generating a random UUID ID on the server side?

Pros:

primary key uniqueness constraint check will be elided on insert. This makes a huge difference in CockroachDB multiregion databases. If you don't use gen_random_uuid(), the uniqueness constraint will be verified prior to insert on all regional shards, which requires contacting them. Across continents, that's multiple hundred milliseconds. I've confirmed this with a US-EU-APAC cluster on a hosted enterprise Cockroach cluster.

Cons:

can't think of any. Generating a UUID is free. Doesn't matter if the app or the DB does it.

Personally, I prefer to use application-side actions (spending the application server's CPU) rather than offloading CPU tasks to the database side since the application is more flexible to spread out (scaling) than controlling database load unless the action should be taken on the database side for integrity or similar things. Also for the main topic of this PR, it is easier to maintain the code if we use the same generic method in the generic dialect.

After this change, we burn a lot fewer CPU cycles anyway, because the DB does not have to verify the uniqueness constraint on the primary key. Checking that constraint takes approximately infinity times more time than generating a UUID.

alnr · 2023-01-24T16:29:44Z

Not sure what's going on with that failing test 🤷

sio4 · 2023-01-24T16:50:46Z

Not sure what's going on with that failing test shrug

Don't worry. The short test on the Windows machine is failing many times, and I don't think it is related to your PR. We need to make it better but actually, the Windows environment is too hard. :-)

For the main topic, I quickly researched the Cockroach's resources and I found the followings:

It seems like we can take another approach for this, not for right now but for later:

improve the generic routine can allow empty UUID id value (with a new option)
improve fizz to create a better CREATE statement with the default gen_random_uuid() (or manually?)

I think this approach will make Pop simple without losing the database side performance. Any ideas or suggestions will help us greatly!

Checking that constraint takes approximately infinity times more time than generating a UUID.

Yeah, could be in a global cluster across the universe 😆

alnr · 2023-01-24T17:03:22Z

Thanks for merging!

Honestly I would keep the code as-is for now. Not everyone uses fizz to create schemas. Regarding the empty UUID value: I don't think allowing null UUIDs makes sense for the ID (primary key) column. It makes a lot of sense for other columns, of course, but that's not where this optimization is most useful anyway.

sio4 · 2023-01-26T12:36:49Z

Honestly I would keep the code as-is for now. Not everyone uses fizz to create schemas. Regarding the empty UUID value: I don't think allowing null UUIDs makes sense for the ID (primary key) column.

Yes, for now. and...
Maybe my explanation is not enough. I don't mean ID can be null UUID. Currently, we remove an integer ID field from the model when executing Create() because we assume auto-increment is enabled on the database side. With the same concept, if we know (with option or anything) the database table was created with a UUID key with DEFAULT like below (example taken from the cockroach's document), we can just remove the ID field so the database will handle it.

CREATE TABLE movr.max_schema.vehicles (
      id UUID DEFAULT gen_random_uuid() PRIMARY KEY,
      type movr.max_schema.vtype,
      creation_time TIMESTAMPTZ,
      available BOOL,
      last_location STRING
  );

The function or method to create an auto key can be different by database engines, but we can make the Create() routine the same.

There are also related issues (opposite actually) for the integer ID that cannot be preserved when if the user sets ID explicitly and I think those issues can be considered together with it. (#382, #586)

feat: use gen_random_uuid() for CockroachDB INSERTs

89b6848

alnr mentioned this pull request Jan 23, 2023

feat: Let the database generate UUIDs on insert ory/kratos#3042

Merged

aeneasr reviewed Jan 23, 2023

View reviewed changes

alnr force-pushed the cockroach-uuid branch 4 times, most recently from cf82f4b to 4d9ddfc Compare January 23, 2023 22:08

alnr marked this pull request as ready for review January 23, 2023 22:10

sio4 requested changes Jan 24, 2023

View reviewed changes

database.yml Outdated Show resolved Hide resolved

docker-compose.yml Outdated Show resolved Hide resolved

sio4 self-assigned this Jan 24, 2023

sio4 added the enhancement New feature or request label Jan 24, 2023

alnr force-pushed the cockroach-uuid branch from 4d9ddfc to 52e8c2e Compare January 24, 2023 08:44

feat: don't create single-use prepared statements

cb7479c

alnr force-pushed the cockroach-uuid branch from 52e8c2e to cb7479c Compare January 24, 2023 09:00

alnr requested a review from sio4 January 24, 2023 13:31

sio4 approved these changes Jan 24, 2023

View reviewed changes

sio4 self-requested a review January 24, 2023 16:17

sio4 approved these changes Jan 24, 2023

View reviewed changes

sio4 added this to the v6.2.0 milestone Jan 24, 2023

sio4 merged commit ec9229d into gobuffalo:main Jan 24, 2023

alnr mentioned this pull request Jan 24, 2023

fix: deflake test #809

Closed

alnr mentioned this pull request May 2, 2023

chore: bump deps ory/hydra#3506

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: use gen_random_uuid() for CockroachDB INSERTs #807

feat: use gen_random_uuid() for CockroachDB INSERTs #807

alnr commented Jan 20, 2023 •

edited

Loading

aeneasr commented Jan 23, 2023

aeneasr left a comment

aeneasr Jan 23, 2023

alnr commented Jan 23, 2023

sio4 left a comment

sio4 left a comment

sio4 commented Jan 24, 2023 •

edited

Loading

alnr commented Jan 24, 2023

alnr commented Jan 24, 2023

sio4 commented Jan 24, 2023

alnr commented Jan 24, 2023

sio4 commented Jan 26, 2023

feat: use gen_random_uuid() for CockroachDB INSERTs #807

feat: use gen_random_uuid() for CockroachDB INSERTs #807

Conversation

alnr commented Jan 20, 2023 • edited Loading

aeneasr commented Jan 23, 2023

aeneasr left a comment

Choose a reason for hiding this comment

aeneasr Jan 23, 2023

Choose a reason for hiding this comment

alnr commented Jan 23, 2023

sio4 left a comment

Choose a reason for hiding this comment

sio4 left a comment

Choose a reason for hiding this comment

sio4 commented Jan 24, 2023 • edited Loading

alnr commented Jan 24, 2023

alnr commented Jan 24, 2023

sio4 commented Jan 24, 2023

alnr commented Jan 24, 2023

sio4 commented Jan 26, 2023

alnr commented Jan 20, 2023 •

edited

Loading

sio4 commented Jan 24, 2023 •

edited

Loading