Fix compile time verification performance regression for sqlite #1946

liningpan · 2022-07-02T20:57:51Z

The sqlite CTE feature (#1816) introduced severe performance degradation, and this PR attempts to fix it.

This patch drastically reduces the search space by adding memoization based on register type and cursor data type. Pushing new states onto the stack is guarded by a HashSet: if no new information has been gained about an instruction (i.e. its register/cursor data types match a state that was already seen), the instruction is not searched again.
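The guard described above can be illustrated with a minimal sketch. This is not the code from the patch: `DataType`, `Op`, `State`, `push_if_new`, and `explore` are all hypothetical names, and the toy instruction set only stands in for the SQLite bytecode that the real analyzer walks. It assumes the state key is the program counter plus the register types (cursors are left out for brevity).

```rust
use std::collections::{BTreeMap, HashSet};

/// Hypothetical, simplified data types (the real analyzer tracks more).
#[derive(Clone, Debug, PartialEq, Eq, Hash)]
pub enum DataType {
    Int,
    Text,
}

/// Hypothetical toy instruction set standing in for SQLite bytecode.
pub enum Op {
    /// Store a value of the given type in a register, then fall through.
    Set(usize, DataType),
    /// Conditional branch: both the target and the fall-through are explored.
    Branch(usize),
    /// Unconditional jump.
    Goto(usize),
    /// End of program: record the register types reached here.
    Halt,
}

/// Memoization key: program counter plus register types.
#[derive(Clone, PartialEq, Eq, Hash)]
struct State {
    pc: usize,
    registers: BTreeMap<usize, DataType>,
}

/// Push `state` only if this exact (pc, register types) pair has not been seen.
fn push_if_new(stack: &mut Vec<State>, seen: &mut HashSet<State>, state: State) {
    if seen.insert(state.clone()) {
        stack.push(state);
    }
}

/// Returns the number of states explored and the distinct final register typings.
pub fn explore(program: &[Op]) -> (usize, HashSet<BTreeMap<usize, DataType>>) {
    let mut stack = Vec::new();
    let mut seen = HashSet::new();
    let mut results = HashSet::new();
    let mut explored = 0;

    push_if_new(&mut stack, &mut seen, State { pc: 0, registers: BTreeMap::new() });

    while let Some(mut state) = stack.pop() {
        explored += 1;
        match program.get(state.pc) {
            Some(Op::Set(reg, ty)) => {
                state.registers.insert(*reg, ty.clone());
                state.pc += 1;
                push_if_new(&mut stack, &mut seen, state);
            }
            Some(Op::Branch(target)) => {
                let mut taken = state.clone();
                taken.pc = *target;
                state.pc += 1;
                push_if_new(&mut stack, &mut seen, taken);
                push_if_new(&mut stack, &mut seen, state);
            }
            Some(Op::Goto(target)) => {
                state.pc = *target;
                push_if_new(&mut stack, &mut seen, state);
            }
            // Halt, or running off the end of the program, ends this path.
            Some(Op::Halt) | None => {
                results.insert(state.registers);
            }
        }
    }
    (explored, results)
}
```

In this sketch, a program that loops back to an earlier instruction terminates because the second trip around the loop produces a state that is already in `seen`. Without the guard, the same loop would be pushed onto the stack indefinitely.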

Now the example query in #1921 takes only 5s to verify instead of several minutes. It might be possible to optimize this further by no longer tracking unnecessary information, which could save a lot of memory allocations.

abonander · 2022-07-12T21:00:46Z

Do you mind looking at the failing CI checks?

Update to 0.6.1 isn't possible right now due to massive increase in compile time and memory usage. The core build uses 30GB after a few minutes! With 0.5.13 the build takes less than a minute and < 2GB. See launchbadge/sqlx#1921 Possible fix with launchbadge/sqlx#1946

abonander · 2022-09-03T00:47:15Z

@liningpan do you have time to address the CI failures I mentioned before? The logs are gone now but if you amend your commit to update the SHA and push it to re-run the workflows we'll get new logs.

liningpan · 2022-09-03T00:57:37Z

I initially thought CI failed at formatting, and I believe I fixed that. I was waiting for you to trigger CI.

abonander · 2022-09-03T01:03:06Z

Yeah, sorry. It's easy to get sidetracked and forget about PRs in the queue. I don't even have the "Approve and Run" button anymore, though; you'll need to update the branch for it to come back, I think. A simple rebase should do it.

liningpan · 2022-09-03T02:27:25Z

Although the CI pipeline appears to be passing, I'm not 100% certain about the correctness of this patch or about the additional memory consumed by the branch state hash table. That being said, it should still use far less memory than what was previously stored in the query history structure.

I would also like to hear the original author's (sqlite CTE #1816) opinion @tyrelr.

tyrelr · 2022-09-06T14:07:49Z

Overall, I think this is a good approach and it worked when I tested it. It is a bit unfortunate to need a 3rd way of storing column-states. But there is no simple way to avoid that due to HashMaps not implementing Hash (with good reason).
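To illustrate the point about `HashMap` not implementing `Hash`, here is a minimal sketch of one common workaround: copying the map into a `BTreeMap`, which does implement `Hash` because its iteration order is deterministic. This is an assumption for illustration only and not necessarily how the patch stores its state; `ColumnType`, `snapshot`, and `is_new` are hypothetical names.

```rust
use std::collections::{BTreeMap, HashMap, HashSet};

/// Hypothetical stand-in for the per-column type information.
#[derive(Clone, Debug, PartialEq, Eq, Hash)]
pub enum ColumnType {
    Int,
    Text,
}

/// Copy a HashMap into a BTreeMap so it can be used as a hashable key.
/// HashMap has no Hash impl because its iteration order is unspecified.
pub fn snapshot(columns: &HashMap<i64, ColumnType>) -> BTreeMap<i64, ColumnType> {
    columns.iter().map(|(k, v)| (*k, v.clone())).collect()
}

/// Returns true if this set of column states has not been recorded before.
pub fn is_new(
    seen: &mut HashSet<BTreeMap<i64, ColumnType>>,
    columns: &HashMap<i64, ColumnType>,
) -> bool {
    seen.insert(snapshot(columns))
}
```

The cost of this approach is the extra copy on every check, which is the kind of additional representation the comment above calls unfortunate.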

Based on reading the code, the only piece of information that seems to be discarded is which register is pointed at by a PseudoCursor. That should be almost impossible to cause an issue in practice. Literally everything in the state except that pseudocursor would need to be identical to cause a mistaken short-circuit. Then, for that mistaken short-circuit to matter, the two nearly identical states would need to result in significantly different output data types. Such a query plan would be valid, but would be unlike any other query plan I've seen sqlite generate.

abonander · 2022-09-13T01:09:55Z

@liningpan @tyrelr is it possible for this PR to cause type/null inference changes?

tyrelr · 2022-09-14T02:03:18Z

In practice, I expect that a different result would never actually happen. The issue above requires a very atypical query plan, and I expect sqlite would never generate such a plan.

But theoretically, this could change type/null inference if sqlite DID happen to generate that weird query plan.

abonander · 2022-09-15T01:12:53Z

For safety, I've earmarked this for 0.7.0, which I've started working on.

* add instruction, register, and cursor state memorization * fix: fixed formating

liningpan force-pushed the fix-sqlite-explain-performance branch from 4a06161 to fbc796b Compare August 1, 2022 17:22

This was referenced Sep 3, 2022

Query with RETURNING clause is extremely slow to compile #2083

Closed

Regression in v0.6.0 when using an INSERT ... RETURNING clause with SQLite #1921

Closed

liningpan added 2 commits September 2, 2022 22:09

add instruction, register, and cursor state memorization

3d8feb1

fix: fixed formating

92289c0

liningpan force-pushed the fix-sqlite-explain-performance branch from fbc796b to 92289c0 Compare September 3, 2022 02:13

abonander mentioned this pull request Sep 13, 2022

Sqlite explain plan log efficiency #2091

Merged

abonander changed the base branch from main to 0.7-dev September 15, 2022 01:12

abonander merged commit 1379eb6 into launchbadge:0.7-dev Sep 15, 2022

cycraig mentioned this pull request Sep 18, 2022

Fix sqlite compilation #2098

Merged

abonander pushed a commit that referenced this pull request Feb 18, 2023

Fix compile time verification performance regression for sqlite (#1946)

6cdc784

* add instruction, register, and cursor state memorization * fix: fixed formating

abonander pushed a commit that referenced this pull request Feb 21, 2023

Fix compile time verification performance regression for sqlite (#1946)

75a3495

* add instruction, register, and cursor state memorization * fix: fixed formating

Aandreba pushed a commit to Aandreba/sqlx that referenced this pull request Mar 31, 2023

Fix compile time verification performance regression for sqlite (laun…

c4f57a5

…chbadge#1946) * add instruction, register, and cursor state memorization * fix: fixed formating

raviqqe mentioned this pull request Aug 25, 2023

Update dependencies raviqqe/sqlx#1

Closed
