sql: add ability to get a hash value from a logical plan #63885

Azhng · 2021-04-19T21:50:45Z

Currently, @cockroachdb/sql-observability are working towards adding the ability to compare historical query plans for a given statement.

We want to be able to quickly tell if the plan for a given statement has changed in a specified interval. Ideally, we should be able to do this without comparing the entire plan structure.

Since we already anonymize statements by removing their constants, two plans with the same structure but different scan constraint values should produce the same hash.

Epic: CRDB-8631

Azhng · 2021-04-21T20:31:32Z

cc: @RaduBerinde

rytaft · 2021-04-27T18:42:22Z

cc @kevin-v-ngo @awoods187 to decide if needed for 21.2

kevin-v-ngo · 2021-07-07T18:44:15Z

This would be helpful once we have persisted stats: #64743. If we surface a hash value, users can see, for a given statement fingerprint, how many times the plan has changed over time and what impact each change had on the execution statistics.

Azhng · 2021-07-07T19:10:36Z

I think one important thing about this is that the hash should only be derived from the execution plan and should be unaffected by other unrelated attributes in the logical plan.

For example, crdb_internal.node_statement_statistics currently exposes a sample_plan column:

root@127.0.0.1:26257/movr> select key, sample_plan from crdb_internal.node_statement_statistics where application_name = 'whack';
                                                key                                               |                                                                                                                      sample_plan
--------------------------------------------------------------------------------------------------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
  SELECT * FROM users                                                                             | {"Children": [], "Missing Stats": "", "Name": "scan", "Spans": "FULL SCAN", "Table": "users@primary"}
  SELECT * FROM users WHERE name = _                                                              | {"Children": [{"Children": [], "Estimated Row Count": "50 (100% of the table; stats collected 52 seconds ago)", "Name": "scan", "Spans": "FULL SCAN", "Table": "users@primary"}], "Estimated Row Count": "0", "Filter": "name = _", "Name": "filter"}
  SELECT key, sample_plan FROM crdb_internal.node_statement_statistics WHERE application_name = _ | {"Children": [{"Children": [], "Name": "virtual table", "Table": "node_statement_statistics@primary"}], "Filter": "application_name = _", "Name": "filter"}
  SHOW database                                                                                   | {"Children": [{"Children": [], "Name": "virtual table", "Table": "session_variables@primary"}], "Filter": "variable = _", "Name": "filter"}
(4 rows)

We can see that sample_plan includes attributes such as "Missing Stats" and "Estimated Row Count". If we naively hashed the JSON representation of the plan, the same plan would produce different hashes whenever the table statistics changed, which is not desirable.
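A minimal sketch of this idea in Python, assuming the only statistics-dependent attributes are the two named above ("Missing Stats" and "Estimated Row Count"). The helper names strip_volatile and plan_hash are hypothetical and not part of CockroachDB; the sample plans are taken from the sample_plan output shown earlier.

```python
import hashlib
import json

# Assumption: these attributes vary with table statistics, not with plan shape.
VOLATILE_KEYS = {"Missing Stats", "Estimated Row Count"}


def strip_volatile(node):
    """Return a copy of a plan node without statistics-dependent attributes."""
    return {
        key: [strip_volatile(child) for child in value] if key == "Children" else value
        for key, value in node.items()
        if key not in VOLATILE_KEYS
    }


def plan_hash(plan_json):
    """Hash a canonical JSON encoding of the stripped plan."""
    stripped = strip_volatile(json.loads(plan_json))
    canonical = json.dumps(stripped, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


# The same full scan of users@primary, sampled before and after stats collection.
without_stats = (
    '{"Children": [], "Missing Stats": "", "Name": "scan", '
    '"Spans": "FULL SCAN", "Table": "users@primary"}'
)
with_stats = (
    '{"Children": [], "Estimated Row Count": '
    '"50 (100% of the table; stats collected 52 seconds ago)", '
    '"Name": "scan", "Spans": "FULL SCAN", "Table": "users@primary"}'
)

same_plan = plan_hash(without_stats) == plan_hash(with_stats)
print(same_plan)
```

Sorting keys and fixing the separators keeps the encoding stable, so the hash depends only on the attributes that remain after stripping.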

kevin-v-ngo · 2021-07-15T02:41:17Z

Agree @Azhng! We'd need a notion of a 'plan fingerprint' (ignoring insignificant plan attributes) on which the hash is based.

Implement a plan "gist" serializer piggybacking on the exec gen/explain factory infrastructure so that we can always know what the logical plan was and can do historical and statistical tracking. Logically it's like an EXPLAIN (SHAPE) but is even more stripped down. A gist is a sequence of bytes representing the flattened tree of operators and various operator-specific metadata.

The goal is to record every logical plan we use for every query, so that we have historical data on which plans are used, possibly linked up to statistics so we know which stats go with which logical plan.

Also implement a decoder to turn the serialized plan back into a tree of explain.Node's that can be displayed using existing explain code.

Currently this functionality is only exposed via a new EXPLAIN mode and via a crdb_internal "decoder" SRF. EXPLAIN (GIST) takes a query and returns a single string which is the encoded gist. crdb_internal.decode_plan_gist() takes an encoded gist string and writes out the logical plan one row per line.

For performance numbers of the ExecBuild comparing a StubFactory to a PlanGistFactory wrapped around a StubFactory, see the PR.

Release note (sql change): Record compressed plan gist for all queries.

For example, a query like this:

SELECT * FROM abc UNION SELECT * FROM abc ORDER BY b,a

produces the following plan according to EXPLAIN (SHAPE):

• distinct
│ distinct on: a
│
└── • union all
    │
    ├── • sort
    │   │ order: +b,+a
    │   │
    │   └── • scan
    │         missing stats
    │         table: abc@primary
    │         spans: FULL SCAN
    │
    └── • sort
        │ order: +b,+a
        │
        └── • scan
              missing stats
              table: abc@primary
              spans: FULL SCAN

and produces the following "gist":

AgFuAgAHAAAAEQFuAgAHAAAAERANAAYGAA==

The "gist" can be turned back into the following plan:

• distinct
│ distinct on
│
└── • union all
    │
    ├── • sort
    │   │ order
    │   │
    │   └── • scan
    │         table: abc@primary
    │         spans: FULL SCAN
    │
    └── • sort
        │ order
        │
        └── • scan
              table: abc@primary
              spans: FULL SCAN

Fixes: cockroachdb#63885
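As a rough illustration of how a gist could serve the original request (a hash that reveals plan changes), here is a sketch in Python. It assumes the encoded gist string is standard base64, which the text above does not state; it only calls it an encoded string. The functions gist_bytes, gist_hash, and count_plan_changes are hypothetical helpers, not CockroachDB APIs, and other_gist is a made-up placeholder rather than a real gist.

```python
import base64
import hashlib


def gist_bytes(gist):
    """Decode an encoded gist string. Assumption: the encoding is standard base64."""
    return base64.b64decode(gist, validate=True)


def gist_hash(gist):
    """Derive a fixed-size hash from the decoded gist bytes."""
    return hashlib.sha256(gist_bytes(gist)).hexdigest()


def count_plan_changes(gists):
    """Count how often consecutive samples of one statement switched plans."""
    hashes = [gist_hash(g) for g in gists]
    return sum(1 for prev, cur in zip(hashes, hashes[1:]) if prev != cur)


# Gist from the example above.
example_gist = "AgFuAgAHAAAAEQFuAgAHAAAAERANAAYGAA=="
# Placeholder standing in for a different plan; not a real gist.
other_gist = base64.b64encode(b"\x01\x02").decode("ascii")

# Hypothetical history of gists recorded for one statement fingerprint.
history = [example_gist, example_gist, other_gist, example_gist]
changes = count_plan_changes(history)
print(changes)
```

Because the decoded plan above no longer carries "missing stats" or the concrete ordering columns, comparing gists (or their hashes) sidesteps the statistics-driven variation discussed earlier in the thread.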


69293: sql: implement a fast compressed logical plan mechanism r=rtaft,RaduBerinde a=cucaroach

Fixes: #63885

Co-authored-by: Tommy Reilly <treilly@cockroachlabs.com>

Azhng added the C-enhancement label (Solution expected to add code/behavior + preserve backward-compat; pg compat issues are exception) Apr 19, 2021

jlinder added the T-sql-queries label (SQL Queries Team) Jun 16, 2021

Azhng mentioned this issue Jul 5, 2021

Surface the sampled query plan through the SQL CLI #65013

Closed

blathers-crl bot added the T-sql-observability label Jul 7, 2021

kevin-v-ngo removed the T-sql-observability label Jul 12, 2021

kevin-v-ngo mentioned this issue Jul 12, 2021

Query plan changes can be easily detected through our internal tables and the SQL CLI #67500

Closed

jordanlewis assigned cucaroach Jul 19, 2021

Azhng mentioned this issue Aug 28, 2021

sql: persistent SQL Stats main tracking issue #64743

Closed

24 tasks

cucaroach mentioned this issue Aug 31, 2021

sql: implement a fast compressed logical plan mechanism #69293

Merged

craig bot closed this as completed in 8bf5d1c Oct 27, 2021
