Create data cube index tables in a schema. #519

jheer · 2024-09-13T17:11:52Z

This PR redesigns the data cube indexer to write index tables into a named schema (default 'mosaic'). This allows data cube index tables to be managed in a largely isolated environment and to be reused across users and sessions. Temporary index tables are no longer supported, as temporary tables can not be created within a named schema.

Breaking: Remove support for temporary tables for data cube index tables.
Breaking: Remove data cube indexer enabled method, instead use get/set properties.
Add data cube indexer schema get/set properties.
Add data cube indexer dropIndexTables() method. This method issues a query to remove the entire data cube index table schema. It should be called if base tables are updated, causing index tables to become stale and inaccurate.
Update Jupyter widget, remove temp_indexes property and add dataCubeSchema property.
Update types, method signatures in Coordinator and DataCubeIndexer.
Update Coordinator API documentation.

jheer · 2024-09-13T17:15:06Z

@domoritz Can you please spot check the changes to the Jupyter widget? Is the model state initialization (setting the default data cube index table schema) correct? Also, in this context should we prefer camelCase (dataCubeSchema) or underscore_case (data_cube_schema) for model property names?

domoritz · 2024-09-14T22:40:15Z

It wasn't quite correct. The idea with temp_indexes was that you can set whether indexes should be created as temp indexes or not from the python side.

I updated the code to let you get and set the index schema from the widget. We should prefer data_cube_schema since the variable is accessible from Python (where users interact with it).

domoritz

Looks good. I think the idea schema functionality in the python widget is for advanced use cases. Right now, we don't automatically call dropIndexTables which means indexes will not be cleaned up when the user switches where indexes are created. I think that makes sense as default behavior.

willium · 2024-09-18T18:07:54Z

Exciting to see dropIndexTables()! thanks for adding that.

I believe the most straightforward way for MotherDuck to know that you want some table to live in the browser/WASM (rather than upstream in the warehouse) is by specifying it as a TEMP table. The alternative is mounting a local db. I wonder if this approach will have performance implications when MotherDuck is the server.

jheer · 2024-09-18T18:37:21Z

We decided to move away from temporary tables in part to also support persistence of index cubes across multiples users and sessions. So that is another aspect to weigh among the trade-offs and design space here.

feat!: Create data cube index tables in a schema.

282824c

jheer requested a review from domoritz September 13, 2024 17:13

Merge branch 'main' into jh/data-cube-schema

a111d8f

This was referenced Sep 13, 2024

Automatic Time Binning Fails for Large Time Intervals #484

Closed

feat: parallel queries #475

Merged

rename to data_cube_schema and implement corresponding python part

8310c47

domoritz approved these changes Sep 14, 2024

View reviewed changes

docs: Fix docs typo.

781f69f

jheer merged commit 09d292c into main Sep 15, 2024
3 checks passed

jheer deleted the jh/data-cube-schema branch September 15, 2024 14:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create data cube index tables in a schema. #519

Create data cube index tables in a schema. #519

jheer commented Sep 13, 2024 •

edited

Loading

jheer commented Sep 13, 2024 •

edited

Loading

domoritz commented Sep 14, 2024 •

edited

Loading

domoritz left a comment

willium commented Sep 18, 2024

jheer commented Sep 18, 2024

Create data cube index tables in a schema. #519

Create data cube index tables in a schema. #519

Conversation

jheer commented Sep 13, 2024 • edited Loading

jheer commented Sep 13, 2024 • edited Loading

domoritz commented Sep 14, 2024 • edited Loading

domoritz left a comment

Choose a reason for hiding this comment

willium commented Sep 18, 2024

jheer commented Sep 18, 2024

jheer commented Sep 13, 2024 •

edited

Loading

jheer commented Sep 13, 2024 •

edited

Loading

domoritz commented Sep 14, 2024 •

edited

Loading