Consider automatically creating local indexes on partition keys #856

senderista · 2016-09-09T04:07:30Z

When a user specifies hash partitioning in STORE(), they presumably anticipate joining that relation with others on the partition key. Provided the local join optimization in RACO is applied and the local join is pushed down into the local storage engine (i.e., Postgres), the join should be considerably faster if an index on the partition key already exists for both relations, since that should let Postgres choose either an indexed merge join or an indexed nested loop join. We could automatically create such an index whenever a DbInsert operator with hash partitioning is executed.
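As a rough sketch of what the automatic step might issue to Postgres, the helper below builds a `CREATE INDEX` statement for a relation's partition key columns. The function names, the `<relation>_partition_idx` naming scheme, and the example relation `edges` are all hypothetical and not taken from the Myria codebase; `IF NOT EXISTS` assumes Postgres 9.5 or later.

```python
def quote_ident(name: str) -> str:
    """Quote a SQL identifier, doubling any embedded double quotes."""
    return '"' + name.replace('"', '""') + '"'


def partition_index_sql(relation: str, partition_columns: list[str]) -> str:
    """Build a CREATE INDEX statement covering the hash partition key.

    Hypothetical helper: the index naming scheme is an assumption made
    for illustration only.
    """
    if not partition_columns:
        raise ValueError("hash partitioning needs at least one key column")
    index_name = f"{relation}_partition_idx"  # assumed naming scheme
    columns = ", ".join(quote_ident(c) for c in partition_columns)
    return (
        f"CREATE INDEX IF NOT EXISTS {quote_ident(index_name)} "
        f"ON {quote_ident(relation)} ({columns})"
    )


if __name__ == "__main__":
    print(partition_index_sql("edges", ["src"]))
```

The statement would presumably run on each worker's local Postgres instance once the insert has finished, so that the index is built in bulk rather than maintained row by row during loading.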

The downside of this automated approach is that considerable resources may be expended during index creation, which may take a long time for large relations and slow down queries in progress. We should benchmark this overhead and automate this index optimization if the overhead seems acceptable.


senderista · 2017-02-10T18:22:15Z

A possible refinement is to use the C locale for collation on these indexes. That massively speeds up sorting (because locale-aware string comparison can be avoided in favor of plain byte-wise comparison), and allows the Postgres "abbreviated keys" optimization to kick in (which had to be disabled for non-C locales because of glibc bugs).

If we did use the C locale for these indexes, we would have to ensure that they couldn't be used for user-visible comparisons, or users might get unexpected results.
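To make the trade-off concrete, here is a minimal sketch. The first helper builds an index definition that applies `COLLATE "C"` per column, which is valid only for collatable (text-like) columns; its name and the `<relation>_partition_c_idx` naming scheme are hypothetical. The second part imitates byte-wise ordering in plain Python to show how it differs from what a user of a linguistic locale would typically expect.

```python
def quote_ident(name: str) -> str:
    """Quote a SQL identifier, doubling any embedded double quotes."""
    return '"' + name.replace('"', '""') + '"'


def c_collation_index_sql(relation: str, text_columns: list[str]) -> str:
    """Build a CREATE INDEX statement using the C collation.

    Hypothetical helper: assumes every listed column is a text type,
    since COLLATE is rejected on non-collatable types such as integers.
    """
    if not text_columns:
        raise ValueError("at least one column is required")
    index_name = f"{relation}_partition_c_idx"  # assumed naming scheme
    columns = ", ".join(
        f'{quote_ident(c)} COLLATE "C"' for c in text_columns
    )
    return f"CREATE INDEX {quote_ident(index_name)} ON {quote_ident(relation)} ({columns})"


def byte_order(values: list[str]) -> list[str]:
    """Sort strings by their UTF-8 bytes, imitating C-locale ordering."""
    return sorted(values, key=lambda s: s.encode("utf-8"))


if __name__ == "__main__":
    print(c_collation_index_sql("users", ["name"]))
    # Uppercase ASCII letters sort before lowercase ones in byte order.
    print(byte_order(["apple", "Banana", "cherry"]))
    # prints ['Banana', 'apple', 'cherry']
```

Equality joins on the partition key are unaffected by this ordering difference, which is why such an index would remain safe for local joins while being unsuitable for a user-visible `ORDER BY` or range comparison under the database's default collation.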

senderista added the Enhancement label Sep 9, 2016

senderista self-assigned this Sep 9, 2016
