sql: support JOIN #2970

petermattis · 2015-10-30T16:41:00Z

Support JOIN in all of its wonderful incarnations. The initial implementation should focus on correctness and not worry about optimizing the join order based on table statistics.
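
As a rough illustration of what a correctness-first join could look like, here is a minimal nested-loop inner join sketch. It is not CockroachDB code: the `Row` type, the `NestedLoopJoin` function, and the column names are all hypothetical and exist only for this example. It compares every pair of rows and makes no attempt to pick a join order.

```go
package main

import "fmt"

// Row is a hypothetical row representation (column name -> value).
// It is an assumption for this sketch, not a CockroachDB type.
type Row map[string]interface{}

// NestedLoopJoin returns the inner join of left and right, keeping every
// pair of rows for which on(l, r) is true. It checks all pairs, so it costs
// O(len(left) * len(right)) and ignores table statistics entirely.
func NestedLoopJoin(left, right []Row, on func(l, r Row) bool) []Row {
	var out []Row
	for _, l := range left {
		for _, r := range right {
			if !on(l, r) {
				continue
			}
			// Merge the columns of both rows into one output row.
			merged := make(Row, len(l)+len(r))
			for k, v := range l {
				merged[k] = v
			}
			for k, v := range r {
				merged[k] = v
			}
			out = append(out, merged)
		}
	}
	return out
}

func main() {
	users := []Row{{"u.id": 1, "u.name": "ann"}, {"u.id": 2, "u.name": "bob"}}
	orders := []Row{{"o.user_id": 1, "o.total": 30}, {"o.user_id": 1, "o.total": 12}}
	rows := NestedLoopJoin(users, orders, func(l, r Row) bool {
		return l["u.id"] == r["o.user_id"]
	})
	fmt.Println(len(rows)) // prints 2
}
```

Taking the join condition as a function keeps the sketch independent of any particular predicate, at the cost of always scanning every pair.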

Freeaqingme · 2016-04-27T17:06:17Z

Has there been any work done on this feature behind the scenes? If not, is there perhaps some design documentation already available?

It could be fun to try to give this one a shot to contribute...

RaduBerinde · 2016-04-27T17:28:01Z

We have been thinking about the more general problem of how to distribute SQL computation across the cluster; there is an RFC at https://github.com/cockroachdb/cockroach/blob/master/docs/RFCS/distributed_sql.md

We are aiming to have some limited join support before that, using code that will be reusable later within the distributed SQL framework. But even that will involve quite a bit of code restructuring. TBH, this doesn't seem like a good starter project for someone who isn't familiar with the codebase.

Freeaqingme · 2016-04-27T17:44:11Z

Alright. I will let this one slide then, see if there's something interesting with the help-wanted label.

Tnx!

dt · 2016-05-02T22:21:12Z

@Freeaqingme
A few options come to mind that might be well suited for getting started with the codebase:

CREATE TABLE ... AS (sql: support CREATE TABLE ... AS ...? #2483)
TEMP table support (sql: Add support for TEMP tables #5807)
Add support for password-based auth to the pgwire protocol (sql: add support for pgwire password-based auth #6457)

electrum · 2016-05-04T22:46:15Z

Have you considered writing a Presto connector for CockroachDB? Presto is a full distributed SQL query engine with pluggable connectors (data sources) and supports distributed joins, including joins between different connectors.

We support batch index joins for tables that are indexed on the join key and otherwise support broadcast and distributed hash joins.
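
For readers unfamiliar with the term, a hash join builds an in-memory table keyed on the join column from one input and probes it with the other. The sketch below is a single-node illustration only: the `Tuple`, `Joined`, and `HashJoin` names are hypothetical, and it does not reflect how Presto or CockroachDB implement broadcast or distributed hash joins.

```go
package main

import "fmt"

// Tuple is a hypothetical two-column row: a join key and a payload.
type Tuple struct {
	Key int
	Val string
}

// Joined is one output row of the equi-join on Key.
type Joined struct {
	Key         int
	Left, Right string
}

// HashJoin performs an inner equi-join. It hashes the build side by key,
// then scans the probe side once, emitting a row for every matching pair.
func HashJoin(build, probe []Tuple) []Joined {
	// Build phase: group build-side payloads by join key.
	table := make(map[int][]string, len(build))
	for _, b := range build {
		table[b.Key] = append(table[b.Key], b.Val)
	}
	// Probe phase: look up each probe row's key in the hash table.
	var out []Joined
	for _, p := range probe {
		for _, v := range table[p.Key] {
			out = append(out, Joined{Key: p.Key, Left: v, Right: p.Val})
		}
	}
	return out
}

func main() {
	build := []Tuple{{1, "ann"}, {2, "bob"}}
	probe := []Tuple{{1, "order-a"}, {1, "order-b"}, {3, "order-c"}}
	for _, j := range HashJoin(build, probe) {
		fmt.Println(j.Key, j.Left, j.Right)
	}
}
```

Using the smaller input as the build side keeps the hash table small; the probe side is only streamed through once.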

petermattis · 2016-05-05T01:15:11Z

@electrum Presto appears targeted at analytics, while CockroachDB is targeted at transactional workloads. Beyond that, Presto is written in Java while CockroachDB is written in Go. Calling out to Java for SQL execution doesn't seem good from a performance perspective for transactional workloads.

electrum · 2016-05-05T04:29:59Z

@petermattis You're correct, Presto is definitely targeted at analytics, although the engine itself is capable of low latency queries. We have an internal connector at Facebook based on a sharded MySQL backend that can do complex, multi-way index join queries for reporting workloads in hundreds of milliseconds: https://www.youtube.com/watch?v=Gf9JqvNNRZg

I'm definitely not suggesting calling out to or trying to use Presto within CockroachDB -- as you say, that would be horrible for transactional workloads, nor is it technically feasible. However, it could be a good complement for other workloads like reporting, analytics, ETL, batch pipelines, combining heterogeneous data sources, etc., and might also serve as a stopgap.

electrum · 2016-05-05T04:30:47Z

Unrelated, I really like that design document and all the rest of the documentation for the project. It's probably the best documented project I've seen and is a model for others to strive towards.

petermattis · 2016-05-05T12:49:13Z

CockroachDB speaks the postgres wire protocol and our SQL is similar to the PostgreSQL dialect. The existing presto-postgres connector might work (with some adjustments).

Thanks for the note about the documentation. It is nice to hear those efforts are being recognized and appreciated.

tamird · 2016-07-02T01:49:19Z

I'm going to close this now that #7202 is in. There's more work to be done, but the spirit of this issue is implemented.

petermattis added the SQL label Oct 30, 2015

petermattis added this to the 1.0 milestone Oct 30, 2015

jess-edwards mentioned this issue Oct 30, 2015

Product Roadmap #2132

Closed

78 tasks

maddyblue mentioned this issue Dec 2, 2015

sql: increase the sqllogic test coverage #3292

Closed

7 tasks

petermattis assigned RaduBerinde Feb 3, 2016

petermattis added the C-enhancement label (Solution expected to add code/behavior + preserve backward-compat; pg compat issues are exception) and removed the SQL label Feb 13, 2016

alex mentioned this issue Mar 12, 2016

sql: support for "meta" postgresql tables? #5194

Closed

knz mentioned this issue Jun 18, 2016

sql: prototype JOIN and refactor table alias and qargs #7202

Merged

tamird closed this as completed Jul 2, 2016
