Initial commit of pyspark DataFrame backend #1

Merged (1 commit) on Jul 16, 2019


icexelloss
Owner

@icexelloss icexelloss commented Jul 12, 2019

Initial commit of the PySpark DataFrame backend for ibis.

Implemented Table, Column and trivial selection.

@icexelloss icexelloss merged commit 94d8b0e into pyspark-backend-prototype Jul 16, 2019
icexelloss added a commit that referenced this pull request Sep 12, 2019
This is a PySpark backend for ibis. It differs from the spark backend,
which compiles the ibis expr to a SQL string. The pyspark backend
instead compiles the ibis expr to pyspark.DataFrame exprs.
Author: Li Jin <ice.xelloss@gmail.com>
Author: Hyonjee Joo <5000208+hjoo@users.noreply.github.com>

Closes ibis-project#1913 from icexelloss/pyspark-backend-prototype and squashes the following commits:

213e371 [Li Jin] Add pyspark/__init__.py
8f1c35e [Li Jin] Address comments
f173425 [Li Jin] Fix tests
0969b0a [Li Jin] Skip unimplemented tests
1f9409b [Li Jin] Change pyspark imports to optional
26b041c [Li Jin] Add importskip
108ccd8 [Li Jin] Add scope
e00dc00 [Li Jin] Address PR comments
4764a4e [Li Jin] Add pyspark marker to setup.cfg
7cc2a9e [Li Jin] Remove dead code
72b45f8 [Li Jin] Fix rebase errors
9ad663f [Hyonjee Joo] implement pyspark numeric operations to pass all/test_numeric.py (#9)
675a89f [Li Jin] Implement compiler rules to pass all/test_aggregation.py
215c0d9 [Li Jin] Link existing tests with PySpark backend (#7)
88705fe [Li Jin] Implement basic join
c4a2b79 [Hyonjee Joo] add pyspark compile rule for greatest, fix bug with selection (#4)
fa4ad23 [Li Jin] Implement basic aggregation, group_by and window (#3)
54c2f2d [Li Jin] Initial commit of pyspark DataFrame backend (#1)
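
The distinction drawn in the commit message above (compiling an expression to a SQL string versus compiling it to DataFrame-style objects) can be illustrated with a minimal, library-free sketch. Everything here is hypothetical and for illustration only: `Col`, `Lit`, `Add`, `ToyColumn`, `ToyFrame`, `compile_to_sql` and `compile_to_objects` are not ibis or PySpark names, and `ToyColumn` is only a stand-in for a column object that supports operators.

```python
from dataclasses import dataclass


# Hypothetical toy expression nodes (not ibis classes).
@dataclass(frozen=True)
class Col:
    name: str


@dataclass(frozen=True)
class Lit:
    value: int


@dataclass(frozen=True)
class Add:
    left: object
    right: object


def compile_to_sql(expr):
    """String-based strategy: walk the tree and emit SQL text."""
    if isinstance(expr, Col):
        return expr.name
    if isinstance(expr, Lit):
        return repr(expr.value)
    if isinstance(expr, Add):
        return f"({compile_to_sql(expr.left)} + {compile_to_sql(expr.right)})"
    raise NotImplementedError(type(expr).__name__)


class ToyColumn:
    """Stand-in for a DataFrame column object that overloads `+`."""

    def __init__(self, values):
        self.values = list(values)

    def __add__(self, other):
        if isinstance(other, ToyColumn):
            return ToyColumn(a + b for a, b in zip(self.values, other.values))
        return ToyColumn(a + other for a in self.values)

    __radd__ = __add__  # addition is commutative here


class ToyFrame:
    """Stand-in for a DataFrame: indexing by name yields a column object."""

    def __init__(self, data):
        self.data = data

    def __getitem__(self, name):
        return ToyColumn(self.data[name])


def compile_to_objects(expr, frame):
    """Object-based strategy: walk the same tree, but build column objects."""
    if isinstance(expr, Col):
        return frame[expr.name]
    if isinstance(expr, Lit):
        return expr.value
    if isinstance(expr, Add):
        return compile_to_objects(expr.left, frame) + compile_to_objects(expr.right, frame)
    raise NotImplementedError(type(expr).__name__)


expr = Add(Col("a"), Lit(1))
sql_text = compile_to_sql(expr)
column = compile_to_objects(expr, ToyFrame({"a": [1, 2, 3]}))
```

Both functions traverse the same tree. The difference is only in what each rule returns: text to be parsed later by an engine, or an object built by calling the target API directly.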
icexelloss pushed a commit that referenced this pull request Oct 17, 2019
Fixes ibis-project#1944. This does *not* add support for JSON, JSONB, and UUID
PostgreSQL types. Instead, it marks them as having `dt.Any` type,
allowing the tables to be loaded, and those columns mostly ignored.
There is also some black formatting on some lines I didn't touch...
Author: Ian Rose <ian.r.rose@gmail.com>
Author: Ivan Ogasawara <ivan.ogasawara@gmail.com>

Closes ibis-project#1962 from ian-r-rose/json-uuid-any and squashes the following commits:

6a878b0 [Ian Rose] Merge pull request #1 from Quansight/json-uuid-any
298efc2 [Ivan Ogasawara] Added JSON JSONB and UUID data types.
79bab94 [Ian Rose] Move test to postgres client test suite.
3748ef9 [Ian Rose] Add some light type testing for json, jsonb, uuid.
b514069 [Ian Rose] Allow postgres client to read tables with UUID, JSON, JSONB types.
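
The approach described in the commit message above (mapping unsupported column types to a catch-all type so a table can still be loaded) might look roughly like the following sketch. It is an assumption-laden illustration, not the actual ibis code: `KNOWN_TYPES`, `OPAQUE_TYPES`, `ANY`, `map_column_type` and `infer_schema` are hypothetical names, and plain strings stand in for real datatype objects such as `dt.Any`.

```python
# Hypothetical mapping from PostgreSQL type names to toy type labels.
KNOWN_TYPES = {
    "integer": "int32",
    "text": "string",
    "boolean": "boolean",
}

# Types that are not really supported, but should not block loading a table.
OPAQUE_TYPES = {"json", "jsonb", "uuid"}

ANY = "any"  # stand-in for a catch-all datatype


def map_column_type(pg_type):
    """Return a type label, falling back to ANY for opaque types."""
    key = pg_type.lower()
    if key in KNOWN_TYPES:
        return KNOWN_TYPES[key]
    if key in OPAQUE_TYPES:
        return ANY
    raise NotImplementedError(f"unsupported column type: {pg_type}")


def infer_schema(columns):
    """Build a {column name: type label} schema from (name, pg_type) pairs."""
    return {name: map_column_type(pg_type) for name, pg_type in columns}


schema = infer_schema([("id", "integer"), ("payload", "jsonb"), ("token", "UUID")])
```

With this kind of fallback, a table containing a `jsonb` or `uuid` column yields a usable schema, and only types outside both sets still raise an error.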