forked from ibis-project/ibis
Initial commit of pyspark DataFrame backend #1
Merged
icexelloss added a commit that referenced this pull request on Jul 18, 2019
icexelloss added a commit that referenced this pull request on Jul 18, 2019
icexelloss added a commit that referenced this pull request on Aug 7, 2019
icexelloss added a commit that referenced this pull request on Aug 13, 2019
icexelloss added a commit that referenced this pull request on Aug 15, 2019
icexelloss added a commit that referenced this pull request on Aug 15, 2019
icexelloss added a commit that referenced this pull request on Aug 22, 2019
icexelloss added a commit that referenced this pull request on Sep 12, 2019
This is a Pyspark backend for ibis. This is different from the spark backend where the ibis expr is compiled to SQL string. Instead, the pyspark backend compiles the ibis expr to pyspark.DataFrame exprs.

Author: Li Jin <ice.xelloss@gmail.com>
Author: Hyonjee Joo <5000208+hjoo@users.noreply.github.com>

Closes ibis-project#1913 from icexelloss/pyspark-backend-prototype and squashes the following commits:

213e371 [Li Jin] Add pyspark/__init__.py
8f1c35e [Li Jin] Address comments
f173425 [Li Jin] Fix tests
0969b0a [Li Jin] Skip unimplemented tests
1f9409b [Li Jin] Change pyspark imports to optional
26b041c [Li Jin] Add importskip
108ccd8 [Li Jin] Add scope
e00dc00 [Li Jin] Address PR comments
4764a4e [Li Jin] Add pyspark marker to setup.cfg
7cc2a9e [Li Jin] Remove dead code
72b45f8 [Li Jin] Fix rebase errors
9ad663f [Hyonjee Joo] implement pyspark numeric operations to pass all/test_numeric.py (#9)
675a89f [Li Jin] Implement compiler rules to pass all/test_aggregation.py
215c0d9 [Li Jin] Link existing tests with PySpark backend (#7)
88705fe [Li Jin] Implement basic join
c4a2b79 [Hyonjee Joo] add pyspark compile rule for greatest, fix bug with selection (#4)
fa4ad23 [Li Jin] Implement basic aggregation, group_by and window (#3)
54c2f2d [Li Jin] Initial commit of pyspark DataFrame backend (#1)
icexelloss pushed a commit that referenced this pull request on Oct 17, 2019
Fixes ibis-project#1944. This does *not* add support for JSON, JSONB, and UUID PostgreSQL types. Instead, it marks them as having `dt.Any` type, allowing the tables to be loaded, and those columns mostly ignored. There is also some black formatting on some lines I didn't touch...

Author: Ian Rose <ian.r.rose@gmail.com>
Author: Ivan Ogasawara <ivan.ogasawara@gmail.com>

Closes ibis-project#1962 from ian-r-rose/json-uuid-any and squashes the following commits:

6a878b0 [Ian Rose] Merge pull request #1 from Quansight/json-uuid-any
298efc2 [Ivan Ogasawara] Added JSON JSONB and UUID data types.
79bab94 [Ian Rose] Move test to postgres client test suite.
3748ef9 [Ian Rose] Add some light type testing for json, jsonb, uuid.
b514069 [Ian Rose] Allow postgres client to read tables with UUID, JSON, JSONB types.
Initial commit of the PySpark DataFrame backend for ibis.
Implemented Table, Column and trivial selection.
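
To illustrate the difference between compiling an expression to a SQL string and compiling it to operations on a DataFrame object, here is a minimal toy sketch in plain Python. It is not the code in this pull request and does not use ibis or pyspark: the `Table`, `Column`, and `Selection` node classes, the `FrameCompiler` registry, the `scope` mapping, and the list-of-dicts stand-in for a DataFrame are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

# Hypothetical toy expression nodes; these are NOT the ibis classes.
@dataclass(frozen=True)
class Table:
    name: str

@dataclass(frozen=True)
class Column:
    table: Table
    name: str

@dataclass(frozen=True)
class Selection:
    table: Table
    columns: Tuple[Column, ...]

# A list of row dicts stands in for a real DataFrame in this sketch.
Frame = List[Dict[str, object]]


def compile_to_sql(expr) -> str:
    """String-based compilation: the result is SQL text to run elsewhere."""
    if isinstance(expr, Table):
        return expr.name
    if isinstance(expr, Column):
        return f"{compile_to_sql(expr.table)}.{expr.name}"
    if isinstance(expr, Selection):
        cols = ", ".join(compile_to_sql(c) for c in expr.columns)
        return f"SELECT {cols} FROM {compile_to_sql(expr.table)}"
    raise NotImplementedError(type(expr).__name__)


class FrameCompiler:
    """Object-based compilation: one rule per node type, looked up by type."""

    def __init__(self):
        self._rules = {}

    def register(self, node_type):
        def decorator(fn):
            self._rules[node_type] = fn
            return fn
        return decorator

    def compile(self, expr, scope):
        try:
            rule = self._rules[type(expr)]
        except KeyError:
            raise NotImplementedError(type(expr).__name__) from None
        return rule(self, expr, scope)


compiler = FrameCompiler()

@compiler.register(Table)
def _compile_table(c, expr, scope):
    # Look up the already-loaded frame by table name.
    return scope[expr.name]

@compiler.register(Column)
def _compile_column(c, expr, scope):
    # Compile the parent table first, then project a single column.
    frame = c.compile(expr.table, scope)
    return [row[expr.name] for row in frame]

@compiler.register(Selection)
def _compile_selection(c, expr, scope):
    # Keep only the requested columns of each row.
    frame = c.compile(expr.table, scope)
    names = [col.name for col in expr.columns]
    return [{n: row[n] for n in names} for row in frame]


# Usage: the same expression compiled both ways.
t = Table("events")
expr = Selection(t, (Column(t, "id"), Column(t, "kind")))
scope = {"events": [{"id": 1, "kind": "a", "ts": 10},
                    {"id": 2, "kind": "b", "ts": 20}]}

sql_text = compile_to_sql(expr)          # a string
selected = compiler.compile(expr, scope)  # a frame-like object
```

In this sketch the SQL path produces text that some engine must still parse, while the registry path returns a live object built by composing per-node rules, so unsupported nodes fail early with `NotImplementedError` at compile time.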