-
Notifications
You must be signed in to change notification settings - Fork 96
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature: PySpark integration #3774
Comments
Hi @BjarkeTornager, this is something that could be on the roadmap but not yet been prioritized as we typically wait for several upvotes from the community to decide how much to prioritize new integrations. There are numerous other integrations already underway for our 0.5.0 release and beyond, so hope you can understand. In the meantime, we are also releasing a basic graph algorithms package soon that can provide some of the functionality that GraphFrames does, so stay tuned! |
Thanks @prrao87, looking forward to the Kùzu basic graph algorithm package! |
It would be have to have spark integration with kuzu, especially for large scale data ingestion! |
Just adding some scope for initial functionality here: The proposed integration would behave just like the Pandas/Polars DataFrame integration does:
Unlike Pandas/Polars, the I/O and related tasks may not be fully in-memory - we'd need to see how the persistent formats under the hood of Spark work, and also how to design the API to expose the connector to the Python client of Kùzu. |
While dealing with large scale data it's best if there is a way to integrate kuzu with spark dataframe. Something like what Neo4j has. This way anyone can upload batches of data to Kuzu without writing extensive code. |
API
Python
Description
Have you considered making an integration between Kùzu and PySpark?
Neo4j, as an example, has a Neo4j connector for Apache Spark.
Spark also has a community project called GraphFrames that can be used for basic graph algorithms.
Since Spark is widely used for analytical queries, machine learning, and streaming it could be useful to move between the two.
The text was updated successfully, but these errors were encountered: