Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Python] Add read_csv and read_parquet methods #976

Merged
merged 13 commits into from
Feb 10, 2024

Conversation

andygrove
Copy link
Member

@andygrove andygrove commented Feb 10, 2024

Which issue does this PR close?

N/A

Depends on apache/datafusion-python#582

Rationale for this change

This is the first step to re-enabling the full DataFrame API from Python, but without duplicating DataFusion's PyDataFrame this time.

What changes are included in this PR?

  • Add read_csv and read_parquet to PySessionContext
  • Refactor code into separate files
  • Add new tests that demonstrate using these methods as well as using DataFrame API transformations

Are there any user-facing changes?

@andygrove andygrove changed the title Read csv [Python] Add PySessionContext::read_csv Feb 10, 2024
@andygrove andygrove changed the title [Python] Add PySessionContext::read_csv [Python] Add read_csv and read_parquet methods Feb 10, 2024
@andygrove andygrove marked this pull request as ready for review February 10, 2024 17:37
@andygrove andygrove merged commit 912f789 into apache:main Feb 10, 2024
17 checks passed
@andygrove andygrove deleted the read-csv branch February 10, 2024 18:09
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant