Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

parquet reading improvements #650

Merged
merged 1 commit into from
Nov 20, 2024
Merged

parquet reading improvements #650

merged 1 commit into from
Nov 20, 2024

Conversation

richox
Copy link
Collaborator

@richox richox commented Nov 19, 2024

supports parquet parallel reading.

use bufreader to read page header, avoid reading in fragments.

@richox richox force-pushed the dev-parquet-read branch 3 times, most recently from be6c9bf to 5f8000c Compare November 19, 2024 14:50
use bufreader to read page header, avoid reading in fragments.

add hadoop shim.
@richox richox merged commit 22040b0 into master Nov 20, 2024
richox added a commit that referenced this pull request Nov 25, 2024
use bufreader to read page header, avoid reading in fragments.

add hadoop shim.

Co-authored-by: zhangli20 <zhangli20@kuaishou.com>
richox added a commit that referenced this pull request Nov 29, 2024
use bufreader to read page header, avoid reading in fragments.

add hadoop shim.

Co-authored-by: zhangli20 <zhangli20@kuaishou.com>
@richox richox mentioned this pull request Dec 6, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants