Skip to content
This repository has been archived by the owner on Sep 18, 2023. It is now read-only.

[NSE-223] Add Parquet write support to Arrow data source #324

Merged
merged 7 commits into from
May 27, 2021

Conversation

zhztheplayer
Copy link
Collaborator

WIP

@github-actions
Copy link

#223

@zhztheplayer
Copy link
Collaborator Author

@github-actions ram-usage-test

@github-actions
Copy link


@zhztheplayer
Copy link
Collaborator Author

@github-actions ram-usage-test

@github-actions
Copy link


@zhztheplayer zhztheplayer marked this pull request as ready for review May 24, 2021 08:27
@zhztheplayer zhztheplayer force-pushed the NSE-223 branch 2 times, most recently from fb40874 to 999d3fc Compare May 25, 2021 04:29
@zhztheplayer
Copy link
Collaborator Author

@github-actions ram-usage-test

@github-actions
Copy link


@zhztheplayer
Copy link
Collaborator Author

Benchmarked locally:

Disk write speed (SSD) = 10GiB / 35.653s = 287.212857263 MiB/s

10GiB Data, Write, 3 Columns, Parquet, Single Write Task:

vanilla read + vanilla write: 4.0 min
native read + vanilla write: 4.2 min

native read + native write: 2.2 min

@zhztheplayer
Copy link
Collaborator Author

@github-actions ram-usage-test

@github-actions
Copy link


@zhztheplayer zhztheplayer merged commit daed8e7 into oap-project:master May 27, 2021
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant