Is your feature request related to a problem or challenge? Please describe what you are trying to do.
Skipping an entire page is often significantly cheaper than skipping values within a page, so we want a reasonable granularity of pages within a column chunk. However, parquet's encodings can be very efficient, which can lead to a very large number of rows in a single page before the page size limit is hit.
Describe the solution you'd like
Add an optional page row count limit to the parquet writer that caps the maximum number of rows within a page.
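To make the request concrete, here is a minimal sketch of the intended behaviour, not an implementation proposal. `PageLimits`, `max_page_bytes`, `max_page_rows` and `split_into_pages` are hypothetical names used only for illustration. The sketch assumes the writer knows the encoded size of each row as it is appended, and that a page is closed as soon as either limit is reached.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class PageLimits:
    """Hypothetical writer settings; these names are illustrative, not a real API."""
    max_page_bytes: int
    max_page_rows: Optional[int] = None  # None disables the row count limit


def split_into_pages(encoded_row_sizes: List[int], limits: PageLimits) -> List[int]:
    """Return the number of rows placed in each page.

    encoded_row_sizes[i] is the encoded size in bytes of row i.
    A page is closed once it reaches the byte limit or the row limit,
    whichever comes first.
    """
    pages: List[int] = []
    rows = 0
    size = 0
    for row_size in encoded_row_sizes:
        rows += 1
        size += row_size
        hit_bytes = size >= limits.max_page_bytes
        hit_rows = limits.max_page_rows is not None and rows >= limits.max_page_rows
        if hit_bytes or hit_rows:
            pages.append(rows)
            rows = 0
            size = 0
    if rows:
        # Flush the final, partially filled page.
        pages.append(rows)
    return pages


# A well-encoded column: 100,000 rows at roughly 1 byte each.
rows = [1] * 100_000
only_bytes = split_into_pages(rows, PageLimits(max_page_bytes=1024 * 1024))
with_rows = split_into_pages(rows, PageLimits(max_page_bytes=1024 * 1024, max_page_rows=20_000))
print(len(only_bytes), len(with_rows))
```

With only the byte limit, all rows land in a single page, so a reader cannot skip anything at page granularity. With the row limit set, the same data is split into several pages that can be skipped individually.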
Describe alternatives you've considered
Additional context
Impala has such an option for the same reason.