Expose ArrowWriter
row group flush in public API
#1626
Labels
enhancement
Any new improvement worthy of a entry in the changelog
parquet
Changes to the parquet crate
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
From what I have read predicate pushdown filtering in parquet works on row-group level, so in my case I should be able to optimize reads by manually closing row-group.
Describe the solution you'd like
Simply expose in ArrowWriter API flush_row_group method that flushes all buffered rows.
Describe alternatives you've considered
I have considered using SerializedFileWriter, however due to my lack of complete understanding of definition, repetition levels I would prefer to use high level API like ArrowWriter.
The text was updated successfully, but these errors were encountered: