Use FastPFOR in RLE/BP encoder #123

gaborcsardi · 2025-02-15T10:47:49Z

The new code is not templated, as we only ever encode uint32_t values currently.
It is much cleaner now, and the encoder itself is probably faster, but this does not really make write_parquet() faster, at least not for flights.

In fact, it is slightly slower for flights x 200, probably because we pack a lot of sections that are shorter than 32 values, so we need to copy them into a buffer first. I guess I can use the old bit packer for these shorter sections.

Bit-packing directly into the output buffer is also problematic because the bit packer is writing out unaligned uint32_t values. It seems to work on aarch64 and x86_64, but it might now work correctly everywhere.

gaborcsardi added 2 commits February 15, 2025 11:47

Use FastPFOR in RLE/BP encoder

9313512

Need to include cmath

a6182a3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use FastPFOR in RLE/BP encoder #123

Use FastPFOR in RLE/BP encoder #123

gaborcsardi commented Feb 15, 2025 •

edited

Loading

Use FastPFOR in RLE/BP encoder #123

Are you sure you want to change the base?

Use FastPFOR in RLE/BP encoder #123

Conversation

gaborcsardi commented Feb 15, 2025 • edited Loading

gaborcsardi commented Feb 15, 2025 •

edited

Loading