Small optimization in Parquet varint decoder #8742

etseidl · 2025-10-30T04:18:29Z

Which issue does this PR close?

Part of [EPIC] A collection of items to improve speed of parquet metadata encoding #5853.

Rationale for this change

Following the recent improvements in Thrift decoding, the percentage of time spent decoding LEB128 encoded integers has increased.

What changes are included in this PR?

This PR modifies the varint decoder to first test for integers that can be encoded in a single byte (using zig-zag encoding, the maximum int that can be encoded is 63). Many of the fields in the Parquet footer (including all enum values) will be in this range, so optimizing for this frequent occurrence makes sense.

Are these changes tested?

Should be covered by existing tests

Are there any user-facing changes?

No

etseidl · 2025-10-30T14:15:40Z

bench on intel i7-12700K

group                             57_0_0                                 vlq
-----                             ------                                 ---
decode parquet metadata           1.04      4.9±0.04µs        ? ?/sec    1.00      4.7±0.06µs        ? ?/sec
decode parquet metadata (wide)    1.06     17.8±0.20ms        ? ?/sec    1.00     16.8±0.25ms        ? ?/sec
open(default)                     1.04      5.2±0.07µs        ? ?/sec    1.00      5.0±0.05µs        ? ?/sec
open(page index)                  1.12    104.9±0.80µs        ? ?/sec    1.00     93.8±1.07µs        ? ?/sec

etseidl · 2025-10-30T14:34:40Z

Hmm...github lost a comment last night.

bench on intel macbook

group                             57_0                                   vlq
-----                             ----                                   ---
decode parquet metadata           1.03     14.9±0.43µs        ? ?/sec    1.00     14.5±0.30µs        ? ?/sec
decode parquet metadata (wide)    1.05     52.0±1.26ms        ? ?/sec    1.00     49.5±1.58ms        ? ?/sec
open(default)                     1.03     15.4±0.47µs        ? ?/sec    1.00     15.0±0.25µs        ? ?/sec
open(page index)                  1.13    242.2±9.01µs        ? ?/sec    1.00    215.1±5.72µs        ? ?/sec

alamb

Makes sense to me. Thank you @etseidl -- I queued up a benchmark to confirm

Another thing we could try if we wanted to get all crazy is manually unrolling the loop (at least for the first 4 or 8 bytes) to remove the back branch 🤔

alamb · 2025-10-30T17:29:13Z

BTW it was nice to read https://en.wikipedia.org/wiki/Variable-length_quantity understand this better

alamb · 2025-10-30T17:30:03Z

Many of the fields in the Parquet footer (including all enum values) will be in this range, so optimizing for this frequent occurrence makes sense.

This is a great observation btw

etseidl · 2025-10-30T17:42:30Z

Many of the fields in the Parquet footer (including all enum values) will be in this range, so optimizing for this frequent occurrence makes sense.

This is a great observation btw

There's actually prior art in the rust compiler. rust-lang/rust#92604

alamb · 2025-10-30T18:45:54Z

🤖 ./gh_compare_arrow.sh Benchmark Script Running
Linux aal-dev 6.14.0-1017-gcp #18~24.04.1-Ubuntu SMP Tue Sep 23 17:51:44 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux
Comparing vlq_speedup (e78c56d) to 1c8eac1 diff
BENCH_NAME=metadata
BENCH_COMMAND=cargo bench --features=arrow,async,test_common,experimental --bench metadata
BENCH_FILTER=
BENCH_BRANCH_NAME=vlq_speedup
Results will be posted here when complete

alamb · 2025-10-30T18:49:07Z

🤖: Benchmark completed

Details

group                             main                                   vlq_speedup
-----                             ----                                   -----------
decode parquet metadata           1.01      9.6±0.07µs        ? ?/sec    1.00      9.5±0.04µs        ? ?/sec
decode parquet metadata (wide)    1.00     43.8±0.71ms        ? ?/sec    1.00     43.9±1.57ms        ? ?/sec
open(default)                     1.00      9.6±0.04µs        ? ?/sec    1.02      9.8±0.04µs        ? ?/sec
open(page index)                  1.10    194.0±1.39µs        ? ?/sec    1.00    176.0±2.40µs        ? ?/sec

alamb · 2025-10-31T13:49:36Z

The benchmark results look consistent with an improvment to me -- great work @etseidl

alamb · 2025-10-31T14:31:07Z

Another thing we could try if we wanted to get all crazy is manually unrolling the loop (at least for the first 4 or 8 bytes) to remove the back branch 🤔

I got crazy and gave it a try:

PROTOTYPE: manually unroll vlq decoder loop #8757

optimization for read_vlq

e78c56d

github-actions bot added the parquet Changes to the parquet crate label Oct 30, 2025

etseidl added the performance label Oct 30, 2025

etseidl marked this pull request as ready for review October 30, 2025 14:34

alamb approved these changes Oct 30, 2025

View reviewed changes

alamb merged commit bac0cb5 into apache:main Oct 31, 2025
16 checks passed

alamb mentioned this pull request Oct 31, 2025

PROTOTYPE: manually unroll vlq decoder loop #8757

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Small optimization in Parquet varint decoder #8742

Small optimization in Parquet varint decoder #8742

Uh oh!

etseidl commented Oct 30, 2025

Uh oh!

etseidl commented Oct 30, 2025

Uh oh!

etseidl commented Oct 30, 2025

Uh oh!

alamb left a comment •

edited

Loading

Uh oh!

alamb commented Oct 30, 2025 •

edited

Loading

Uh oh!

alamb commented Oct 30, 2025

Uh oh!

etseidl commented Oct 30, 2025

Uh oh!

alamb commented Oct 30, 2025

Uh oh!

alamb commented Oct 30, 2025

Uh oh!

alamb commented Oct 31, 2025

Uh oh!

Uh oh!

alamb commented Oct 31, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Small optimization in Parquet varint decoder #8742

Small optimization in Parquet varint decoder #8742

Uh oh!

Conversation

etseidl commented Oct 30, 2025

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

etseidl commented Oct 30, 2025

Uh oh!

etseidl commented Oct 30, 2025

Uh oh!

alamb left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

alamb commented Oct 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

alamb commented Oct 30, 2025

Uh oh!

etseidl commented Oct 30, 2025

Uh oh!

alamb commented Oct 30, 2025

Uh oh!

alamb commented Oct 30, 2025

Uh oh!

alamb commented Oct 31, 2025

Uh oh!

Uh oh!

alamb commented Oct 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

alamb left a comment •

edited

Loading

alamb commented Oct 30, 2025 •

edited

Loading

alamb commented Oct 31, 2025 •

edited

Loading