Perf: Fix a bottleneck in calculate_hashes
#41
Merged
After profiling Floresta I found that `calculate_hashes`, a method used for computing roots from a proof, was spending an absurd amount of time in one specific function called `sorted_push`. As the name implies, this function adds an element to a collection while keeping the whole collection sorted. The previous approach was simply to push the element and then sort the array, but the repeated calls to merge sort caused a severe performance hit for large proofs.

This PR replaces that function with an elementary alternative: we preserve the sorting invariant by inserting the new element exactly where it belongs, and we find that position with a binary search over the collection.
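The idea above can be sketched roughly as follows. This is a minimal illustration of binary-search insertion, not Floresta's actual implementation; the function name and signature are only illustrative:

```rust
/// Insert `value` into `vec` while keeping `vec` sorted
/// (hypothetical sketch; not the real `sorted_push`).
fn sorted_push<T: Ord>(vec: &mut Vec<T>, value: T) {
    // `binary_search` returns Ok(idx) if an equal element exists,
    // or Err(idx) with the position where `value` can be inserted
    // to keep the vector sorted. Either index works for insertion.
    let idx = match vec.binary_search(&value) {
        Ok(idx) | Err(idx) => idx,
    };
    vec.insert(idx, value);
}

fn main() {
    let mut v = vec![1, 3, 5, 7];
    sorted_push(&mut v, 4);
    assert_eq!(v, vec![1, 3, 4, 5, 7]);
}
```

This turns each insertion into an O(log n) search plus an O(n) shift, instead of an O(n log n) sort after every push.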
For reference, I wrote a small doc with different approaches that could be taken. The profiling mentioned above is this flamegraph, and here's how it looks after this change. This PR also adds a benchmark for the method; here are the results before and after.