Asymptotic complexity of keyed functions. #25

wrengr · 2021-12-09T06:40:02Z

All the functions which reconstruct keys (traverseWithKey, mapBy, contextualMapBy, foldrWithKey, toList, keys, etc.) suffer from a quadratic slowdown due to how we reconstruct the keys. I can make some changes to reduce the non-asymptotic factors; but so far as I can tell, actually fixing the bug is impossible under the current design.
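To make the slowdown concrete, here is a minimal sketch. It assumes keys are rebuilt by appending each node's chunk to the prefix accumulated so far. The names `pathChunks`, `naiveKey`, `intermediateBytes` and `oneShotBytes` are hypothetical and are not part of the library; the byte counts are a rough cost model, not a benchmark.

```haskell
import qualified Data.ByteString.Char8 as S

-- Hypothetical: the chunks stored along one root-to-leaf path.
pathChunks :: Int -> [S.ByteString]
pathChunks n = replicate n (S.pack "a")

-- Naive reconstruction: extend the accumulated key at every node.
-- Appending two non-empty strict bytestrings allocates a fresh
-- buffer and copies both arguments into it.
naiveKey :: [S.ByteString] -> S.ByteString
naiveKey = foldl S.append S.empty

-- Rough cost model: the total size of all intermediate keys that
-- naiveKey builds along the way (1 + 2 + ... + n for unit chunks).
intermediateBytes :: [S.ByteString] -> Int
intermediateBytes = sum . scanl1 (+) . map S.length

-- For comparison: a single concatenation touches each byte once.
oneShotBytes :: [S.ByteString] -> Int
oneShotBytes = sum . map S.length
```

Under this model, a path of 1000 one-byte chunks builds 500500 bytes of intermediate keys, whereas a single concatenation touches only 1000.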

We could easily eliminate the slowdown by instead constructing a reversed variant of lazy bytestrings (reversed, because we need snoc-lists of strict bytestrings, whereas the standard lazy bytestrings are cons-lists). However, if the caller then converts those to standard bytestrings, doing so will incur the quadratic cost again. (Going from reversed-lazy-bytestrings to lazy-bytestrings is only quadratic in the number of chunks, whereas going to strict-bytestrings is quadratic in the length.) Thus, while this approach would technically solve our problems, it only does so by pushing the problem off onto the user, which is unacceptable.
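A minimal sketch of the reversed representation described above, assuming a plain snoc-list of strict chunks. The type `RevChunks` and its functions are hypothetical stand-ins for illustration, not the definitions used in Data.Trie.Internal.

```haskell
import qualified Data.ByteString.Char8 as S
import qualified Data.ByteString.Lazy as L

-- Hypothetical snoc-list of strict chunks; the last chunk is outermost,
-- so extending a key on the right never copies the existing prefix.
data RevChunks = Nil | Snoc RevChunks S.ByteString

-- O(1): add one more chunk at the end of the key.
snocChunk :: RevChunks -> S.ByteString -> RevChunks
snocChunk r c
  | S.null c  = r
  | otherwise = Snoc r c

-- Walk the snoc-list once, producing the chunks in forward order.
toChunkList :: RevChunks -> [S.ByteString]
toChunkList = go []
  where
    go acc Nil        = acc
    go acc (Snoc r c) = go (c : acc) r

-- Conversion to a standard lazy bytestring: linear in the number of
-- chunks of this one key, with no byte copying.
toLazyBS :: RevChunks -> L.ByteString
toLazyBS = L.fromChunks . toChunkList

-- Conversion to a strict bytestring: copies every byte of this one key.
toStrictBS :: RevChunks -> S.ByteString
toStrictBS = S.concat . toChunkList
```

Keys that share a prefix can share the same `RevChunks` tail, but each call to `toLazyBS` or `toStrictBS` walks that shared prefix again for every key, which is where the cost reappears on the caller's side.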

I do, at least, have some ideas for how we could solve this by storing extra metadata in the trie. But so far I have no idea how great the overhead of that would be. Ideally, if the metadata can be made cheap enough to compute on the fly, then we could just compute it when we need it. Each call to the key-reconstructing functions would require two passes over the trie, but that's far better than the current approach. However, if the user wants to call these functions often, then it makes more sense to keep the metadata around; but that introduces the burden of updating it whenever changes are made to the trie. And if the cost of those updates is too great, then that pushes us towards forking the datatype so that users can decide which cost model they want, which I'd really rather not do if it can be avoided.


wrengr added the bug label Dec 9, 2021

wrengr added a commit that referenced this issue Dec 10, 2021

Data.Trie.Internal: using RevLazyByteString to reduce the cost of bug #…

ccaed44

…25

wrengr added a commit that referenced this issue Dec 14, 2021

CHANGELOG: reworded the issue of bug #25

be9cc1f

wrengr mentioned this issue Dec 15, 2021

Asymptotic complexity of functions deleting values #26

Open

wrengr added a commit that referenced this issue Jan 1, 2022

Cleaning up Haddock references to bug #25

6ed56cc
