A collection of hashing improvements from using hashtables. #4094

chandlerc · 2024-06-30T03:29:42Z

LLVM's APInt and APFloat need specialized handling to be used effectively in hashtables. We can't inject overrides into LLVM so we need to handle them in our hashing routine.

There were also problematic limits on hashing pairs and tuples. First, the unique-object-representation hashing of pairs was more restricted than tuples which was a problematic asymmetry and isn't needed. But the larger issue is that we didn't support recursively hashing when necessary. That requires a careful predicate to avoid infinite recursion but lets us handle important use cases for hashtables with a tuple as a key.

Also added support for hashing arrays that recurse in addition to arrays where we can hash the raw storage, and added overloads to redirect to common array handling from various array-like types.

Last but not least, re-worked the constraint model for hashing as raw data to not override custom hashing functions.

LLVM's `APInt` and `APFloat` need specialized handling to be used effectively in hashtables. We can't inject overrides into LLVM so we need to handle them in our hashing routine. There were also problematic limits on hashing pairs and tuples. First, the unique-object-representation hashing of pairs was more restricted than tuples which was a problematic asymmetry and isn't needed. But the larger issue is that we didn't support recursively hashing when necessary. That requires a careful predicate to avoid infinite recursion but lets us handle important use cases for hashtables with a tuple as a key. Also added support for hashing arrays that recurse in addition to arrays where we can hash the raw storage, and added overloads to redirect to common array handling from various array-like types. Last but not least, re-worked the constraint model for hashing as raw data to not override custom hashing functions.

common/hashing.h

chandlerc

Thanks for the review. A new version that tries to do this in a bit more principled way.

common/hashing.h

Co-authored-by: Richard Smith <richard@metafoo.co.uk>

common/hashing.h

Co-authored-by: Richard Smith <richard@metafoo.co.uk>

Co-authored-by: Carbon Infra Bot <carbon-external-infra@google.com>

common/hashing.h

github-actions bot requested a review from josh11b June 30, 2024 03:29

github-actions bot added the toolchain label Jun 30, 2024

chandlerc force-pushed the hashing-improvements branch from b348a8a to e649adc Compare June 30, 2024 03:30

chandlerc mentioned this pull request Jun 30, 2024

Key context improvements #4095

Merged

chandlerc requested a review from zygoloid July 1, 2024 20:10

zygoloid reviewed Jul 1, 2024

View reviewed changes

common/hashing.h Show resolved Hide resolved

common/hashing.h Outdated Show resolved Hide resolved

common/hashing.h Outdated Show resolved Hide resolved

Sink the dispatch down and make it more robust.

eefb408

chandlerc commented Jul 2, 2024

View reviewed changes

common/hashing.h Show resolved Hide resolved

common/hashing.h Outdated Show resolved Hide resolved

common/hashing.h Outdated Show resolved Hide resolved

chandlerc requested a review from zygoloid July 2, 2024 19:09

zygoloid reviewed Jul 2, 2024

View reviewed changes

common/hashing.h Outdated Show resolved Hide resolved

Update common/hashing.h

61fb202

Co-authored-by: Richard Smith <richard@metafoo.co.uk>

zygoloid approved these changes Jul 2, 2024

View reviewed changes

common/hashing.h Outdated Show resolved Hide resolved

common/hashing.h Outdated Show resolved Hide resolved

CarbonInfraBot reviewed Jul 2, 2024

View reviewed changes

common/hashing.h Outdated Show resolved Hide resolved

chandlerc and others added 2 commits July 2, 2024 13:44

Apply suggestions from code review

d6181e9

Co-authored-by: Richard Smith <richard@metafoo.co.uk>

Update common/hashing.h

17e0191

Co-authored-by: Carbon Infra Bot <carbon-external-infra@google.com>

CarbonInfraBot reviewed Jul 2, 2024

View reviewed changes

common/hashing.h Outdated Show resolved Hide resolved

chandlerc enabled auto-merge July 2, 2024 20:48

chandlerc added this pull request to the merge queue Jul 2, 2024

Merged via the queue into carbon-language:trunk with commit bf736e6 Jul 2, 2024
7 checks passed

chandlerc deleted the hashing-improvements branch July 2, 2024 21:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A collection of hashing improvements from using hashtables. #4094

A collection of hashing improvements from using hashtables. #4094

chandlerc commented Jun 30, 2024

chandlerc left a comment

A collection of hashing improvements from using hashtables. #4094

A collection of hashing improvements from using hashtables. #4094

Conversation

chandlerc commented Jun 30, 2024

chandlerc left a comment

Choose a reason for hiding this comment