Reuse phf hash and remove phf::OrderedSet indirection #103
Merged
This allows us to bypass the indirection of `phf::OrderedSet`: we don’t care about the order, only about getting numeric indices. This will also allow us to re-use the phf hash for the dynamic table, to avoid hashing strings twice.
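The hash reuse described above can be sketched with the standard library alone. This is a minimal illustration, not the code from this pull request: `hash_str`, `StaticSet`, `Interner` and `Atom` are hypothetical names, `DefaultHasher` stands in for the phf hash, and a linear-probing table stands in for the generated perfect-hash table. The point it shows is that the string is hashed once and the same value drives both the static lookup and the dynamic-table lookup.

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

/// Hypothetical stand-in for the phf hash: any deterministic 64-bit string hash.
fn hash_str(s: &str) -> u64 {
    let mut hasher = DefaultHasher::new();
    s.hash(&mut hasher);
    hasher.finish()
}

/// Static atoms in a flat slot table (assumption: linear probing instead of a
/// real perfect hash). The slot position is the numeric index of the atom.
struct StaticSet {
    slots: Vec<Option<&'static str>>,
}

impl StaticSet {
    fn new(atoms: &[&'static str]) -> Self {
        // Load factor of at most 0.5, so probing always reaches an empty slot.
        let len = (atoms.len() * 2).max(1);
        let mut slots = vec![None; len];
        for &atom in atoms {
            let mut i = (hash_str(atom) % len as u64) as usize;
            while slots[i].is_some() {
                i = (i + 1) % len;
            }
            slots[i] = Some(atom);
        }
        StaticSet { slots }
    }

    /// Looks up `s` using a hash the caller has already computed.
    fn get_index(&self, s: &str, hash: u64) -> Option<usize> {
        let len = self.slots.len();
        let mut i = (hash % len as u64) as usize;
        while let Some(atom) = self.slots[i] {
            if atom == s {
                return Some(i);
            }
            i = (i + 1) % len;
        }
        None
    }
}

#[derive(Debug, Clone, Copy, PartialEq)]
enum Atom {
    Static(usize),
    Dynamic { bucket: usize, pos: usize },
}

struct Interner {
    statics: StaticSet,
    dynamic: Vec<Vec<String>>, // buckets selected by the same hash
}

impl Interner {
    fn new(atoms: &[&'static str], dynamic_buckets: usize) -> Self {
        Interner {
            statics: StaticSet::new(atoms),
            dynamic: vec![Vec::new(); dynamic_buckets.max(1)],
        }
    }

    fn intern(&mut self, s: &str) -> Atom {
        let hash = hash_str(s); // the only hashing of `s`
        if let Some(index) = self.statics.get_index(s, hash) {
            return Atom::Static(index);
        }
        // Static miss: reuse `hash` to pick the dynamic bucket.
        let b = (hash % self.dynamic.len() as u64) as usize;
        let bucket = &mut self.dynamic[b];
        let pos = match bucket.iter().position(|e| e.as_str() == s) {
            Some(p) => p,
            None => {
                bucket.push(s.to_owned());
                bucket.len() - 1
            }
        };
        Atom::Dynamic { bucket: b, pos }
    }

    fn resolve(&self, atom: Atom) -> &str {
        match atom {
            Atom::Static(i) => self.statics.slots[i].expect("valid static index"),
            Atom::Dynamic { bucket, pos } => self.dynamic[bucket][pos].as_str(),
        }
    }
}
```

With an `Interner` built from a few static names, interning one of those names yields `Atom::Static`, while any other string lands in a dynamic bucket chosen from the hash that was already computed for the static lookup.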
SimonSapin added a commit to SimonSapin/rust-phf that referenced this pull request Aug 3, 2015
In https://github.com/servo/string-cache, we currently use a `phf::OrderedSet` with its `get_index` method to get an identifier stored in an `Atom`, and `index` to get a string back from that identifier.

However, the extra indirection of `OrderedSet` over `Set` is not necessary. We don’t care about the order, only about getting numeric identifiers.

Additionally, when `get_index` returns `None`, we hash the input string again to find it in the table of dynamic atoms. With this change, we can reuse the phf hash instead: servo/string-cache#103

At first I tried adding hash and index access to `phf::Map`, but the API got messy quickly.
@pcwalton, would this help servo/servo#6906 ?
I suspect it will, yes.
r? @glennw
@bors-servo r+
📌 Commit 573161f has been approved by
bors-servo pushed a commit that referenced this pull request Sep 1, 2015
Reuse phf hash and remove phf::OrderedSet indirection

<s>Do not merge yet.</s> This depends on rust-phf/rust-phf#62

Use the `phf_shared` and `phf_generator` crates directly instead of `phf`. This allows us to re-use the phf hash in the dynamic table and avoid hashing the same string again.

Also remove the indirection of `phf::OrderedSet` compared to `phf::Set`: we don’t care about the order, only about getting numeric indices. (The optimization mentioned in a comment of using a bit map of the first 64 atoms in the html5ever tree builder was never implemented. If we want it, the indirection and order preservation can be added back while preserving the hash reuse.)

Fixes #38.
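For context on the parenthetical about the unimplemented optimization, here is a minimal sketch of what a bit map over the first 64 atoms could look like. It is purely illustrative: `AtomBitSet` is a hypothetical type, not something from string-cache or html5ever. It shows why such a scheme depends on stable, ordered indices: only atoms whose numeric index is below 64 fit in a single `u64`.

```rust
/// Hypothetical set of atoms, restricted to those with index < 64.
#[derive(Debug, Clone, Copy, Default, PartialEq)]
struct AtomBitSet(u64);

impl AtomBitSet {
    /// Adds the atom with the given index. Returns false (and stores nothing)
    /// when the index does not fit in the 64-bit map.
    fn insert(&mut self, index: usize) -> bool {
        if index < 64 {
            self.0 |= 1u64 << index;
            true
        } else {
            false
        }
    }

    /// Membership test: a shift and a mask, no hashing or string comparison.
    fn contains(&self, index: usize) -> bool {
        index < 64 && (self.0 >> index) & 1 == 1
    }
}
```

Membership then costs a shift and a mask, but only if the atoms of interest are guaranteed to receive the lowest indices, which is the order preservation the description says could be added back.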
☀️ Test successful - travis