-
Notifications
You must be signed in to change notification settings - Fork 13.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Poor codegen for string search (i.e., memchr
) with iterators
#94573
Comments
(I should mention that this used the newest stable rustc and clang as of writing: 1.59 and 13, resp.) |
OK, I godbolted this and am working through the assembler code. It's definitely using SSE on |
Those functions seem different, am I missing something? C++ version returns true when there are at least 2 dashes, while Rust version returns true when there are at least 3 dashes. |
Also, when first |
Note that fn has_more_than_two_dashes(s: &[u8]) -> bool {
sl.iter().filter(|&&c| c == b'-').nth(1).is_some()
} I think C++'s Edit: not to say this is what you should do, but I find interesting that you can manually inline |
There's also memchr crate, which was created to address similar performance issues. |
I am not convinced this should be considered a bug; like @panstromek said, if performance matters you can switch in |
A colleague provided me with this benchmark, which compares the following equivalent C++ and Rust:
C++ performs 20x better in the microbenchmark. We haven't tried very hard to figure out why, but our suspicion is that Rust's implementation of
memchr
is worse than the one provided by the system that ran the benchmark. C++ also bails out early, and Rust does not, but that appears not to matter because the dashes are at the far end of the strings.We're not actually sure what CPU quick-bench ran this with, but I can run it on my (icelake, I think) Xeon later. I think the difference in memchr implementations is worth investigating regardless.
The text was updated successfully, but these errors were encountered: