md5: minor optimization in software backend #755

newpavlov · 2025-11-06T01:31:22Z

Replaces the logical OR in the G function with addition. It seemingly results in a better ALU utilization and improves performance by several percents. From 699 MB/s to 753 MB/s on my x86 PC and from 910 MB/s to 960 MB/s on Mac M4.

Based on #749

newpavlov · 2025-11-06T01:49:55Z

Huh... Compiling op_g in isolation results in the same assembly whether we use wrapping_add or not (see here). In other words, the compiler is able to apply such optimization itself. But it seems this change (accidentally?) nudges the compiler towards a better codegen.

newpavlov · 2025-11-06T01:55:38Z

Interestingly, compiling the full compress function with the wrapping_add change results in more instructions on x86 (672 vs 648), but I guess the resulting code is a bit friendlier to pipeline. On AArch64 the number of instructions is the same (632), but with wrapping_add some orr instructions get replaced with add (i.e. the compiler has failed to apply the optimization when the full function is compiled).

@tarcieri
Could you check whether this change results in a better performance on Mac?

newpavlov · 2025-11-06T04:09:08Z

On M4 this change improves performance from 910 MB/s to 960 MB/s.

tarcieri · 2025-11-06T04:12:49Z

Seems about 5-6% faster on my M1 Max

md5: minor optimization in software backend

fd5bdc8

newpavlov requested a review from tarcieri November 6, 2025 01:31

newpavlov mentioned this pull request Nov 6, 2025

md5: Add optimized AArch64 assembly implementation #749

Closed

tweak comment

bc73dde

newpavlov merged commit 6aa90e8 into master Nov 6, 2025
13 checks passed

newpavlov deleted the md5/add_opt branch November 6, 2025 04:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

md5: minor optimization in software backend #755

md5: minor optimization in software backend #755

Uh oh!

newpavlov commented Nov 6, 2025 •

edited

Loading

Uh oh!

newpavlov commented Nov 6, 2025 •

edited

Loading

Uh oh!

newpavlov commented Nov 6, 2025 •

edited

Loading

Uh oh!

newpavlov commented Nov 6, 2025

Uh oh!

Uh oh!

tarcieri commented Nov 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

md5: minor optimization in software backend #755

md5: minor optimization in software backend #755

Uh oh!

Conversation

newpavlov commented Nov 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

newpavlov commented Nov 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

newpavlov commented Nov 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

newpavlov commented Nov 6, 2025

Uh oh!

Uh oh!

tarcieri commented Nov 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

newpavlov commented Nov 6, 2025 •

edited

Loading

newpavlov commented Nov 6, 2025 •

edited

Loading

newpavlov commented Nov 6, 2025 •

edited

Loading