Reduce runtime of Go Encode() by another 25% #649

willbeason · 2024-12-10T17:09:20Z

This is (another) performance optimization change for the Go implementation of Encode(), which reduces the runtime by about 25%. This is the additional performance improvement I mentioned in #566.

There are two main changes here to accomplish this, either of which may be controversial as they do harm the code's readability. I have attempted to mitigate this with refactorings, and verified the extracted functions are inlined to avoid the additional overhead of function calls.

The first change is that all loops are unrolled. Given that all loops are unconditionally executed a constant number of times, this can be done without introducing any additional branches to the code.

The second change is that the precision rounding logic is modified to require zero divisions. This new method is significantly faster, but may change the final digits of codes requiring sub-centimeter precision (I don't believe this library has this as a supported use case?). There are no cases in the test suite in which this causes a difference, but if desired I can likely find such an edge case.

Before:

will@Janeway:~/GolandProjects/open-location-code/go$ go test --run=NONE --bench=Encode --benchtime=10s -cpuprofile=/home/will/cpu2.out
goos: linux
goarch: amd64
pkg: github.com/google/open-location-code/go
cpu: AMD Ryzen Threadripper 3970X 32-Core Processor 
BenchmarkEncode-64    	186547320	        64.70 ns/op	      16 B/op	       1 allocs/op
PASS
ok  	github.com/google/open-location-code/go	22.222s

After:

will@Janeway:~/GolandProjects/open-location-code/go$ git checkout -
Switched to branch 'optimize-2'
will@Janeway:~/GolandProjects/open-location-code/go$ go test --run=NONE --bench=Encode --benchtime=10s -cpuprofile=/home/will/cpu.out
goos: linux
goarch: amd64
pkg: github.com/google/open-location-code/go
cpu: AMD Ryzen Threadripper 3970X 32-Core Processor 
BenchmarkEncode-64    	233725107	        49.01 ns/op	      16 B/op	       1 allocs/op
PASS
ok  	github.com/google/open-location-code/go	21.471s

CPU profile of before:

CPU profile of after:

Unroll loops to improve performance. Rewrite lat/lng rounding logic to avoid divides. Signed-off-by: Will Beason <willbeason@gmail.com>

go/encode.go

Also merge logic for lat/lng iterations as they are identical. Signed-off-by: Will Beason <willbeason@gmail.com>

…ion-code into optimize-2

willbeason · 2025-01-06T16:31:58Z

Okay, improved comments and renamed functions as requested.

drinckes

Thanks!

Improve performance of Go Encode() implementation further

6a614d8

Unroll loops to improve performance. Rewrite lat/lng rounding logic to avoid divides. Signed-off-by: Will Beason <willbeason@gmail.com>

willbeason force-pushed the optimize-2 branch from 432806b to 6a614d8 Compare December 10, 2024 17:12

willbeason marked this pull request as ready for review December 10, 2024 17:13

drinckes self-assigned this Dec 23, 2024

Merge branch 'main' into optimize-2

33a8353

drinckes reviewed Dec 27, 2024

View reviewed changes

go/encode.go Show resolved Hide resolved

go/encode.go Outdated Show resolved Hide resolved

go/encode.go Outdated Show resolved Hide resolved

go/encode.go Outdated Show resolved Hide resolved

willbeason added 2 commits January 6, 2025 10:29

Improve comments and function names in Go Encode()

268d3a5

Also merge logic for lat/lng iterations as they are identical. Signed-off-by: Will Beason <willbeason@gmail.com>

Merge branch 'optimize-2' of https://github.com/willbeason/open-locat…

0e40085

…ion-code into optimize-2

willbeason requested a review from drinckes January 6, 2025 16:36

drinckes approved these changes Jan 8, 2025

View reviewed changes

Merge branch 'main' into optimize-2

7d96ff3

drinckes merged commit 2586090 into google:main Jan 8, 2025
14 checks passed

willbeason deleted the optimize-2 branch January 8, 2025 17:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce runtime of Go Encode() by another 25% #649

Reduce runtime of Go Encode() by another 25% #649

willbeason commented Dec 10, 2024 •

edited

Loading

willbeason commented Jan 6, 2025

drinckes left a comment

Reduce runtime of Go Encode() by another 25% #649

Reduce runtime of Go Encode() by another 25% #649

Conversation

willbeason commented Dec 10, 2024 • edited Loading

willbeason commented Jan 6, 2025

drinckes left a comment

Choose a reason for hiding this comment

willbeason commented Dec 10, 2024 •

edited

Loading