go-simd

This repo is an ARM64 NEON architecture specific implementation of SIMD (Single Instruction, Multiple Data) operations in Go.

The SIMD instructions are written entirely with Assembly, and does not use CGO, and wrapped in a more useable "API" layer.

Benchmarks

To see benchmarks & tests, run go test -v -bench=. -benchmem

Here are the results from BenchmarkInt8DotProduct on 11/28/2024

goos: darwin
goarch: arm64
pkg: go-simd
cpu: Apple M2 Pro
BenchmarkInt8DotProduct/Scalar-16-12          175903202          6.641 ns/op        0 B/op        0 allocs/op
BenchmarkInt8DotProduct/SIMD-16-12            506505428          2.384 ns/op        0 B/op        0 allocs/op
BenchmarkInt8DotProduct/Scalar-100-12         33686653         35.22 ns/op        0 B/op        0 allocs/op
BenchmarkInt8DotProduct/SIMD-100-12           78084122         15.39 ns/op        0 B/op        0 allocs/op
BenchmarkInt8DotProduct/Scalar-1000-12         3858519        309.3 ns/op        0 B/op        0 allocs/op
BenchmarkInt8DotProduct/SIMD-1000-12          43131205         28.08 ns/op        0 B/op        0 allocs/op
BenchmarkInt8DotProduct/Scalar-4096-12          940941       1268 ns/op        0 B/op        0 allocs/op
BenchmarkInt8DotProduct/SIMD-4096-12          20645560         58.13 ns/op        0 B/op        0 allocs/op
BenchmarkInt8DotProduct/Scalar-10000-12         387094       3101 ns/op        0 B/op        0 allocs/op
BenchmarkInt8DotProduct/SIMD-10000-12          8187171        146.4 ns/op        0 B/op        0 allocs/op
BenchmarkInt8DotProduct/Scalar-100000-12         34800      30903 ns/op        0 B/op        0 allocs/op
BenchmarkInt8DotProduct/SIMD-100000-12          542487       2214 ns/op        0 B/op        0 allocs/op
PASS
ok   go-simd 16.531s

Results are ~14x faster with SIMD for 100000 elements.

Contributing

Development Prerequisites

ARM64 architecture (Apple Silicon or equivalent)
CPU supports Neon SIMD

But other than that just add a test & benchmark for each SIMD operation and open a PR :D

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.github/workflows		.github/workflows
simd_int8		simd_int8
simd_uint8		simd_uint8
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
go.mod		go.mod
main.go		main.go
neon_101.s		neon_101.s

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

go-simd

Benchmarks

Contributing

Development Prerequisites

About

Releases

Packages

Languages

License

jairad26/go-simd

Folders and files

Latest commit

History

Repository files navigation

go-simd

Benchmarks

Contributing

Development Prerequisites

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages