Latest commit zbjornson v2.0.1 Jul 8, 2022 21b721b · Jul 8, 2022 History 56 Commits

node-bswap

The fastest function to swap bytes (a.k.a. reverse the byte ordering, change endianness) of TypedArrays in-place for Node.js and browsers. Uses SIMD when available. Works with all of the TypedArray types, including BigUint64Array and BigInt64Array. Also works on Buffers if you construct a TypedArray view on the underlying ArrayBuffer (see below).

Install:

$ npm install bswap

Use:

import bswap from "bswap";
const x = new Uint16Array([1, 2, 3, 4, 5, 6, 7, 8]);
bswap(x);
// now: Uint16Array [ 256, 512, 768, 1024, 1280, 1536, 1792, 2048 ]

// With buffers:
const b = Buffer.alloc(128);
// This constructs a "view" on the same memory; it does not allocate new memory:
const ui32 = new Uint32Array(b.buffer, b.byteOffset, b.byteLength / Uint32Array.BYTES_PER_ELEMENT);
bswap(ui32);
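
The view construction above relies on the Buffer's `byteOffset` being a multiple of the element size; a misaligned offset makes the TypedArray constructor throw a `RangeError`, and a `byteLength` that is not a multiple of the element size does not map onto whole elements. As a hedged sketch (the helper `canViewAs` is hypothetical and not part of bswap), a check like this could guard the call:

```javascript
// Hypothetical helper, not part of bswap: reports whether `buf` can be viewed
// as whole `TypedArrayCtor` elements without copying.
function canViewAs(buf, TypedArrayCtor) {
  const size = TypedArrayCtor.BYTES_PER_ELEMENT;
  return buf.byteOffset % size === 0 && buf.byteLength % size === 0;
}

const whole = Buffer.alloc(128);
const shifted = whole.subarray(1, 9); // same memory, but byteOffset is 1

console.log(canViewAs(whole, Uint32Array));   // true
console.log(canViewAs(shifted, Uint32Array)); // false

// When the check passes, the view shares memory with the Buffer (no copy).
const sharedView = new Uint32Array(
  whole.buffer,
  whole.byteOffset,
  whole.byteLength / Uint32Array.BYTES_PER_ELEMENT
);
console.log(sharedView.buffer === whole.buffer); // true
```

Buffers that fail the check would need to be copied into an aligned ArrayBuffer first, which gives up the in-place property.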

In Node.js, when native code and a recent x86 or ARM processor are available, this library uses the fastest available SIMD instructions (PSHUFB (SSSE3) or VPSHUFB (AVX2) on x86; REVn (NEON) on ARM), which process multiple array elements simultaneously.

Native code requires one of:

MSVC 2015 or later
Clang 3.4.x or later
GCC 4.8.x or later
ICC 16 or later

In the browser or when native code is unavailable, this library falls back to the fastest JavaScript implementation. The JavaScript implementation is also always explicitly available:

import {js} from "bswap"; // Use javascript implementation explicitly
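
For intuition about what a pure-JavaScript byte swap does, here is a minimal reference sketch. It is not the library's `js` implementation, and the name `swapBytesReference` is hypothetical; it simply reverses the bytes of each element through a `Uint8Array` view of the same memory.

```javascript
// Illustrative reference only: NOT this library's implementation.
// Reverses the bytes of every element of a TypedArray in place.
function swapBytesReference(typedArray) {
  const size = typedArray.BYTES_PER_ELEMENT;
  // View the same memory as raw bytes; no copy is made.
  const bytes = new Uint8Array(
    typedArray.buffer,
    typedArray.byteOffset,
    typedArray.byteLength
  );
  for (let start = 0; start < bytes.length; start += size) {
    // Reverse the bytes of one element by swapping from both ends inward.
    for (let lo = start, hi = start + size - 1; lo < hi; lo++, hi--) {
      const tmp = bytes[lo];
      bytes[lo] = bytes[hi];
      bytes[hi] = tmp;
    }
  }
  return typedArray;
}

const sample = new Uint16Array([1, 2, 3, 4, 5, 6, 7, 8]);
swapBytesReference(sample);
console.log(Array.from(sample));
// [ 256, 512, 768, 1024, 1280, 1536, 1792, 2048 ]
```

Swapping twice restores the original values, which is a convenient sanity check for any implementation.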

Benchmarks

The tables show millions of elements processed per second when invoked with a 10,000-element array. (Run the benchmark suite to see results for varying array lengths and other libraries.) Benchmarks were run on an Intel i7-7700HQ 2.80 GHz processor (AVX2 supported) or a Cavium ThunderX 2.0 GHz processor (ARM NEON), with Node.js v8.x on Windows 10 (MSVC) or Ubuntu 16.04 (GCC, Clang). (Note that a 10,000-element Int16Array fits in L1 cache, whereas a 10,000-element Int32Array or Float64Array does not.)
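
The cache note follows from simple arithmetic on the array sizes. The sketch below assumes a 32 KiB L1 data cache (an assumption for illustration; check the figure for your own CPU), and `fitsInL1` is a hypothetical helper name.

```javascript
// Assumption: 32 KiB of L1 data cache per core.
const ASSUMED_L1_BYTES = 32 * 1024;

// Hypothetical helper: does an array of `elements` items fit in L1?
function fitsInL1(TypedArrayCtor, elements, l1Bytes = ASSUMED_L1_BYTES) {
  return elements * TypedArrayCtor.BYTES_PER_ELEMENT <= l1Bytes;
}

for (const Ctor of [Int16Array, Int32Array, Float64Array]) {
  const bytes = 10000 * Ctor.BYTES_PER_ELEMENT;
  console.log(`${Ctor.name}: ${bytes} bytes, fits in L1: ${fitsInL1(Ctor, 10000)}`);
}
// Int16Array: 20000 bytes, fits in L1: true
// Int32Array: 40000 bytes, fits in L1: false
// Float64Array: 80000 bytes, fits in L1: false
```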

compiler	C++	JS	Native:JS	Node.js	Native:Node
16 bit types (Uint16Array, Int16Array)
MSVC 2015	32,286	625	51.7x	12,141	2.7x
GCC 8.1	31,549	(same)	50.5x	1,507	20.9x
Clang 6	30,238	(same)	48.4x	(same)	20.1x
GCC-ARM	2,677	183	14.6x	297	9.0x
32 bit types (Uint32Array, Int32Array, Float32Array)
MSVC 2015	12,558	342	36.7x	5,840	2.2x
GCC 8.1	12,074	(same)	35.3x	2,361	5.1x
Clang 6	12,587	(same)	36.8x	(same)	5.3x
GCC-ARM	670	94	7.1x	249	2.7x
64 bit types (Float64Array)
MSVC 2015	6,841	179	38.2x	3,043	2.2x
GCC 8.1	6,528	(same)	36.5x	1,790	3.6x
Clang 6	6,598	(same)	36.9x	(same)	3.7x
GCC-ARM	382	49	7.8x	213	1.8x

There's an AVX512 implementation that is disabled by default. On the Cascade Lake CPU that I tested on, it is ~28% faster than the AVX2 version when the data fit in the L1 cache. However, it is ~10% slower than the AVX2 version when the data come from L2 and ~15% slower when they come from L3. Under the assumption that this module is more often used with arrays larger than 32 KB, I've thus left it disabled. At some point I may make it select between AVX2 and AVX512 depending on the array length, but this module has no way of knowing whether the data are resident in the L1 cache.
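
To illustrate the length-based selection idea mentioned above, here is a purely hypothetical sketch. Nothing like `pickKernel` exists in this module, and the 32 KiB threshold is an assumed L1 data cache size.

```javascript
// Hypothetical heuristic, not implemented in this module.
const L1_THRESHOLD_BYTES = 32 * 1024; // assumed L1 data cache size

function pickKernel(typedArray, thresholdBytes = L1_THRESHOLD_BYTES) {
  // Array size is only a proxy: a small array may still be cold in cache,
  // which is the limitation described in the paragraph above.
  return typedArray.byteLength <= thresholdBytes ? "avx512" : "avx2";
}

console.log(pickKernel(new Uint16Array(10000))); // avx512 (20,000 bytes)
console.log(pickKernel(new Uint32Array(10000))); // avx2 (40,000 bytes)
```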

Comparison to other libraries

Library	Operand	In-Place	64-bit Type Support	Browser	Speed (vs. bswap)*
bswap (this)	TypedArray	yes	yes	yes	1.00
node `buffer.swap16/32/64`	Buffer	yes	since 6.3.0	no	0.05 to 0.38
network-byte-order	Number/[Octet]	no	no	yes	0.010
endian-toggle	Buffer	no	yes	no	0.0056

* Higher is better. For 16-bit types, 10k-element arrays. Range given for Node.js version reflects Windows vs. Linux benchmark.
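
The relative speeds in this table appear consistent with dividing the other library's throughput by bswap's throughput. As a worked check, the sketch below recomputes the Node.js range from the 16-bit rows of the benchmark table (`relativeSpeed` is a hypothetical helper name):

```javascript
// 16-bit rows of the benchmark table (millions of elements per second).
const rows = [
  { platform: "Windows (MSVC 2015)", bswap: 32286, node: 12141 },
  { platform: "Linux (GCC 8.1)", bswap: 31549, node: 1507 },
];

// Speed of another implementation relative to bswap (1.00 = equal).
function relativeSpeed(other, bswap) {
  return other / bswap;
}

for (const row of rows) {
  console.log(row.platform, relativeSpeed(row.node, row.bswap).toFixed(2));
}
// Windows (MSVC 2015) 0.38
// Linux (GCC 8.1) 0.05
```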

Node.js' built-in buffer.swap16|32|64 methods (16/32 since v5.10.0; 64 since 6.3.0). Operates in-place. No browser support. Slower except for tiny arrays (where it uses the JS implementation).

In 6.3.0 I added some optimizations to Node.js' implementation. The optimizations are effective on Windows, but GCC does not do the same automatic vectorization that MSVC does, nor does Node's default build config enable the newer SIMD instructions that this library uses.

Usage

```
> Buffer.from(typedArray.buffer).swap16()
```
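
`Buffer.from(arrayBuffer)` creates a Buffer that shares memory with the TypedArray rather than copying it, which is why `swap16()` changes the TypedArray's contents. A small sketch of that behavior follows; it passes `byteOffset` and `byteLength` explicitly so that it also covers TypedArrays that view only part of their ArrayBuffer (the variable names are illustrative):

```javascript
const backing = new ArrayBuffer(16);
// A view that starts 4 bytes into the ArrayBuffer and spans 4 elements.
const u16 = new Uint16Array(backing, 4, 4);
u16.set([1, 2, 3, 4]);

// Shares memory with `u16`; restricts the Buffer to the bytes `u16` covers.
const nodeView = Buffer.from(u16.buffer, u16.byteOffset, u16.byteLength);
nodeView.swap16();

console.log(Array.from(u16)); // [ 256, 512, 768, 1024 ]
```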

endian-toggle. Simple usage, operates on a Node.js Buffer, handles any byte size, returns a new buffer (does not operate in-place).

Usage

```
> const toggle = require("endian-toggle")
> const x = new Uint16Array([2048])
> toggle(Buffer.from(x.buffer), x.BYTES_PER_ELEMENT * 8)
<Buffer 08 00>
```

network-byte-order. Operates on a single value at a time (i.e. needs to be looped to operate on an array) and has separate hton and ntoh methods, which do effectively the same thing but have different syntaxes. It can operate on strings, but it cannot swap 64-bit types.

Usage

// Using hton
> const nbo = require("network-byte-order");
> const b = [];
> nbo.htons(b, 0, 2048);
> b
[8, 0]

// or using ntoh
> const x = new Uint16Array([2048])
> nbo.ntohs(new Uint8Array(x.buffer, x.byteOffset, 2), 0)
8
> const z = new Uint16Array([8])
> new Uint8Array(z.buffer, z.byteOffset, 2)
Uint8Array [ 8, 0 ]
