Candidate benchmarks #6

mhdawson · 2015-05-29T18:14:09Z

This issues is to discuss/identify candidate benchmarks. So far what we have on the list is:

Acme-Air for Node https://github.com/acmeair/acmeair-nodejs
ghost blog (https://github.com/tryghost/Ghost) and create our own load driver
techEmpower (https://www.techempower.com/benchmarks/)
Existing micro benchmarks

We expect that we'll want multiple, with at least one to cover each use case identified in #5

meaku · 2015-06-01T07:58:33Z

Concerning benchmarking "real" apps it might be hard to get meaningful results that can be compared.
If you're benchmarking an app there are many modules involved and external resources like database.

Think of benchmarking acmeair v0.1.1 using io.js 2.2.1 on day 1.
A week later we benchmark acmeair v 0.1.1 using iojs 2.2.2 and the benchmark shows that it's way faster.

But we actually don't know why? Might be an updated module, or a database upgrade or io.js being faster.

I think benchmarking a real application is a good thing, but there should be a clear strategy on how to get reproducible results. Maybe shrinkwrapping modules versions and mocking databases could do the trick.

What do you think?

seabaylea · 2015-06-01T08:10:25Z

Agreed - for any benchmark it will be important to only change one "component" (be that node.js/io.js or used module) at a time, so we know where the change in performance is coming from. Shrinkwrapping is a good approach to making sure that happens.

mhdawson · 2015-06-01T22:04:18Z

+1 for seabaylea's comment. For comparison purposes we'll want to limit the changes so that we can isolate what may have affected results

meaku · 2015-06-02T08:06:24Z

Do you think database latency might be an issue? If we stick to the same machine(s) and database version it might not be a problem.

mhdawson · 2015-06-03T21:08:33Z

Like other variables we'd need to make sure we keep the database version/systems consistent between runs and when we do change the database not change anything else at the same time

mhdawson · 2015-08-10T16:01:35Z

Since WebSockets are often used with Node some that we might consider including:

https://www.npmjs.com/package/thor
https://www.npmjs.com/package/websocket-benchmark
https://www.npmjs.com/package/websocket-bench

trevnorris · 2015-09-01T21:28:09Z

Here are a couple gulp scripts that would put decent pressure on a macro-benchmark: gulpjs/gulp#1118

davisjam · 2018-10-01T20:39:36Z

docs/case_coverage.md is somewhat sparse.

I am interested in helping to populate it.

I'd like use this comment to track recommended benchmarks for the various use cases.
If you have a suggestion, please reply on this issue and I'll update this comment.
The current use cases below are taken from #243.

Use Cases

Node.js a component in a web stack

Use case	Suggested benchmark(s)
Back-end API services	- ezPAARSE? See #76 though. - ?
Service oriented architectures (SOA)	?
Microservice-based applications	- Node-DC-EIS in u-service mode, but see #78. - jasnell and mcollina suggested workloads that are (a) JSON parse/stringify heavy, or (b) use FS and DNS heavily
Generating/serving dynamic web page content	- Acme Air - Node-DC-EIS (monolithic mode) - Node-DC-SSR (electrode) - ghost
Single page applications (SPA)	etherpad-lite

Node.js outside of the web stack

Use case	Suggested benchmark(s)
Scripting and automation	- Micro-benchmark for `require` - Micro-benchmark for node start/stop time
Agents and Data Collectors	Something based on Telegraf?
Developer tooling: web	Web Tooling Benchmark
Developer tooling: Node.js	Run `npm` commands like `npm install` and `npm audit`. Ideally we configure npm to use a local registry to eliminate network interference.
Desktop applications	Electron. Atom.
Systems software	Synthetic workload provided by jorangreef
Embedded software	?

mhdawson · 2018-10-02T19:35:04Z

https://github.com/nodejs/benchmarking/blob/master/docs/case_coverage.md is not completely empty, but it would be happier if there were less blank spaces (which I'm guessing is what you meant).

davisjam · 2018-10-02T21:39:26Z

Yes, I realize that was not clear. I've edited my post to read "somewhat sparse".

jorangreef · 2018-10-03T12:58:46Z

From #243:

@jorangreef I think you might have some comments on the "systems software" use case and perhaps others?

Firstly, thanks @davisjam and everyone here for your efforts expanding the Node.js benchmarking use cases.

Ronomon is an email startup in private beta. It falls into the "systems software" use case. Our new storage stack is being written in Node.js to drive 16x 10TB disks per server.

Things that are important for this use case:

The system encrypts and authenticates large 64KB+ fixed-size disk sectors, and needs to saturate the sequential write throughput of 16 disks. This requires HMAC and AES-256-CTR throughput > 1.6 GB/s. That rules out Node's synchronous crypto from the start, and makes asynchronous crypto essential (https://github.com/ronomon/crypto-async) to avoid blocking and to achieve multi-core throughput. If a disk or storage node fails and we need to rebuild, we can't afford to have the system bottlenecked on the throughput of a single CPU core doing crypto. The alternative of a cluster or multi-process solution would introduce needless complexity and overhead, and defeat the point of using Node.js in the first place, i.e. single-threaded non-blocking control plane with an asynchronous data plane.
Of course, the storage stack is not just doing crypto, it's also doing fs operations, using the same threadpool. At present, this is causing massive head-of-line blocking in the threadpool, with the much faster crypto tasks getting stuck behind the much slower fs tasks. You can imagine what happens when you race the Dakar Rally and the Monaco Grand Prix on the same track. For benchmarking, this means we need to benchmark the threadpool not just for DNS or FS tasks, but also for CPU-intensive tasks.
In addition to crypto and fs tasks, the storage stack also does erasure coding (https://github.com/ronomon/reed-solomon) and deduplication (https://github.com/ronomon/deduplication) using the threadpool. These are too CPU-intensive to be run synchronously, on the order of tens of milliseconds per task, and again we need multi-core throughput to saturate the disks' write bandwidth.
We use direct IO to raw block devices, for more control over a few things, not least to avoid spiking write commit latency due to large write buffer stalls. From a benchmarking point of view, this means that fs benchmarks should reflect realistic disk performance, instead of measuring only the filesystem cache. This becomes especially important when benchmarking the interaction between fs tasks and CPU tasks.
A single Node.js process for one of the storage servers manages 48-64 GB RAM. As a result, most of Ronomon's data structures are already large flat buffers, e.g. https://github.com/ronomon/hash-table, to reduce GC pause times, but reducing GC pause times under load remains critical to avoid blocking the event loop.
Because of the large memory footprint, simple things like spawning a child process asynchronously using Node.js turned out to be synchronous instead, and led to the event loop blocking for 1-2 seconds per async spawn(). We eventually had to stop using spawn() and switched to a unix socket. More benchmarks for the Node.js api for large memory footprints would be brilliant.

I hope this helps, Node.js has been great so far, making it easy to dip into C when needed, and with Javascript as a fast control plane language. It's fantastic to have a whole benchmarking team, and I'm looking forward to seeing CPU-intensive tasks becoming first-class asynchronous citizens.

davisjam · 2018-10-16T21:40:15Z

The TechEmpower benchmark source for Node.js is here.

mhdawson mentioned this issue May 29, 2015

2nd Benchmarking WG meeting #8

Closed

mhdawson mentioned this issue Feb 24, 2016

Node.js-based alternative to jMeter #30

Closed

mhdawson changed the title ~~Canditate benchmarks~~ Candidate benchmarks May 18, 2017

davisjam mentioned this issue Oct 1, 2018

doc: update use cases and case coverage #243

Merged

davisjam mentioned this issue Oct 2, 2018

Can it use system node? ezpaarse-project/ezpaarse#76

Closed

davisjam mentioned this issue Oct 3, 2018

Node.js Foundation Benchmarking WorkGroup Meeting 2018-10-09 #244

Closed

mhdawson added the benchmarking-agenda label Oct 3, 2018

davisjam mentioned this issue Oct 6, 2018

Think about Node library/tool benchmarks #206

Open

mhdawson mentioned this issue Oct 10, 2018

Node.js Foundation Benchmarking WorkGroup Meeting 2018-10-16 #245

Closed

mhdawson mentioned this issue Oct 24, 2018

Node.js Foundation Benchmarking WorkGroup Meeting 2018-10-30 #250

Closed

mhdawson mentioned this issue Nov 14, 2018

Node.js Foundation Benchmarking WorkGroup Meeting 2018-11-20 #253

Closed

mhdawson mentioned this issue Dec 12, 2018

Node.js Foundation Benchmarking WorkGroup Meeting 2018-12-18 #255

Closed

mhdawson mentioned this issue Jan 16, 2019

Node.js Foundation Benchmarking WorkGroup Meeting 2019-01-22 #258

Closed

mhdawson mentioned this issue Feb 6, 2019

Node.js Foundation Benchmarking WorkGroup Meeting 2019-02-12 #260

Closed

mhdawson mentioned this issue Feb 27, 2019

Node.js Foundation Benchmarking WorkGroup Meeting 2019-03-05 #261

Closed

mhdawson removed the benchmarking-agenda label Mar 5, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Candidate benchmarks #6

Candidate benchmarks #6

mhdawson commented May 29, 2015

meaku commented Jun 1, 2015

seabaylea commented Jun 1, 2015

mhdawson commented Jun 1, 2015

meaku commented Jun 2, 2015

mhdawson commented Jun 3, 2015

mhdawson commented Aug 10, 2015

trevnorris commented Sep 1, 2015

davisjam commented Oct 1, 2018 •

edited

Loading

mhdawson commented Oct 2, 2018

davisjam commented Oct 2, 2018

jorangreef commented Oct 3, 2018

davisjam commented Oct 16, 2018

Candidate benchmarks #6

Candidate benchmarks #6

Comments

mhdawson commented May 29, 2015

meaku commented Jun 1, 2015

seabaylea commented Jun 1, 2015

mhdawson commented Jun 1, 2015

meaku commented Jun 2, 2015

mhdawson commented Jun 3, 2015

mhdawson commented Aug 10, 2015

trevnorris commented Sep 1, 2015

davisjam commented Oct 1, 2018 • edited Loading

Use Cases

Node.js a component in a web stack

Node.js outside of the web stack

mhdawson commented Oct 2, 2018

davisjam commented Oct 2, 2018

jorangreef commented Oct 3, 2018

davisjam commented Oct 16, 2018

davisjam commented Oct 1, 2018 •

edited

Loading