
Mutate labels to avoid excessive object cloning. #220

Merged · 1 commit · Sep 21, 2018

Conversation

@nowells (Contributor) commented Sep 20, 2018

I discovered another performance optimization we can perform, which is to mutate the labels and avoid Object.assign where we can in hot paths.

❯ node benchmarks/index.js
Progress:
published#registry#getMetricsAsJSON x 1,819 ops/sec ±4.51% (80 runs sampled)
published#registry#metrics x 1,301 ops/sec ±5.13% (76 runs sampled)
local#registry#getMetricsAsJSON x 9,025 ops/sec ±4.71% (82 runs sampled)
local#registry#metrics x 2,968 ops/sec ±3.59% (85 runs sampled)

Results:
╔══════════════════╤════════════════════════════╤═══════════════════════════╗
║ registry         │ published                  │ local                     ║
╟──────────────────┼────────────────────────────┼───────────────────────────╢
║ getMetricsAsJSON │ 1819.383794048214 ops/sec  │ 9024.761535386682 ops/sec ║
╟──────────────────┼────────────────────────────┼───────────────────────────╢
║ metrics          │ 1300.6959959677145 ops/sec │ 2968.460595395247 ops/sec ║
╚══════════════════╧════════════════════════════╧═══════════════════════════╝

Fastest is local#registry#getMetricsAsJSON
Fastest is local#registry#metrics

@SimenB (Collaborator) commented Sep 20, 2018

Thanks!

> I also decided to include a basic benchmarking suite. If you want me to improve the suite, change it, or remove it entirely let me know, but seems like it might be useful to compare/see performance over time.

That's awesome, we've long wanted a benchmarking suite! But if we add it, I think it should be written in JS using e.g. https://github.com/bestiejs/benchmark.js/.

@nowells (Author) commented Sep 20, 2018

@SimenB happy to do it in JS. I only used the one I added here because I already had it sitting around in another project, so I just copied and pasted it. I'll take on creating the benchmark suite using that library (or something similar). I'll rip out the current suite and leave just the performance changes (feel free to test with the test script locally to confirm).

@SimenB (Collaborator) commented Sep 20, 2018

@KevinAMurray linked to a suite written in the framework I suggested in https://github.com/KevinAMurray/prom-client/tree/master/benchmarks. Might be a good starting point 🙂

lib/registry.js Outdated
})
);
for (const val of item.values) {
for (const label of Object.keys(this._defaultLabels)) {
@SimenB (Collaborator) Sep 20, 2018:

Since this is a perf PR, we can lift Object.keys(this._defaultLabels) out of the inner loop to avoid iterating through that object every time

@nowells (Author):

Definitely

@nowells (Author) commented Sep 20, 2018

With the latest benchmark suite and changes:

❯ node benchmarks/index.js
published#getMetricsAsJSON x 20.24 ops/sec ±7.67% (38 runs sampled)
local#getMetricsAsJSON x 72.32 ops/sec ±5.77% (61 runs sampled)
Fastest is local#getMetricsAsJSON

@SimenB (Collaborator) commented Sep 20, 2018

Oooh, I love the idea of requiring our own latest published version for comparison

lib/histogram.js Outdated
createBucketValues(bucketData, histogram)
);
const buckets = [];
let acc = 0;
@nowells (Author):

I inlined this to avoid creating a function inside a hot loop; it also made performance profiling cleaner. If you don't like it I can revert, but inlining the method did yield a measurable performance gain.

@SimenB (Collaborator):

I'm fine with this, especially if we now have benchmarks where it shows improvement

@SimenB SimenB requested review from siimon and zbjornson September 20, 2018 13:30
lib/histogram.js Outdated
@@ -280,12 +280,13 @@ function convertLabelsAndValues(labels, value) {
function extractBucketValuesForExport(histogram) {
return bucketData => {
const buckets = [];
const bucketLabels = Object.entries(bucketData.labels);
@SimenB (Collaborator):

Object.entries is not available in Node 6 :(

@nowells (Author):

Whoops! I'll switch back to just shifting Object.keys up. Sorry!

@SimenB (Collaborator):

no worries! I thought we had the linter set up to yell, apparently not...

@SimenB (Collaborator):

#221 fwiw 🙂

@nowells (Author) commented Sep 20, 2018

I switched to benchtable (which just extends benchmark internally) for better test setup as well as reporting. If you would rather it just be vanilla benchmark it should be easy to switch back.

❯ node benchmarks/index.js
getMetricsAsJSON for inputs published x 17.88 ops/sec ±7.60% (35 runs sampled)
getMetricsAsJSON for inputs local x 68.51 ops/sec ±8.18% (58 runs sampled)
Fastest is getMetricsAsJSON for inputs local
+-----------+------------------+
|           │ getMetricsAsJSON |
+-----------+------------------+
| published │ 17.88 ops/sec    |
+-----------+------------------+
| local     │ 68.51 ops/sec    |
+-----------+------------------+

@SimenB (Collaborator) commented Sep 20, 2018

benchtable looks nice!

@nowells (Author) commented Sep 20, 2018

> benchtable looks nice!

It is not maintained (the last commit was in 2016), but it just extends benchmark@^2, so it seemed reasonable. I'd understand if you'd rather not pull in a seemingly unmaintained library, but I figure it probably "just works", since it's a thin extension of a library that is actively maintained.

@SimenB (Collaborator) commented Sep 20, 2018

Yeah, no worries. I'd rather think of it as "complete" and not "unmaintained" 😀

@SimenB (Collaborator) commented Sep 20, 2018

Mind updating the changelog?

@nowells (Author) commented Sep 20, 2018

> Mind updating the changelog?

Sure thing!

@nowells (Author) commented Sep 20, 2018

> node ./benchmarks/index.js

getMetricsAsJSON for inputs published x 1,627 ops/sec ±5.03% (78 runs sampled)
getMetricsAsJSON for inputs local x 8,694 ops/sec ±4.96% (82 runs sampled)
metrics for inputs published x 1,252 ops/sec ±6.36% (76 runs sampled)
metrics for inputs local x 2,250 ops/sec ±5.84% (81 runs sampled)
Fastest is getMetricsAsJSON for inputs local
+------------------+---------------+---------------+
|                  │ published     │ local         |
+------------------+---------------+---------------+
| getMetricsAsJSON │ 1,627 ops/sec │ 8,694 ops/sec |
+------------------+---------------+---------------+
| metrics          │ 1,252 ops/sec │ 2,250 ops/sec |
+------------------+---------------+---------------+

@siimon (Owner) commented Sep 20, 2018

Great work! I'll try and have a closer look tonight

lib/registry.js Outdated
valAcc += '\n';
return valAcc;
}, '');
values += line.join(' ').trim();
Collaborator:

values += line.join(' ').trim() + '\n'; to save an assignment

}
})
.on('complete', () => {
// eslint-disable-next-line no-console
Collaborator:

Instead of this, just add it as exception here:

prom-client/.eslintrc

Lines 64 to 69 in 25255c3

{
"files": ["example/**/*.js"],
"rules": {
"no-console": "off"
}
}

@nowells
Copy link
Contributor Author

nowells commented Sep 20, 2018

After Array.join removal.

❯ node benchmarks/index.js
getMetricsAsJSON for inputs published x 1,764 ops/sec ±4.77% (82 runs sampled)
getMetricsAsJSON for inputs local x 9,399 ops/sec ±3.89% (84 runs sampled)
metrics for inputs published x 1,386 ops/sec ±3.71% (83 runs sampled)
metrics for inputs local x 3,016 ops/sec ±4.65% (85 runs sampled)
Fastest is getMetricsAsJSON for inputs local
+------------------+---------------+---------------+
|                  │ published     │ local         |
+------------------+---------------+---------------+
| getMetricsAsJSON │ 1,764 ops/sec │ 9,399 ops/sec |
+------------------+---------------+---------------+
| metrics          │ 1,386 ops/sec │ 3,016 ops/sec |
+------------------+---------------+---------------+

@KevinAMurray:
(I hadn't come across BenchTable before -- very nice!)

Benchmark is a great addition. And loving the performance increases.

package.json Outdated
@@ -39,6 +41,7 @@
"lint-staged": "^7.0.0",
"lolex": "^2.1.3",
"prettier": "1.14.2",
"prom-client": "^11.1.2",
Collaborator:

Love testing against a published version. 👍

This might be a non-issue, but we'll have to update this (and the lock files) after every release (or at least when we care about benchmarks). I don't see a way to exclude a module from the lock files, which would give some options for improving that situation. Not listing it as a dependency at all could be another option, and then we could put the require for it in a try/catch?

Collaborator:

We could add a CI check or something? npm show prom-client version returns the latest published version, and if the local version does not match, throw?

Author:

Yeah, that is interesting. We could have a postinstall step that only runs locally that installs the latest version? But yeah, I hadn't thought of the lockfile issue. Let me know what you prefer.

Collaborator:

Oo, using a package.json script is another good idea. Wouldn't want it to run all the time though, so if it's in postinstall, would have to check somehow if it's being installed in a git checkout or as a dependency, I think?

(Might not be worth the trouble to make this "nice" -- could just manually bump it as necessary.)
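The check discussed here could look something like the sketch below. The version values are stubbed for illustration; the real script would derive them from the installed package and from npm, as noted in the comments.

```shell
set -e

# Stubbed values for illustration; in the real check these would be:
#   INSTALLED_VERSION=$(node -p "require('prom-client/package.json').version")
#   LATEST_VERSION=$(npm show prom-client version)
INSTALLED_VERSION="11.1.2"
LATEST_VERSION="11.1.2"

# Fail the build when the benchmark baseline dependency is stale.
if [ "$INSTALLED_VERSION" != "$LATEST_VERSION" ]; then
  echo "benchmark baseline is stale: installed $INSTALLED_VERSION, latest $LATEST_VERSION" >&2
  exit 1
fi
echo "benchmark baseline is up to date"
```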

@SimenB (Collaborator) commented Sep 20, 2018

Could you add some prose about the benchmarks to a CONTRIBUTING.md or something? Not much, just the fact that they exist and how to run them 🙂

lib/registry.js Outdated
const values = (item.values || []).reduce((valAcc, val) => {
const merged = Object.assign({}, this._defaultLabels, val.labels);
help = `# HELP ${name} ${help}`;
const type = `# TYPE ${name} ${item.type}`;
Owner:

👏 that was long overdue!

lib/registry.js Outdated
@@ -26,42 +26,51 @@ class Registry {
}

getMetricAsPrometheusString(metric, conf) {
const opts = Object.assign({}, defaultMetricsOpts, conf);
Owner:

The point of using Object.assign like this was both to extend and overwrite the default settings and to make sure specific configuration values were always present. I know you changed how this method is called from the metrics function, but this function is publicly available, so this is a breaking change.

Author:

I can revert this change; it's not the biggest perf win. Good call on the potential public API change.

package.json Outdated
"lolex": "^2.1.3",
"prettier": "1.14.3",
"prom-client": "^11.1.2",
Owner:

I guess you could wildcard it? Doesn't solve the lock problem though.

@nowells (Author) commented Sep 20, 2018

> Could you add some prose about the benchmarks to a CONTRIBUTING.md or something? Not much, just the fact that they exist and how to run them 🙂

I was planning on just making the benchmarking part of the test phase, and failing if the performance was not within a standard deviation. Would you rather have just the CONTRIBUTING, just the automatic failures, or both?

@SimenB (Collaborator) commented Sep 20, 2018

I think HW varies too much for it to make sense to run as part of the test build. But I think we can paste the results of whatever we end up merging into a doc as a baseline

@nowells (Author) commented Sep 20, 2018

> I think HW varies too much for it to make sense to run as part of the test build.

The build won't compare against the results of a previous run; it will run both jobs (published and local) in the same environment, so I don't think the hardware matters: everything is relative to the current machine's compute power. If we were comparing against previous results from different hardware, I would agree. Thoughts?

@SimenB (Collaborator) commented Sep 20, 2018

Oh, like run against latest published, then current? That makes sense 🙂

@nowells (Author) commented Sep 21, 2018

1. I added an automatic check that fails if the installed version of prom-client is not the latest on npm.
2. I set up the benchmark suite to fail the build if the local changes perform worse than the published version (we might want to give it some wiggle room in the future).
3. I may have gone a little overboard on fleshing out the benchmark suite.

Success: [screenshot: prom-client benchmark results, passing]

Failure: [screenshot: prom-client benchmark results, failing]

@SimenB (Collaborator) commented Sep 21, 2018

> I may have gone a little overboard on fleshing out the benchmark suite.

Haha, this is awesome! 😀 You might want to consider creating a module we can install for the setup part, but this looks really good. Thank you so much for working on it!

@SimenB (Collaborator) commented Sep 21, 2018

Also, would you mind creating a separate PR for the benchmark (once fully iterated)? That will keep this PR more focused on perf improvements


set -e

INSTALLED_VERSION=`node -p -e "require('prom-client/package.json').version"`
Collaborator:

you don't need the -e, -p is enough 🙂

@nowells (Author) commented Sep 21, 2018

> would you mind creating a separate PR for the benchmark

😆 I had a feeling I was approaching that boundary. Makes total sense; I just couldn't help myself. I will update this PR to contain just the perf changes and move the benchmark suite to a new PR.

> You might want to consider creating a module we can install for the setup part

Great idea! Once I open a separate PR with it and land it, I will take on extracting it into its own package.

> Thank you so much for working on it!

This is an awesome package. Thanks for helping to make it.

@nowells (Author) commented Sep 21, 2018

I extracted the benchmark suite to #222

@siimon siimon merged commit 6674ada into siimon:master Sep 21, 2018
@nowells (Author) commented Sep 21, 2018

Thanks for the great review everyone. And thanks for the merge @siimon! Will you be releasing this as a new version, or waiting to batch it up with other changes? Thanks!!!

@SimenB (Collaborator) commented Sep 22, 2018

prom-client 11.1.3 published 🎉

doochik added a commit to doochik/prom-client that referenced this pull request Sep 20, 2019
Bug introduced at siimon#220 and fixed for .getMetricAsPrometheusString() at siimon#273
doochik added a commit to doochik/prom-client that referenced this pull request Nov 13, 2019
Bug introduced at siimon#220 and fixed for .getMetricAsPrometheusString() at siimon#273
zbjornson pushed a commit that referenced this pull request Nov 13, 2019
Bug introduced at #220 and fixed for .getMetricAsPrometheusString() at #273