encoding/json: marshaling RawMessage has poor performance #33422

rittneje · 2019-08-01T23:06:42Z

What version of Go are you using (`go version`)?

$ go version
go version go1.12.6 linux/amd64

Does this issue reproduce with the latest release?

Yes.

What operating system and processor architecture are you using (`go env`)?

go env Output

$ go env
GOARCH="amd64"
GOBIN=""
GOCACHE="/home/jrittner/.cache/go-build"
GOEXE=""
GOFLAGS=""
GOHOSTARCH="amd64"
GOHOSTOS="linux"
GOOS="linux"
GOPATH="/home/jrittner/go-workspace"
GOPROXY=""
GORACE=""
GOROOT="/home/jrittner/go"
GOTMPDIR=""
GOTOOLDIR="/home/jrittner/go/pkg/tool/linux_amd64"
GCCGO="gccgo"
CC="gcc"
CXX="g++"
CGO_ENABLED="1"
GOMOD=""
CGO_CFLAGS="-g -O2"
CGO_CPPFLAGS=""
CGO_CXXFLAGS="-g -O2"
CGO_FFLAGS="-g -O2"
CGO_LDFLAGS="-g -O2"
PKG_CONFIG="pkg-config"
GOGCCFLAGS="-fPIC -m64 -pthread -fmessage-length=0 -fdebug-prefix-map=/tmp/go-build506812392=/tmp/go-build -gno-record-gcc-switches"

What did you do?

Ran a benchmark to compare marshaling a json.RawMessage, a string and a []byte.

package jsontest

import (
	"encoding/json"
	"testing"
)

const msg = `{"a":"aaaaaaa","b":{"c":["d","e"]}}`

var benchmarkResult interface{}

func BenchmarkRawMessage(b *testing.B) {
	x := json.RawMessage(msg)
	for i := 0; i < b.N; i++ {
		j, err := json.Marshal(x)
		if err != nil {
			b.Fatal(err)
		}
		benchmarkResult = j
	}
}

func BenchmarkString(b *testing.B) {
	x := msg
	for i := 0; i < b.N; i++ {
		j, err := json.Marshal(x)
		if err != nil {
			b.Fatal(err)
		}
		benchmarkResult = j
	}
}

func BenchmarkBytes(b *testing.B) {
	x := []byte(msg)
	for i := 0; i < b.N; i++ {
		j, err := json.Marshal(x)
		if err != nil {
			b.Fatal(err)
		}
		benchmarkResult = j
	}
}

What did you expect to see?

I expected marshaling a json.RawMessage to have the best performance of the three, since it should be a no-op.

What did you see instead?

It is 2 times slower than marshaling a string, and 3 times slower than marshaling a []byte.

BenchmarkRawMessage-2 1000000 1513 ns/op 232 B/op 7 allocs/op
BenchmarkString-2 2000000 869 ns/op 112 B/op 3 allocs/op
BenchmarkBytes-2 3000000 561 ns/op 128 B/op 3 allocs/op

The text was updated successfully, but these errors were encountered:

rittneje · 2019-08-01T23:24:11Z

Investigating further, I believe the slowdown is caused by it trying to unnecessarily compact/validate the json.

go/src/encoding/json/encode.go

Line 456 in 2d6ee6e

err = compact(&e.Buffer, b, opts.escapeHTML)

Replacing this with e.Buffer.Write(b) yields much better performance - 477 ns/op.

agnivade · 2019-08-02T04:42:40Z

@mvdan @dsnet

dsnet · 2019-08-02T05:31:22Z

trying to unnecessarily compact/validate the json

In some applications, this may be considered true, but as a general principle, the encoding/json package strives for correctness first over performance. We can speed up the implementation of compact, but we can't just trivially avoid the call here.

rittneje · 2019-08-02T10:28:41Z

As far as I can tell, compact would only accomplish anything if you had a buggy json.Marshaler implementation. In all the years I've used Go, that has never happened, so I honestly cannot see any justification for this performance hit. It would be nice if there was at least an option on json.Encoder to say "I trust the output of all json.Marshalers to be correct, do not attempt to compact/validate it." (On that note, it occurs to me that there is no way for the encoder options to actually get passed into a custom MarshalJSON method. https://play.golang.org/p/b_DZHIrABif)

dsnet · 2019-08-02T17:58:00Z

In all the years I've used Go, that has never happened, so I honestly cannot see any justification for this performance hit.

Earlier I said: "In some applications, this may be considered true". I don't doubt that this is probably true of your use case. However, it is the current behavior and we can't just remove it as some are relying on this property. Keep in mind that compact does more than simply validate, but also enforces consistent whitespace (or rather lack of) in all outputs.

On that note, it occurs to me that there is no way for the encoder options to actually get passed into a custom MarshalJSON method

Yes. This is a problem that I've written about before regarding encoding/json. In my opinion, this is the primary reason that an option like what you're requesting is hard to fit into the existing API and its current behaviors.

There are many reasonable features to add to encoding/json in isolation, but the problem is that none (or very few) of them operate orthogonally with the existing features.

mvdan · 2019-08-15T09:50:23Z

I'd suggest investigating ways to optimize the current code without changing the API nor adding any options. If it's still too slow, perhaps file a proposal to change the API.

My thinking is similar to @dsnet's; encoding/json values correctness above performance, so any proposed changes to change the package's API should be well thought out.

qingyu31 · 2019-10-24T14:26:00Z

I'd suggest investigating ways to optimize the current code without changing the API nor adding any options. If it's still too slow, perhaps file a proposal to change the API.

My thinking is similar to @dsnet's; encoding/json values correctness above performance, so any proposed changes to change the package's API should be well thought out.

I've got same problem when I analysis performance of my application. Is it not ok to add options to encOpts?

gopherbot · 2019-11-03T09:02:16Z

Change https://golang.org/cl/205018 mentions this issue: encoding/json: prevent compact twice to improve precomputed performance.

dsnet · 2023-10-06T06:08:28Z

Hi all, we kicked off a discussion for a possible "encoding/json/v2" package that addresses the spirit of this proposal.

The prototype v2 implementation has a better parser, able to verify and reformat the result of a MarshalJSON method call much fater. See https://github.com/go-json-experiment/jsonbench#rawvalue-types, which shows that v2 is between 3.6x to 9.1x faster for this situation.

Marshalling a json.RawMessage is not zero overhead. Instead, it compacts the raw message which starts to have an overhead at scale. golang/go#33422 Since we have full control over the message constructed, we can simply write the byte slice into the network stream. This gives considerable performance boost. ``` goos: linux goarch: amd64 pkg: github.com/mattermost/mattermost/server/public/model cpu: Intel(R) Core(TM) i5-8265U CPU @ 1.60GHz │ old.txt │ new_2.txt │ │ sec/op │ sec/op vs base │ EncodeJSON-8 1640.5n ± 2% 289.6n ± 1% -82.35% (p=0.000 n=10) │ old.txt │ new_2.txt │ │ B/op │ B/op vs base │ EncodeJSON-8 528.0 ± 0% 503.0 ± 0% -4.73% (p=0.000 n=10) │ old.txt │ new_2.txt │ │ allocs/op │ allocs/op vs base │ EncodeJSON-8 5.000 ± 0% 4.000 ± 0% -20.00% (p=0.000 n=10) ``` P.S. No concerns over changing the model API because we are still using 0.x https://mattermost.atlassian.net/browse/MM-54998 ```release-note Improve websocket event marshalling performance ```

dsnet added the Performance label Aug 2, 2019

katiehockman added the NeedsDecision Feedback is required from experts, contributors, and/or the community before a change can be made. label Aug 5, 2019

mvdan added this to the Unplanned milestone Aug 15, 2019

qingyu31 mentioned this issue Nov 3, 2019

encoding/json: prevent compact twice to improve precomputed performance. #35320

Closed

s1na mentioned this issue Oct 17, 2023

rpc: improve the realtime notify performance by 30% ethereum/go-ethereum#28328

Merged

agnivade mentioned this issue Nov 3, 2023

MM-54998: Optimize JSON marshalling in websocket broadcast mattermost/mattermost#25286

Merged

gabyhelp mentioned this issue Jun 26, 2024

proposal: encoding/json: avoid massive escape costs #68203

Closed

dsnet mentioned this issue Jun 27, 2024

Somehow expose safeASCII option to outer types? go-json-experiment/json#44

Open

gabyhelp mentioned this issue Jul 11, 2024

testing: Problems with memory allocation calculation for "json.Marshal" in Benchmarkc tests #68381

Closed

gabyhelp mentioned this issue Oct 9, 2024

encoding/json: Unmarshal to a typed struct uses 50% more bytes in go 1.21 (and current versions) vs go 1.20 #69828

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

encoding/json: marshaling RawMessage has poor performance #33422

encoding/json: marshaling RawMessage has poor performance #33422

rittneje commented Aug 1, 2019

rittneje commented Aug 1, 2019

agnivade commented Aug 2, 2019

dsnet commented Aug 2, 2019 •

edited

Loading

rittneje commented Aug 2, 2019

dsnet commented Aug 2, 2019 •

edited

Loading

mvdan commented Aug 15, 2019

qingyu31 commented Oct 24, 2019

gopherbot commented Nov 3, 2019

dsnet commented Oct 6, 2023

encoding/json: marshaling RawMessage has poor performance #33422

encoding/json: marshaling RawMessage has poor performance #33422

Comments

rittneje commented Aug 1, 2019

What version of Go are you using (go version)?

Does this issue reproduce with the latest release?

What operating system and processor architecture are you using (go env)?

What did you do?

What did you expect to see?

What did you see instead?

rittneje commented Aug 1, 2019

agnivade commented Aug 2, 2019

dsnet commented Aug 2, 2019 • edited Loading

rittneje commented Aug 2, 2019

dsnet commented Aug 2, 2019 • edited Loading

mvdan commented Aug 15, 2019

qingyu31 commented Oct 24, 2019

gopherbot commented Nov 3, 2019

dsnet commented Oct 6, 2023

What version of Go are you using (`go version`)?

What operating system and processor architecture are you using (`go env`)?

dsnet commented Aug 2, 2019 •

edited

Loading

dsnet commented Aug 2, 2019 •

edited

Loading