json-serializing a sync response can block the reactor for tens of seconds #6998
Comments
A simple solution might be to farm out the json-serialisation to a threadpool?
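The general shape of that suggestion can be sketched with the stdlib (Synapse itself would use Twisted's threadpool helpers rather than `concurrent.futures`, so treat the names here as illustrative):

```python
import json
from concurrent.futures import ThreadPoolExecutor

# A shared pool: serialising in a worker thread keeps the main
# (reactor) thread free to service other requests in the meantime.
_pool = ThreadPoolExecutor(max_workers=4)

def serialize_in_thread(obj):
    """Return a Future whose result is the JSON-encoded bytes of obj."""
    return _pool.submit(lambda: json.dumps(obj).encode("utf-8"))

future = serialize_in_thread({"next_batch": "s72594_4483_1934"})
print(future.result())  # b'{"next_batch": "s72594_4483_1934"}'
```

One caveat with this approach: a worker thread only helps to the extent that the encoder releases the GIL, which is presumably part of why the discussion below moves on to incremental serialisation instead.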
Actually, I have a much better idea. One approach to this would be to implement, say, a producer which serialises the JSON incrementally and streams it out as it is generated. A slight complication would be that we wouldn't know the content-length ahead of time, so we'd have to use chunked transfer-encoding for the response; but that's easy enough to do.
@auscompgeek this feels like the sort of thing that might interest you. Any interest in picking it up? |
I would agree that using `iterencode` sounds like the right approach here. A small concern: between each value, `iterencode` yields lots of tiny fragments:

```
>>> list(enc.iterencode({'a': 'b', 'c': 'd'}))
['{', '"a"', ':', '"b"', ',', '"c"', ':', '"d"', '}']
```

Does Twisted's HTTP chunking coalesce small chunks together?
I don't think you should let that be the thing that puts you off picking this up - the amount of Twisted knowledge needed should be minimal [1]. I understand you will have other demands on your time though!
I don't think Twisted has any support for generating chunked transfer-encoding (or if it does, I haven't found it), so we'd be implementing that part ourselves. You're right, though: we'll need to coalesce small chunks, to avoid generating thousands of tiny HTTP chunks, each of which gets passed through openssl and sent out as a tiny TCP segment.

[1]: currently, to send responses, we wrap the json bytes in a …
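If it did have to be implemented by hand, the chunked framing itself (RFC 7230 §4.1) is small: each chunk is its length in hex, CRLF, the data, CRLF, and a zero-length chunk terminates the body. A hedged sketch, not Synapse code:

```python
def encode_chunked(chunks):
    """Frame an iterable of byte strings as an HTTP chunked body."""
    for chunk in chunks:
        if chunk:  # a zero-length chunk would terminate the body early
            yield b"%x\r\n%s\r\n" % (len(chunk), chunk)
    yield b"0\r\n\r\n"  # final zero-length chunk ends the body

body = b"".join(encode_chunked([b'{"a": "b"}']))
# body == b'a\r\n{"a": "b"}\r\n0\r\n\r\n'
```

(As the next comment points out, Twisted does in fact do this automatically when no content-length is set, so this framing wouldn't need to be hand-rolled in practice.)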
Twisted uses chunked encoding automatically if you don't set a content-length header. I really thought the Twisted docs had a minimal example of hooking incremental JSON production up to an …
I was curious to take a look at this... and ended up doing it, oops. See #8013. I haven't tested it to see whether it helps perf-wise.
A response to an initial /sync can easily run to hundreds of megabytes of JSON, which takes tens of seconds of CPU time to serialise from the objects. This monopolises the Twisted reactor thread, causing all sorts of other problems (including dropped replication connections for worker-mode deployments).
To make matters worse, although we have a cache which is supposed to avoid redoing a lot of the hard work for duplicate requests to initialsync (though its effectiveness is disputed: see #3880), we cache the de-serialised version, so we have to re-serialise the response for each request. (One reason we have to do this is to populate the `age` field.)