fix #326: Requests write to output streams, they don't create inputstreams #331

carterkozak · 2020-02-14T21:35:57Z

See #326 for details.

carterkozak · 2020-02-16T15:57:58Z

dialogue-java-client/src/main/java/com/palantir/dialogue/HttpChannel.java

+                request.body().get().writeTo(bytes);
+            } catch (IOException e) {
+                throw new SafeRuntimeException("Failed to create a BodyPublisher", e);
+            }


This is the same block from ConjureBodySerDe, but it's only required for HttpChannel, not socket-based transports.

carterkozak · 2020-02-16T15:58:19Z

dialogue-java-client/src/main/java/com/palantir/dialogue/HttpChannel.java

+            } catch (IOException e) {
+                throw new SafeRuntimeException("Failed to create a BodyPublisher", e);
+            }
+            return HttpRequest.BodyPublishers.ofByteArray(bytes.toByteArray());


Here we're able to use the more efficient ofByteArray publisher rather than ofInputStream.

Note that octet-stream requests previously threw (not implemented), whereas here they're fully buffered on heap, as in our feign client. That can be dangerous, but I don't think we should spend the time to optimize that case unless we decide to go all in on the jdk11 client.
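
The buffering approach discussed in this comment can be sketched roughly as follows. This is a minimal illustration rather than the code from this PR: `SketchRequestBody` and `BufferedPublishers` are hypothetical names standing in for dialogue's `RequestBody` and the logic in `HttpChannel`, and the `IOException` is wrapped in `UncheckedIOException` (not `SafeRuntimeException`) only to keep the example free of external dependencies.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.io.UncheckedIOException;
import java.net.http.HttpRequest;

/** Hypothetical stand-in for dialogue's RequestBody: the body writes itself to a stream. */
interface SketchRequestBody {
    void writeTo(OutputStream output) throws IOException;
}

final class BufferedPublishers {
    private BufferedPublishers() {}

    /** Drains the body into a heap buffer, so the whole payload is held in memory at once. */
    static byte[] buffer(SketchRequestBody body) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try {
            body.writeTo(bytes);
        } catch (IOException e) {
            throw new UncheckedIOException("Failed to buffer the request body", e);
        }
        return bytes.toByteArray();
    }

    /** A byte-array publisher knows its size up front, unlike one backed by an InputStream. */
    static HttpRequest.BodyPublisher toPublisher(SketchRequestBody body) {
        return HttpRequest.BodyPublishers.ofByteArray(buffer(body));
    }
}
```

The trade-off is the one noted above: the publisher is simple and reports an exact `contentLength()`, but memory use grows with the size of the body.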

carterkozak · 2020-02-16T16:00:12Z

dialogue-serde/src/main/java/com/palantir/conjure/java/dialogue/serde/ConjureBodySerDe.java

@@ -100,8 +99,8 @@ public RequestBody serialize(BinaryRequestBody value) {
            }

            @Override
-            public InputStream content() {
-                throw new UnsupportedOperationException("TODO(rfink): implement this");


todo: implement tests for binary requests in the abstract channel test.
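
As a rough illustration of what a binary body looks like under the OutputStream model, here is a minimal sketch. `StreamingBody` and `StreamingBodies` are hypothetical names, not dialogue's `RequestBody` or `BinaryRequestBody`, and the copy relies on `InputStream.transferTo`, which assumes Java 9 or later.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.util.function.Supplier;

/** Hypothetical body abstraction: the body pushes its bytes into the transport's stream. */
interface StreamingBody {
    void writeTo(OutputStream output) throws IOException;
}

final class StreamingBodies {
    private StreamingBodies() {}

    /**
     * Adapts a source of bytes to the write-based model. The source stream is opened
     * lazily on each write and closed once it has been copied to the destination.
     */
    static StreamingBody fromStream(Supplier<InputStream> source) {
        return output -> {
            try (InputStream input = source.get()) {
                input.transferTo(output);
            }
        };
    }
}
```

A test for binary requests could then write such a body into a `ByteArrayOutputStream` and compare the result against the source bytes.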

ferozco

It feels weird that we're requiring a request body to advertise its size but also providing an API which encourages the implementer to omit it. I worry that avoiding the duplication of bytes might be premature optimization and we should sit on this until we have some better benchmarks or real-life usage.

carterkozak · 2020-02-16T21:39:45Z

> It feels weird that we're requiring a request body to advertise its size but also providing an API which encourages the implementer to omit it.

I'd be in favor of removing request body size entirely.

carterkozak · 2020-02-16T21:42:16Z

> I worry that avoiding the duplication of bytes might be premature optimization and we should sit on this until we have some better benchmarks or real-life usage

I think this simplifies the API, even ignoring the performance benefit entirely.

ferozco · 2020-02-16T21:55:48Z

> I'd be in favor of removing request body size entirely.

Would you intend to omit the header entirely? It seems we'd be putting ourselves in a bind to try and populate the header from within the channel if we decided to not buffer binary requests.

> I think this simplifies the API, even ignoring the performance benefit entirely.

I agree that it does simplify the API, but I would still prefer to focus on getting the current client out before making too many changes.

carterkozak · 2020-02-16T22:12:18Z

The http client libraries I'm aware of tend to either take chunks of data from a callback/consumer (async-io) or provide an OutputStream to write data to. I don't think we should invent a new model for our API that makes it more difficult and less performant to adapt to clients unless we have a strong reason for it.

> Would you intend to omit the header entirely?

The Content-Length header? Many clients will buffer up to N bytes (often 8 or 16k) and send content-length if the data is fully buffered, otherwise send chunked requests. I think that's a reasonable approach for us to take.

> It seems we'd be putting ourselves in a bind to try and populate the header from within the channel if we decided to not buffer binary requests.

There's not really a downside to sending chunked requests, especially compared to buffering. Binary request buffering gets a little bit funky, but I think it's the same problem that we already have a TODO to implement. It's only going to be special for async-io clients (e.g. netty and the jdk9 client), where InputStream has similar problems. The jdk9 input stream body producer is complicated and we're better off avoiding it if we can; this will give us a better idea of the real performance of the client.

> I would still prefer to focus on getting the current client out before making too many changes

Fully supportive of getting what we have into a good state but I'm also worried that we'll gather metrics to pick our target client using data from a subpar implementation. Once we start to roll dialogue out I imagine API changes like this will be harder to make.
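
The "buffer up to N bytes, otherwise send chunked" behaviour described in this comment could look something like the sketch below. It is only an illustration under stated assumptions: `BodySink` and `ThresholdOutputStream` are hypothetical names that do not exist in dialogue or in any client library, and a real transport would turn the reported length into either a Content-Length header or chunked transfer encoding.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.util.OptionalLong;

/** Hypothetical transport hook, called exactly once before any body bytes are sent. */
interface BodySink {
    /** An empty length means "unknown"; a transport would then use chunked encoding. */
    OutputStream begin(OptionalLong contentLength) throws IOException;
}

/** Buffers up to a threshold; commits to a known length only if the body fits entirely. */
final class ThresholdOutputStream extends OutputStream {
    private final int threshold;
    private final BodySink sink;
    private ByteArrayOutputStream buffer = new ByteArrayOutputStream();
    private OutputStream delegate; // non-null once the length decision has been made

    ThresholdOutputStream(int threshold, BodySink sink) {
        this.threshold = threshold;
        this.sink = sink;
    }

    @Override
    public void write(int b) throws IOException {
        write(new byte[] {(byte) b}, 0, 1);
    }

    @Override
    public void write(byte[] b, int off, int len) throws IOException {
        if (delegate == null && buffer.size() + len > threshold) {
            // Too large to buffer: commit with an unknown length and flush what we hold.
            delegate = sink.begin(OptionalLong.empty());
            buffer.writeTo(delegate);
            buffer = null;
        }
        if (delegate != null) {
            delegate.write(b, off, len);
        } else {
            buffer.write(b, off, len);
        }
    }

    @Override
    public void close() throws IOException {
        if (delegate == null) {
            // The whole body fit in the buffer, so its exact length is known.
            delegate = sink.begin(OptionalLong.of(buffer.size()));
            buffer.writeTo(delegate);
            buffer = null;
        }
        delegate.close();
    }
}
```

With a design like this the request body itself never needs to advertise a length; the decision is made by whatever sits between the body and the wire.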

ferozco · 2020-02-16T22:15:36Z

👍 ok, let's move ahead with this

changelog-app · 2020-02-16T22:36:33Z

Generate changelog in `changelog/@unreleased`

Description: RequestBody API writes to an OutputStream rather than producing an InputStream, for a simpler, more accurate API. (#326)

markelliot · 2020-02-17T16:25:06Z

dialogue-client-test-lib/src/main/java/com/palantir/dialogue/AbstractChannelTest.java

@@ -67,13 +65,8 @@

    private final RequestBody body = new RequestBody() {
        @Override
-        public OptionalLong length() {


when we know the length we probably want to write the Content-Length header

carterkozak · 2020-02-17

Clients which buffer requests prior to sending them do send a Content-Length. This change allows us to avoid fully buffering requests in many cases, since we write directly to a socket.
HttpChannel (java 9 client) could set a Content-Length header, though it never did before.

carterkozak requested review from iamdanfox and ferozco February 14, 2020 21:35

carterkozak force-pushed the ckozak/output_stream branch from 045918d to 8b661a7 Compare February 16, 2020 15:44

ferozco reviewed Feb 16, 2020

carterkozak force-pushed the ckozak/output_stream branch from b6c3bd3 to 08715b4 Compare February 16, 2020 22:27

carterkozak added 3 commits February 16, 2020 17:34

0257cd5 fix #326: Requests write to output streams, they don't create inputstreams

80953f5 Remove RequestBody length

fff9b22 Add generated changelog entries

carterkozak force-pushed the ckozak/output_stream branch from 08715b4 to 80953f5 Compare February 16, 2020 22:35

carterkozak added no changelog and removed no changelog labels Feb 16, 2020

carterkozak added the merge when ready label Feb 16, 2020

bulldozer-bot bot merged commit e0165ef into develop Feb 16, 2020

bulldozer-bot bot deleted the ckozak/output_stream branch February 16, 2020 22:39

markelliot reviewed Feb 17, 2020
