FromInputStreamPublisher: avoid extra allocation of a buffer #2965

idelpivnitskiy · 2024-06-12T23:26:42Z

Motivation:

In #2949 we optimized a case when available() is not implemented and always returns 0. However, we de-optimized a use-case when it's implemented because the last call to available() always returns 0, but we still allocate a buffer of size readChunkSize that won't be used.

Modifications:

Enhance doNotFailOnInputStreamWithBrokenAvailableCall(...) test before any changes for better test coverage.
Remove byte[] buffer from a class variable. It can be a local variable because it's never reused in practice. Only the last buffer won't be nullified, but we don't need it after that.
When available() returns 0, try reading a single byte and then recheck availability instead of always falling back to
readChunkSize.
Adjust doNotFailOnInputStreamWithBrokenAvailableCall() test to account for the 2nd call to available();
Add singleReadTriggersMoreAvailability() test to simulate when the 2nd call to available() returns positive value;

Result:

No allocation of a buffer that won't be used at the EOF.
Account for new availability if it appears after a read().

Benchmark results:

The effect is visible for 256B and 16Kb payloads (less than 1 readChunkSize) because for them an allocation of an extra buffer is a bigger contributor.
available() of 0 means it always returns 0, true - it always returns a real value. Used in-memory ByteArrayInputStream with overridden available() method.

idelpivnitskiy · 2024-06-12T23:28:22Z

...concurrent-api/src/test/java/io/servicetalk/concurrent/api/FromInputStreamPublisherTest.java

-                    new byte[]{31, 32, 33, 34, 35},
-                    new byte[]{36},
+                    // available < readChunkSize
+                    new byte[]{0, 1, 2},


Changes for this test are made in the first commit to demonstrate that they only enhance test coverage and are not driven by the actual change. Consider reviewing this PR by every commit.

Motivation: In apple#2949 we optimized a case when `available()` is not implemented and always returns `0`. However, we de-optimized a use-case when it's implemented because after that change the last call to `available()` always returns 0, but we still allocate a buffer of size `readChunkSize` that won't be used at all. Modifications: - Enhance `doNotFailOnInputStreamWithBrokenAvailableCall(...)` test before any changes to have better test coverage. - Remove `byte[] buffer` from a class variable. It can be a local variable because it's never reused in practice. Only the last `buffer` won't be used nullified, but we don't need it after that. - When `available()` returns `0`, try reading a single byte and then check availability again instead of always falling back to `readChunkSize`. - Adjust `doNotFailOnInputStreamWithBrokenAvailableCall()` test to account for the 2nd call to `available()`; - Add `singleReadTriggersMoreAvailability()` test to simulate when the 2nd call to `available()` returns positive value; Result: 1. No allocation of a `buffer` that won't be used at the EOF. 2. Account for new availability if it appears after a `read()`.

bryce-anderson · 2024-06-12T23:38:08Z

...alk-concurrent-api/src/main/java/io/servicetalk/concurrent/api/FromInputStreamPublisher.java

@@ -176,14 +173,27 @@ public void cancel() {
        private void readAndDeliver(final Subscriber<? super byte[]> subscriber) {
            try {
                do {
+                    int readByte = -1;


ffti

Suggested change

int readByte = -1;

int readByte = END_OF_FILE;

While technically it's the same value, it has a different meaning as "didn't read a byte" vs "read EOF". I would prefer to not confuse it, but ready to change it to Integer.MIN_VALUE instead. WDYT?

Maybe an inline javadoc would help to clear up confusion and we can keep the -1?

Inline comment, or maybe a new constant (could have same or different value) to signal the different meaning would be fine. tbh I'm also fine with just merging as is, thus the ffti.

daschl · 2024-06-14T11:39:36Z

...alk-concurrent-api/src/main/java/io/servicetalk/concurrent/api/FromInputStreamPublisher.java

@@ -176,14 +173,27 @@ public void cancel() {
        private void readAndDeliver(final Subscriber<? super byte[]> subscriber) {
            try {
                do {
+                    int readByte = -1;


Maybe an inline javadoc would help to clear up confusion and we can keep the -1?

idelpivnitskiy requested review from daschl and bryce-anderson June 12, 2024 23:26

idelpivnitskiy commented Jun 12, 2024

View reviewed changes

idelpivnitskiy added 2 commits June 12, 2024 20:38

Enhance doNotFailOnInputStreamWithBrokenAvailableCall to test more cases

c4288fd

idelpivnitskiy force-pushed the readNoAvailable branch from bf71ddb to 70383fa Compare June 13, 2024 03:38

bryce-anderson approved these changes Jun 13, 2024

View reviewed changes

daschl approved these changes Jun 14, 2024

View reviewed changes

comment

d0d6d9f

idelpivnitskiy merged commit 82e256e into apple:main Jun 14, 2024
11 checks passed

idelpivnitskiy deleted the readNoAvailable branch June 14, 2024 22:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FromInputStreamPublisher: avoid extra allocation of a buffer #2965

FromInputStreamPublisher: avoid extra allocation of a buffer #2965

idelpivnitskiy commented Jun 12, 2024 •

edited

Loading

idelpivnitskiy Jun 12, 2024

bryce-anderson Jun 12, 2024

idelpivnitskiy Jun 13, 2024

daschl Jun 14, 2024

bryce-anderson Jun 14, 2024 •

edited

Loading

daschl Jun 14, 2024

FromInputStreamPublisher: avoid extra allocation of a buffer #2965

FromInputStreamPublisher: avoid extra allocation of a buffer #2965

Conversation

idelpivnitskiy commented Jun 12, 2024 • edited Loading

idelpivnitskiy Jun 12, 2024

Choose a reason for hiding this comment

bryce-anderson Jun 12, 2024

Choose a reason for hiding this comment

idelpivnitskiy Jun 13, 2024

Choose a reason for hiding this comment

daschl Jun 14, 2024

Choose a reason for hiding this comment

bryce-anderson Jun 14, 2024 • edited Loading

Choose a reason for hiding this comment

daschl Jun 14, 2024

Choose a reason for hiding this comment

idelpivnitskiy commented Jun 12, 2024 •

edited

Loading

bryce-anderson Jun 14, 2024 •

edited

Loading