stdio: optimize readln algorithm for non-char type by ljmf00 · Pull Request #7681 · dlang/phobos

ljmf00 · 2020-10-27T15:09:24Z

Instead of using buf.length = 0, use buf = buf[0..0] slice technique to clean
the array and reuse, if possible, the buffer to reduce GC allocations.

Signed-off-by: Luís Ferreira contact@lsferreira.net

Benchmark: https://gist.github.com/run-dlang/7d0a5264d186ac8db487447ec5352366

dlang-bot · 2020-10-27T15:09:26Z

Thanks for your pull request and interest in making D better, @ljmf00! We are looking forward to reviewing it, and you should be hearing from a maintainer soon.
Please verify that your PR follows this checklist:

My PR is fully covered with tests (you can see the coverage diff by visiting the details link of the codecov check)
My PR is as minimal as possible (smaller, focused PRs are easier to review than big ones)
I have provided a detailed rationale explaining my changes
New or modified functions have Ddoc comments (with Params: and Returns:)

Please see CONTRIBUTING.md for more information.

If you have addressed all reviews or aren't sure how to proceed, don't hesitate to ping us with a simple comment.

Bugzilla references

Your PR doesn't reference any Bugzilla issue.

If your PR contains non-trivial changes, please reference a Bugzilla issue or create a manual changelog.

Testing this PR locally

If you don't have a local development environment setup, you can use Digger to test this PR:

dub run digger -- build "master + phobos#7681"

ljmf00 · 2020-10-27T16:17:36Z

Force pushed due to codestyle issues.

ljmf00 · 2020-10-27T16:19:55Z

Btw @wilzbach I think its better to discard approval states on active repositories when force push or new commits to avoid merge unwanted and unapproved changes.

wilzbach · 2020-10-27T16:23:52Z

I appreciate the concern, but I think it's fine here as there isn't so much traffic. We already have the bot automatically remove auto-merge from PRs so that it's not possible for an untrusted contributor to sneak code in.

schveiguy · 2020-10-27T16:45:12Z

Instead of using buf.length = 0, use buf = buf[0..0] slice technique to clean
the array and use an appender to do less GC allocations.

buf.length = 0 is identical to buf = buf[0 .. 0].

Not only that, but you are only doing that from within the case where s.length is 0. So you are not trying to reuse the buffer at all, right?

The original did not reuse the buffer (even though it claims to in the docs). Any speedups here are purely from using appender vs. the slower runtime append. Reusing the buffer should add a pretty significant improvement.

ljmf00 · 2020-10-27T17:31:35Z

Instead of using buf.length = 0, use buf = buf[0..0] slice technique to clean
the array and use an appender to do less GC allocations.

buf.length = 0 is identical to buf = buf[0 .. 0].

Not only that, but you are only doing that from within the case where s.length is 0. So you are not trying to reuse the buffer at all, right?

buf.length = 0

uses core.internal.array.capacity._d_arraysetlengthTImpl runtime function template, which internally will basically do:

if (newlength <= (*p).length)
{
    *p = (*p)[0 .. newlength];
    void* newdata = (*p).ptr;
    return newdata[0 .. newlength];
}

which for 0 is basically buf = buf[0..0]. For the compiler this may be hard to optimize and at the end its the same.

The original did not reuse the buffer (even though it claims to in the docs). Any speedups here are purely from using appender vs. the slower runtime append. Reusing the buffer should add a pretty significant improvement.

Yes, you are right, I can reuse the existing buffer space instead of reallocate a new one. ~~Done the changes, please review.~~

ljmf00 · 2020-10-27T17:33:56Z

I appreciate the concern, but I think it's fine here as there isn't so much traffic. We already have the bot automatically remove auto-merge from PRs so that it's not possible for an untrusted contributor to sneak code in.

Ok, fair point 👍

schveiguy · 2020-10-27T18:22:47Z

uses core.internal.array.capacity._d_arraysetlengthTImpl runtime function template

ugh, you are right. When did this change? I thought setting array length to 0 was optimized, I don't think it used to call a runtime function at all.

If I use AST on run.dlang.io to give me the difference between setting length to 0 and slicing, it's... disturbing.

std/stdio.d

schveiguy

Looks good, and simpler. Nice!

Instead of using buf.length = 0, use buf = buf[0..0] slice technique to clean the array and reuse, if possible, the buffer to reduce GC allocations. Signed-off-by: Luís Ferreira <contact@lsferreira.net>

ljmf00 · 2020-10-27T23:47:35Z

ugh, you are right. When did this change? I thought setting array length to 0 was optimized, I don't think it used to call a runtime function at all.

If I use AST on run.dlang.io to give me the difference between setting length to 0 and slicing, it's... disturbing.

I made a PR to the compiler frontend to fix that dlang/dmd#11912 .

ljmf00 requested review from CyberShadow and schveiguy as code owners October 27, 2020 15:09

thewilsonator approved these changes Oct 27, 2020

View reviewed changes

ljmf00 force-pushed the optimize-readln branch from 816fd02 to e17d642 Compare October 27, 2020 16:06

ljmf00 force-pushed the optimize-readln branch from e17d642 to a9c092f Compare October 27, 2020 17:31

schveiguy requested changes Oct 27, 2020

View reviewed changes

std/stdio.d Outdated Show resolved Hide resolved

ljmf00 force-pushed the optimize-readln branch from a9c092f to 6ac656b Compare October 27, 2020 20:21

ljmf00 requested review from schveiguy and thewilsonator October 27, 2020 20:22

schveiguy approved these changes Oct 27, 2020

View reviewed changes

stdio: optimize readln algorithm for non-char type

2c4f731

Instead of using buf.length = 0, use buf = buf[0..0] slice technique to clean the array and reuse, if possible, the buffer to reduce GC allocations. Signed-off-by: Luís Ferreira <contact@lsferreira.net>

ljmf00 force-pushed the optimize-readln branch from 6ac656b to 2c4f731 Compare October 27, 2020 23:45

schveiguy added the Merge:auto-merge label Oct 28, 2020

dlang-bot merged commit 3ea6197 into dlang:master Oct 28, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

stdio: optimize readln algorithm for non-char type#7681

stdio: optimize readln algorithm for non-char type#7681
dlang-bot merged 1 commit intodlang:masterfrom
ljmf00:optimize-readln

ljmf00 commented Oct 27, 2020 •

edited

Loading

Uh oh!

dlang-bot commented Oct 27, 2020

Uh oh!

ljmf00 commented Oct 27, 2020

Uh oh!

ljmf00 commented Oct 27, 2020

Uh oh!

wilzbach commented Oct 27, 2020

Uh oh!

schveiguy commented Oct 27, 2020

Uh oh!

ljmf00 commented Oct 27, 2020 •

edited

Loading

Uh oh!

ljmf00 commented Oct 27, 2020 •

edited

Loading

Uh oh!

schveiguy commented Oct 27, 2020

Uh oh!

Uh oh!

schveiguy left a comment

Uh oh!

ljmf00 commented Oct 27, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

Conversation

ljmf00 commented Oct 27, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dlang-bot commented Oct 27, 2020

Bugzilla references

Testing this PR locally

Uh oh!

ljmf00 commented Oct 27, 2020

Uh oh!

ljmf00 commented Oct 27, 2020

Uh oh!

wilzbach commented Oct 27, 2020

Uh oh!

schveiguy commented Oct 27, 2020

Uh oh!

ljmf00 commented Oct 27, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ljmf00 commented Oct 27, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

schveiguy commented Oct 27, 2020

Uh oh!

Uh oh!

schveiguy left a comment

Choose a reason for hiding this comment

Uh oh!

ljmf00 commented Oct 27, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

ljmf00 commented Oct 27, 2020 •

edited

Loading

ljmf00 commented Oct 27, 2020 •

edited

Loading

ljmf00 commented Oct 27, 2020 •

edited

Loading