Add Iterator trait TrustedLen to enable better FromIterator / Extend #37306

bluss · 2016-10-20T13:15:08Z

This trait attempts to improve FromIterator / Extend code by enabling it to trust the iterator to produce an exact number of elements, which means that reallocation needs to happen only once and is moved out of the loop.

TrustedLen differs from ExactSizeIterator in that it attempts to include more iterators by allowing for the case that the iterator's len does not fit in usize. Consumers must check for this case (for example they could panic, since they can't allocate a collection of that size).

For example, chain can be TrustedLen and all numerical ranges can be TrustedLen. All they need to do is to report an exact size if it fits in usize, and None as the upper bound otherwise.

The trait describes its contract like this:

An iterator that reports an accurate length using size_hint.

The iterator reports a size hint where it is either exact
(lower bound is equal to upper bound), or the upper bound is `None`.
The upper bound must only be `None` if the actual iterator length is
larger than `usize::MAX`.

The iterator must produce exactly the number of elements it reported.

# Safety

This trait must only be implemented when the contract is upheld.
Consumers of this trait must inspect `.size_hint()`’s upper bound.

Fixes #37232

rust-highfive · 2016-10-20T13:15:20Z

r? @alexcrichton

(rust_highfive has picked a reviewer for you, use r? to override)

bluss · 2016-10-20T13:18:56Z

Benchmark and result at https://gist.github.com/bluss/baa98105d141cff3949dda1c1f2d8cce

Using SetLenOnDrop was required to avoid the aliasing troubles in the case of bench_rev_2.

The .chain() cases unfortunately are not optimized well (remains as one loop that goes through the chain's state logic, instead of splitting it into two loops for the front and back part).

bluss · 2016-10-20T13:19:13Z

cc @eddyb @arielb1

Mark-Simulacrum · 2016-10-20T15:28:46Z

I believe at least this adapter for Result also should have TrustedLen implemented on it: https://github.com/rust-lang/rust/blob/master/src/libcore/result.rs#L997. I know it currently doesn't have a size_hint, but @eddyb had me add it in #37270, since we believe it should have it.

bluss · 2016-10-20T15:33:09Z

@Mark-Simulacrum Thanks. Yes, there's a ton of iterators that will need the marker

alexcrichton · 2016-10-20T18:22:31Z

Nice! Does this mean that we can remove some of the specialization around extending vectors from slices as well?

bluss · 2016-10-20T18:30:08Z

@alexcrichton I think it does. We can probably remove .extend_from_slice()'s implementation entirely.

bluss · 2016-10-20T19:07:11Z

Nevermind, extend_from_slice is Clone and extend(&[T]) requires Copy.

The new Vec::extend covers the duties of .extend_from_slice() and some previous specializations.

bluss · 2016-10-21T12:09:11Z

Ah, that's how to do it. Now extend_from_slice's own implementation is gone, and the specialization that redirected to it. (Sorry #37094 !). I verified performance using a simple benchmark (only).

alexcrichton · 2016-10-21T16:54:07Z

Is the specialization of Vec::append(Vec<T>) also still needed? With TrustedLen it seems like that should also optimize down to the same thing, right?

alexcrichton · 2016-10-21T16:57:55Z

The more I read this as well the more I feel like we need more eyes on this. This kind of usage of specialization seems like it could make for some really nasty bugs if we get this wrong. Could we perhaps start more conservatively with implementations of TrustedLen? Maybe those on just slice iterators for now?

The addition of TrustedLen to all the Range iterators makes me a little uneasy as they've always waffled a bit on how precise they are on a various systems and platforms with their size hints.

alexcrichton · 2016-10-21T16:58:02Z

Ah and cc @rust-lang/libs

bluss · 2016-10-21T17:00:04Z

Right, I think it should compile to the same thing. I had the idea that it could reuse other's allocation in some cases (it doesn't, maybe it should?), so I didn't touch it.

I would like to keep the Vec::append specialization, since it's less code for the optimizer to chew on, it doesn't rely on release mode optimizations, and because it's easy to keep what's there.

The Range iterators should be unproblematic, given TrustedLen's contract.

alexcrichton · 2016-10-21T17:03:00Z

We in general have a large problem of lots of code to chew on in LLVM, and while specialization can indeed solve a lot of that I'd personally prefer to err on the side of less source code as there's just fewer unsafe blocks and fewer hoops you need to jump through to see how extend is implemented.

Perhaps some debug assertions could be added to the trusted_len method that the lower bound is the exact same as the upper bound if it's Some? This just feels like a bug waiting to happen...

bluss · 2016-10-21T17:05:49Z

A reason that there is no trusted_len method in the trait itself, is that I was thinking about the best way to avoid bugs, for example that iterators would implement such a hypothetical method correctly, but not size hint. So implementors only need to implement size_hint.

I guess we can add a debug assertion to where this is used.

bluss · 2016-10-21T17:06:32Z

Vec::append is moving all the elements by calling memcpy. We give llvm more to chew on if that's replaced by the two loops in .extend().

alexcrichton · 2016-10-25T03:06:04Z

@rfcbot fcp merge

I personally feel that we should get rid of the other specialization of Vec::extend(Vec<T>) -> Vec::append, but that's a pretty minor concern. Curious what others think!

Vec::append is moving all the elements by calling memcpy. We give llvm more to chew on if that's replaced by the two loops in .extend().

Right it gives LLVM more to chew on, but we've practically never optimized the standard library for minimizing the amount of code we send to LLVM, so I don't think now's really the time to start. Removing the extra specialization keeps it a little more understandable as there's only one specialization to consider, not multiple.

rfcbot · 2016-10-25T03:09:24Z

Team member @alexcrichton has proposed to merge this. The next step is review by the rest of the tagged teams:

No concerns currently listed.

Once these reviewers reach consensus, this will enter its final comment period. If you spot a major issue that hasn't been raised at any point in this process, please speak up!

See this document for info about what commands tagged team members can give me.

bluss · 2016-10-25T07:47:42Z

@alexcrichton Oh, I can be on board with that. As long as Vec::append's own implementation stays as it is.

alexcrichton · 2016-10-25T16:21:23Z

Oh yeah I wouldn't imagine any change to the implementation of Vec::append

This now produces as good code (with optimizations) using the TrustedLen codepath.

bluss · 2016-10-26T23:51:18Z

The special case for <Vec<T>>::extend(Vec<T>) has been removed. The regular trusted len .extend() call was verified to be equivalent using a microbenchmark (but only with optimizations applied).

bluss · 2016-10-27T17:21:56Z

Benchmark file for this PR https://gist.github.com/bluss/3828cadbd92f5a94d020a474b9879f6c

One highlight is the .map() case speeding up, like requested in the bug report:

name                            extend-before-1.log ns/iter  extend-after-1.log ns/iter    diff ns/iter   diff %
bench_map_fast                  6,061                        6,245                                  184    3.04%
bench_map_regular               14,108                       6,429                               -7,679  -54.43%

map_fast is this code:

pub fn map_fast(l: &[(u32, u32)]) -> Vec<u32> {
    let mut result = Vec::with_capacity(l.len());
    for i in 0..l.len() {
        unsafe {
            *result.get_unchecked_mut(i) = l[i].0;
            result.set_len(i);
        }
    }
    result
}

and the other one is just v.extend(data.iter().map(|t| t.1));

bluss · 2016-10-27T17:24:42Z

Something that's not in this PR is to replace the for loop in Vec::extend with .fold(). That has some dramatic effects on .chain() when using extend/collect: https://gist.github.com/bluss/fad6a046491896ae4a4bf4655869b869

Remember that most of these benchmarks are best cases, where the iterator element is a small value and where loop optimizations like unrolling or converting to memcpy have a big impact.

brson · 2016-11-03T23:18:34Z

Waiting for @Kimundi to chime in.

bluss · 2016-11-03T23:25:59Z

Opened a tracking issue and linked it in.

rfcbot · 2016-11-03T23:55:27Z

🔔 This is now entering its final comment period, as per the review above. 🔔

psst @alexcrichton, I wasn't able to add the final-comment-period label, please do so.

alexcrichton · 2016-11-03T23:58:29Z

Looks like travis error is legit?

bluss · 2016-11-04T00:01:53Z

Yes, sorry, syntax error in the stability attr. Fixed by amending that commit.

alexcrichton · 2016-11-04T00:03:48Z

@bors: r+

bors · 2016-11-04T00:03:49Z

📌 Commit f0e6b90 has been approved by alexcrichton

Add Iterator trait TrustedLen to enable better FromIterator / Extend This trait attempts to improve FromIterator / Extend code by enabling it to trust the iterator to produce an exact number of elements, which means that reallocation needs to happen only once and is moved out of the loop. `TrustedLen` differs from `ExactSizeIterator` in that it attempts to include _more_ iterators by allowing for the case that the iterator's len does not fit in `usize`. Consumers must check for this case (for example they could panic, since they can't allocate a collection of that size). For example, chain can be TrustedLen and all numerical ranges can be TrustedLen. All they need to do is to report an exact size if it fits in `usize`, and `None` as the upper bound otherwise. The trait describes its contract like this: ``` An iterator that reports an accurate length using size_hint. The iterator reports a size hint where it is either exact (lower bound is equal to upper bound), or the upper bound is `None`. The upper bound must only be `None` if the actual iterator length is larger than `usize::MAX`. The iterator must produce exactly the number of elements it reported. This trait must only be implemented when the contract is upheld. Consumers of this trait must inspect `.size_hint()`’s upper bound. ``` Fixes rust-lang#37232

bors · 2016-11-04T17:40:30Z

⌛ Testing commit f0e6b90 with merge 81601cd...

Add Iterator trait TrustedLen to enable better FromIterator / Extend This trait attempts to improve FromIterator / Extend code by enabling it to trust the iterator to produce an exact number of elements, which means that reallocation needs to happen only once and is moved out of the loop. `TrustedLen` differs from `ExactSizeIterator` in that it attempts to include _more_ iterators by allowing for the case that the iterator's len does not fit in `usize`. Consumers must check for this case (for example they could panic, since they can't allocate a collection of that size). For example, chain can be TrustedLen and all numerical ranges can be TrustedLen. All they need to do is to report an exact size if it fits in `usize`, and `None` as the upper bound otherwise. The trait describes its contract like this: ``` An iterator that reports an accurate length using size_hint. The iterator reports a size hint where it is either exact (lower bound is equal to upper bound), or the upper bound is `None`. The upper bound must only be `None` if the actual iterator length is larger than `usize::MAX`. The iterator must produce exactly the number of elements it reported. This trait must only be implemented when the contract is upheld. Consumers of this trait must inspect `.size_hint()`’s upper bound. ``` Fixes #37232

bors · 2016-11-04T21:14:46Z

jonhoo · 2017-12-05T16:24:07Z

Was there a particular reason behind not taking advantage of TrustedLen for HashMap::from_iter?

bluss · 2017-12-05T16:33:53Z

No reason, if it makes sense, just explore it

bluss added 4 commits October 20, 2016 14:07

Introduce iterator trait TrustedLen

9ae9930

Use TrustedLen for Vec's FromIterator and Extend

4955711

Implement TrustedLen for more iterators

69b9400

Document TrustedLen’s contract

a3cab90

rust-highfive assigned alexcrichton Oct 20, 2016

vec: Use Vec::extend specializations in extend_from_slice and more

622f24f

The new Vec::extend covers the duties of .extend_from_slice() and some previous specializations.

vec: Add a debug assertion where TrustedLen is used

ee84ec1

alexcrichton added the T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. label Oct 25, 2016

bluss added 2 commits October 27, 2016 00:18

impl TrustedLen for vec::IntoIter

2411be5

vec: Remove the Vec specialization for .extend()

5dc9db5

This now produces as good code (with optimizations) using the TrustedLen codepath.

brson added the relnotes Marks issues that should be documented in the release notes of the next release. label Nov 1, 2016

bluss mentioned this pull request Nov 3, 2016

Tracking issue for TrustedLen (trusted_len) #37572

Open

Link the tracking issue for TrustedLen

f0e6b90

bluss force-pushed the trusted-len branch from 9688dce to f0e6b90 Compare November 4, 2016 00:01

sophiajt mentioned this pull request Nov 4, 2016

Rollup of 17 pull requests #37581

Closed

bors merged commit f0e6b90 into rust-lang:master Nov 4, 2016

bluss deleted the trusted-len branch November 4, 2016 22:03

bluss mentioned this pull request Nov 26, 2016

Serialization performance regression #38021

Closed

bluss mentioned this pull request Jul 16, 2017

Use internal iteration in FromIterator and Extend implementations. #43255

Closed

oberien mentioned this pull request Dec 30, 2017

Add UnboundedIterator Trait #47082

Closed

ssomers mentioned this pull request Aug 1, 2021

BTree: add drain methods #81075

Closed

Add Iterator trait TrustedLen to enable better FromIterator / Extend #37306

Add Iterator trait TrustedLen to enable better FromIterator / Extend #37306

Uh oh!

Conversation

bluss commented Oct 20, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rust-highfive commented Oct 20, 2016

Uh oh!

bluss commented Oct 20, 2016

Uh oh!

bluss commented Oct 20, 2016

Uh oh!

Mark-Simulacrum commented Oct 20, 2016

Uh oh!

bluss commented Oct 20, 2016

Uh oh!

alexcrichton commented Oct 20, 2016

Uh oh!

bluss commented Oct 20, 2016

Uh oh!

bluss commented Oct 20, 2016

Uh oh!

bluss commented Oct 21, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

alexcrichton commented Oct 21, 2016

Uh oh!

alexcrichton commented Oct 21, 2016

Uh oh!

alexcrichton commented Oct 21, 2016

Uh oh!

bluss commented Oct 21, 2016

Uh oh!

alexcrichton commented Oct 21, 2016

Uh oh!

bluss commented Oct 21, 2016

Uh oh!

bluss commented Oct 21, 2016

Uh oh!

alexcrichton commented Oct 25, 2016

Uh oh!

rfcbot commented Oct 25, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bluss commented Oct 25, 2016

Uh oh!

alexcrichton commented Oct 25, 2016

Uh oh!

bluss commented Oct 26, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bluss commented Oct 27, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bluss commented Oct 27, 2016

Uh oh!

brson commented Nov 3, 2016

Uh oh!

bluss commented Nov 3, 2016

Uh oh!

rfcbot commented Nov 3, 2016

Uh oh!

alexcrichton commented Nov 3, 2016

Uh oh!

bluss commented Nov 4, 2016

Uh oh!

alexcrichton commented Nov 4, 2016

Uh oh!

bors commented Nov 4, 2016

Uh oh!

bors commented Nov 4, 2016

Uh oh!

bors commented Nov 4, 2016

Uh oh!

jonhoo commented Dec 5, 2017

Uh oh!

bluss commented Dec 5, 2017

Uh oh!

Reviewers

bluss commented Oct 20, 2016 •

edited

Loading

bluss commented Oct 21, 2016 •

edited

Loading

rfcbot commented Oct 25, 2016 •

edited

Loading

bluss commented Oct 26, 2016 •

edited

Loading

bluss commented Oct 27, 2016 •

edited

Loading