Specialise count, last and nth for Cloned and Map iterators #28125

nagisa · 2015-08-31T12:20:14Z

Map/Cloned does not call the mapping/cloning function unnecessarily anymore for count, last and nth, which has a serious benefit to performance, especially with expensive mapping functions.

For example

(1..10).map(|x| {
    for y in 1..50 { black_box(y); }
    black_box(x)
}).last()

runs in 25 ns/iter (+/- 9) compared to 238 ns/iter (+/- 21) with nightly.

rust-highfive · 2015-08-31T12:20:28Z

r? @nikomatsakis

(rust_highfive has picked a reviewer for you, use r? to override)

nagisa · 2015-08-31T13:18:30Z

Thinking now, this might need an RFC since it breaks people’s assumptions with their side-effectful Clone impls and mapping functions.

Please advise.

bluss · 2015-08-31T13:26:28Z

My idea has been that these can only be specialized if there is no user-visible difference from the default Iterator-trait provided implementation of last, nth, count. We can change, good to discuss explicitly.

bluss · 2015-08-31T13:27:49Z

I believe these specializations aren't of very high importance. Better support for bidirectional or random access would be bigger.

ranma42 · 2015-08-31T13:30:04Z

src/libcore/iter.rs

+    #[inline]
+    fn last(self) -> Option<B> {
+        let f = self.f;
+        self.iter.last().map(f)


Just nitpicking... is there a reason why last does not use the same syntax as next and nth? (i.e. self.iter.last().map(|a| (self.f)(a)))

Because it doesn’t compile this way. self is by value here and is therefore partially moved by self.iter which does not allow capturing self into the closure.

Oh, you're right, I did not realise that the signature was different... but then self.iter.last().map(self.f) should work just fine, right?

Yeah, it works.

nagisa · 2015-08-31T13:31:42Z

I think they are important. If another method is added to Iterator in future releases, there could be incentive to make it run clone/mapping over all elements (because that’s what other functions do currently) even if not doing so would be very advantageous to performance of said method on Map-like iterators.

alexcrichton · 2015-08-31T18:08:23Z

I'm totally fine with this, pedantically this is a breaking change but practically I highly doubt that it will be. I'll tag this with T-libs so we can discuss but our conclusion may just be to clarify the documentation of the methods on the Iterator trait to indicate that the entire iterator may not be exhausted.

bluss · 2015-08-31T18:10:58Z

@alexcrichton It's not about breaking changes but about how we want iterators to behave

bluss · 2015-09-01T12:54:00Z

@alexcrichton It breaks existing code. Examples: (1), (2), (3), (4)

.count() is a method known to consume the iterator, with side effects. I think we should maintain it can only be specialized if the difference is invisible (for example: the slice iterator).

nikomatsakis · 2015-09-03T20:00:40Z

r? @alexcrichton

nagisa · 2015-09-05T21:06:23Z

I’m willing to limit the scope to Cloned only, as suggested by @bluss on IRC.

While it is still theoretically breaking, I think the impact should be non-existent in the real world.

Stebalien · 2015-09-08T15:03:05Z

Definitely not map. The last documentation even explicitly says "loops through the entire iterator". I'm hesitant about cloned but, if you do want to do that, you should change the documentation to specify that it only clones when necessary.

ranma42 · 2015-09-08T16:42:35Z

If I am not mistaken, this would impact both side-effectful Clone and Drop implementations (it might be obvious to most devs, but the Drop part has not yet been mentioned so far).

Has there been any investigation on what implementations of the Iterator trait need an explicit specialisation for count, last and nth and which ones can be automatically optimised by LLVM?
This would be more fragile, because updates to the code generation logic and to LLVM might cause regressions, but we might try to make it more robust by extending the codegen tests to also verify that (when optimisations are enabled), rustc generates the desired code.

Kimundi · 2015-09-23T23:11:48Z

The count()-not-exhausting issue might be an argument for just giving in and adding an .drain() method to iterators that runs them to exhaustion, and is defined as such.

This would address the fair-to-common usecase where someone just wants to chain a method to run the iterator chain, instead of using the for syntax with a _ pattern, and where the best current workaround is calling count() and ignoring the result.

alexcrichton · 2015-09-24T15:33:53Z

Just as an update, the libs team has had quite a full agenda the past few weeks (dealing with FCP both ending and starting anew), but we'll be sure to get around to this next week!

bluss · 2015-09-26T12:01:52Z

@Kimundi I think an actual .foreach(f) is the strictly more versatile alternative; it can serve as both. It's requested all the time by, I guess, a section of programmers used to that kind of style from other languages. Either way, we can't change the semantics of .count().

alexcrichton · 2015-10-02T15:56:12Z

Ok, thanks for the patience here! The libs team talked about this and the conclusion was that these iterator adaptor methods should always preserve the same semantics as the current default implementations, specifically with respect to side effects that may be seen.

In light of that I'm going to close this as "unfortunately we can't do this", and otherwise I've opened a tracking issue for auditing those changes we've already made in the standard library.

bluss · 2015-10-02T22:11:44Z

Thank you, that sounds great for iterators.

…).last() Iterator::last() consumes the entire iterator, even for DoubleEndedIterator, see rust-lang/rust#28125 (comment) Because of this, "at_line_start()" took 90% of fish_indent share/completions/git.fish making it take 1000ms instead of 30 ms. Fix that.

rust-highfive assigned nikomatsakis Aug 31, 2015

nagisa mentioned this pull request Aug 31, 2015

Change explicit BytesDeref impl into Cloned iterator #28119

Merged

ranma42 reviewed Aug 31, 2015
View reviewed changes

nagisa force-pushed the clonediter branch from 899bd7d to 8d1e5ea Compare August 31, 2015 13:43

alexcrichton added the T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. label Aug 31, 2015

rust-highfive assigned alexcrichton and unassigned nikomatsakis Sep 3, 2015

Specialise count, last, nth for Cloned iterator

17b22f7

nagisa force-pushed the clonediter branch from 8d1e5ea to 17b22f7 Compare September 5, 2015 21:09

aturon added the I-needs-decision Issue: In need of a decision. label Sep 16, 2015

alexcrichton mentioned this pull request Oct 2, 2015

Audit iterator specializations for side effects #28810

Closed

alexcrichton closed this Oct 2, 2015

alexcrichton removed the I-needs-decision Issue: In need of a decision. label Oct 2, 2015

bluss mentioned this pull request Sep 26, 2016

iter.map() should not guarantee that the closure is executed for all the iterated elements. rust-lang/rfcs#1757

Open

Stebalien mentioned this pull request Sep 27, 2016

Specialize methods on iter::Cloned<I> where I::Item: Copy. #36791

Closed

nagisa mentioned this pull request Jan 13, 2017

Specialize methods on iter::Cloned<I> where I::Item: Copy. #39022

Closed

nagisa mentioned this pull request Jun 10, 2017

Specialize Iterator::last for DoubleEndedIterator. #42584

Closed

Specialise count, last and nth for Cloned and Map iterators #28125

Specialise count, last and nth for Cloned and Map iterators #28125

Uh oh!

Conversation

nagisa commented Aug 31, 2015

Uh oh!

rust-highfive commented Aug 31, 2015

Uh oh!

nagisa commented Aug 31, 2015

Uh oh!

bluss commented Aug 31, 2015

Uh oh!

bluss commented Aug 31, 2015

Uh oh!

ranma42 Aug 31, 2015

Choose a reason for hiding this comment

Uh oh!

nagisa Aug 31, 2015

Choose a reason for hiding this comment

Uh oh!

ranma42 Aug 31, 2015

Choose a reason for hiding this comment

Uh oh!

nagisa Aug 31, 2015

Choose a reason for hiding this comment

Uh oh!

nagisa commented Aug 31, 2015

Uh oh!

alexcrichton commented Aug 31, 2015

Uh oh!

bluss commented Aug 31, 2015

Uh oh!

bluss commented Sep 1, 2015

Uh oh!

nikomatsakis commented Sep 3, 2015

Uh oh!

nagisa commented Sep 5, 2015

Uh oh!

Stebalien commented Sep 8, 2015

Uh oh!

ranma42 commented Sep 8, 2015

Uh oh!

Kimundi commented Sep 23, 2015

Uh oh!

alexcrichton commented Sep 24, 2015

Uh oh!

bluss commented Sep 26, 2015

Uh oh!

alexcrichton commented Oct 2, 2015

Uh oh!

bluss commented Oct 2, 2015

Uh oh!

Uh oh!