implement fold() on array::IntoIter to improve flatten().collect() perf #87431

the8472 · 2021-07-24T15:59:29Z

With #87168 flattening array::IntoIters is now TrustedLen, the FromIterator implementation for Vec has a specialization for TrustedLen iterators which uses internal iteration. This implements one of the main internal iteration methods on array::Into to optimize the combination of those two features.

This should address the main issue in #87411

# old
test vec::bench_flat_map_collect                         ... bench:   2,244,024 ns/iter (+/- 18,903)

# new
test vec::bench_flat_map_collect                         ... bench:     172,863 ns/iter (+/- 2,141)

rust-highfive · 2021-07-24T15:59:32Z

r? @kennytm

(rust-highfive has picked a reviewer for you, use r? to override)

``` # old test vec::bench_flat_map_collect ... bench: 2,244,024 ns/iter (+/- 18,903) # new test vec::bench_flat_map_collect ... bench: 172,863 ns/iter (+/- 2,141) ```

kennytm · 2021-07-24T20:50:24Z

while this LGTM, shouldn't the original issue be addressed by implementing SpecExtend<T, std::array::IntoIter<T>> for Vec<T, A>?

the8472 · 2021-07-24T21:00:35Z

The original issue involved Flatten which results in several adapters sitting between SpecExtend and the IntoIter.

kennytm · 2021-07-24T21:40:08Z

library/core/src/array/iter.rs

+        (&mut self.alive)
+            .try_fold::<_, _, Result<_, !>>(init, |acc, idx| {
+                // SAFETY: idx is obtained by folding over the `alive` range, which implies the
+                // value is currently considered alive but as the range is being consumed each value
+                // we read here will only be read once and then considered dead.
+                Ok(fold(acc, unsafe { data.get_unchecked(idx).assume_init_read() }))
+            })
+            .unwrap()


can we call fold here instead of try_fold?

Suggested change

(&mut self.alive)

.try_fold::<_, _, Result<_, !>>(init, |acc, idx| {

// SAFETY: idx is obtained by folding over the `alive` range, which implies the

// value is currently considered alive but as the range is being consumed each value

// we read here will only be read once and then considered dead.

Ok(fold(acc, unsafe { data.get_unchecked(idx).assume_init_read() }))

})

.unwrap()

self.alive.fold(init, |acc, idx| {

// SAFETY: idx is obtained by folding over the `alive` range, which implies the

// value is currently considered alive but as the range is being consumed each value

// we read here will only be read once and then considered dead.

fold(acc, unsafe { data.get_unchecked(idx).assume_init_read() })

})

I tried that, array::IntoIter has a Drop impl, so alive can't be move out, but that would be required to call fold(self), that's why I used try_fold instead.

@the8472 oops right.

(&mut self.alive).fold(init, ...) should work though.

Yeah but that would go through impl Iterator for &mut I which is less optimized.

self.alive is a std::ops::Range<usize> and AFAIK there is no special-cased implementation of fold or try_fold for Range<usize> nor &mut Range<usize>.

even if we do the mem::take it will just turn self.alive to 0..0 and then leaks everything which is safe 🙃 (compared with self.alive.clone().fold(...) which will cause double-free).

I have now tried (&mut self.alive).fold instead of try_fold, it undoes all perfomance gains. I guess somehow the indirection through &mut inhibits optimizations.

Which is due to this not being #[inline]

rust/library/core/src/iter/traits/iterator.rs

Lines 3474 to 3478 in 71a6c7c

impl<I: Iterator + ?Sized> Iterator for &mut I {

type Item = I::Item;

fn next(&mut self) -> Option<I::Item> {

(**self).next()

}

heh.

wdyt should we just add the #[inline] or leave a FIXME comment explaining the performance regression if we use fold instead of try_fold? either way is fine for me.

I'll add a FIXME, changing inlining on such central methods can have mixed impact on compile time even if runtime performance is better, so that should be done on a separate PR.

kennytm · 2021-07-27T08:50:38Z

@bors r+ rollup=iffy

bors · 2021-07-27T08:50:39Z

📌 Commit 2276c5e has been approved by kennytm

bors · 2021-07-27T10:38:46Z

⌛ Testing commit 2276c5e with merge 99d6692...

bors · 2021-07-27T13:24:18Z

☀️ Test successful - checks-actions
Approved by: kennytm
Pushing 99d6692 to master...

inline next() on &mut Iterator impl In [rust-lang#87431](https://github.com/rust-lang/rust/pull/87431/files#diff-79a6b417b85ecf4f1a4ef2235135fedf540199caf6e9e1d154ac6a413b40a757R132-R136) I found that `(&mut range).fold` doesn't optimize well because the default impl for for `fold` on `&mut Iterator` doesn't inline `next`. In that particular case it was worked around by using `try_fold` which takes a `&mut self` instead of `self`. Let's see if this can be fixed more broadly.

rust-highfive assigned kennytm Jul 24, 2021

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Jul 24, 2021

This comment has been minimized.

Sign in to view

the8472 force-pushed the array-iter-fold branch from 5187ca4 to f4ea250 Compare July 24, 2021 16:12

This comment has been minimized.

Sign in to view

the8472 force-pushed the array-iter-fold branch from f4ea250 to cb585e9 Compare July 24, 2021 16:37

the8472 mentioned this pull request Jul 24, 2021

FlatMap, Flatten appear to optimize badly #87411

Closed

the8472 added the T-libs Relevant to the library team, which will review and decide on the PR/issue. label Jul 24, 2021

implement fold() on array::IntoIter to improve flatten().collect() perf

e015e9d

``` # old test vec::bench_flat_map_collect ... bench: 2,244,024 ns/iter (+/- 18,903) # new test vec::bench_flat_map_collect ... bench: 172,863 ns/iter (+/- 2,141) ```

the8472 force-pushed the array-iter-fold branch from cb585e9 to e015e9d Compare July 24, 2021 17:24

kennytm reviewed Jul 24, 2021

View reviewed changes

from review: add a comment why try_fold was chosen instead of fold

2276c5e

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Jul 27, 2021

bors added the merged-by-bors This PR was explicitly merged by bors. label Jul 27, 2021

bors merged commit 99d6692 into rust-lang:master Jul 27, 2021

rustbot added this to the 1.56.0 milestone Jul 27, 2021

the8472 mentioned this pull request Oct 11, 2021

inline next() on &mut Iterator impl #89774

Merged

lcnr added the A-const-generics Area: const generics (parameters and arguments) label Dec 11, 2021

the8472 mentioned this pull request Apr 3, 2022

Fix array::IntoIter::fold to use the optimized Range::fold #95602

Merged

	impl<I: Iterator + ?Sized> Iterator for &mut I {
	type Item = I::Item;
	fn next(&mut self) -> Option<I::Item> {
	(**self).next()
	}

implement fold() on array::IntoIter to improve flatten().collect() perf #87431

implement fold() on array::IntoIter to improve flatten().collect() perf #87431

Uh oh!

Conversation

the8472 commented Jul 24, 2021

Uh oh!

rust-highfive commented Jul 24, 2021

Uh oh!

This comment has been minimized.

This comment has been minimized.

kennytm commented Jul 24, 2021

Uh oh!

the8472 commented Jul 24, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

the8472 Jul 24, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kennytm Jul 24, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kennytm commented Jul 27, 2021

Uh oh!

bors commented Jul 27, 2021

Uh oh!

bors commented Jul 27, 2021

Uh oh!

bors commented Jul 27, 2021

Uh oh!

Uh oh!

the8472 commented Jul 24, 2021 •

edited

Loading

the8472 Jul 24, 2021 •

edited

Loading

kennytm Jul 24, 2021 •

edited

Loading