needless_collect: confusing suggestion & questionable transformation #6164

roy-work · 2020-10-12T19:47:55Z

Take the following example:

fn main() {
  let mock_array = &["foo", "BAR", "baz"];
  let mock_lowercase = mock_array.iter().map(|i| i.to_lowercase()).collect::<Vec<_>>();
  // This is some stuff,
  // to pretend that we have some more
  // code present here
  // like we did
  // in the real
  // deal.
  for _ in 0..2 {
    println!("Is bar present? {:?}", mock_lowercase.contains(&"bar".to_owned()));
  }
}

(Note that in a real-world scenario, all of mock_array, the number of iterations of the for loop, and the input to .contains() might all be dynamic. Don't read too much into that things in the example are constant.)

Running cargo clippy on this gives the following output:

warning: avoid using `collect()` when not needed
  --> src/main.rs:3:3
   |
3  | /   let mock_lowercase = mock_array.iter().map(|i| i.to_lowercase()).collect::<Vec<_>>();
4  | |   // This is some stuff,
5  | |   // to pretend that we have some more
6  | |   // code present here
...  |
10 | |   for _ in 0..2 {
11 | |     println!("Is bar present? {:?}", mock_lowercase.contains(&"bar".to_owned()));
   | |_____________________________________^
   |
   = note: `#[warn(clippy::needless_collect)]` on by default
help: Check if the original Iterator contains an element instead of collecting then checking
   |
3  |
4  |   // This is some stuff,
5  |   // to pretend that we have some more
6  |   // code present here
7  |   // like we did
8  |   // in the real
 ...

warning: 1 warning emitted

There are two issues with this:

The suggestion is elided

The bottom of the diagnostic from clippy quotes lines 3–8; that leaves the reader wondering … why? If I remove those lines (here, a large comment, but in our actual code that we hit this with, it was a mix of code, comments, and blank lines), we see then that clippy is attempting to output a suggestion:

help: Check if the original Iterator contains an element instead of collecting then checking
  |
3 |
4 |   for _ in 0..2 {
5 |     println!("Is bar present? {:?}", mock_array.iter().map(|i| i.to_lowercase()).any(|x| x == &"bar".to_owned()));

Unfortunately, in both our real world case & in this test case, the actual suggestion gets elided.

The suggestion is questionable

Here, the suggestion to not use collect in combination with the for loop means that we are now repeating the .map() call multiple times, for each item. In the original code, the work of the map is done once, outside the for loop, and the collected results then allow us to amortize the cost of that over the many searches done by the for loop. Whether it is worth it to repeat the .map() in each search or to .collect the result into a Vec that the for loop can re-use depends heavily on what the map is doing.

In our test case, we're lowercasing a bunch of strings. Each of the to_lowercase calls will require a heap allocation to store the result. It's enough that I think the original author's use of a Vec isn't wrong.

More meta

Worse, in our real world case, since clippy's suggestion was truncated, this resulted in dropping the .collect call, making the iterator mut for any, and replacing contains with any: this transformation is subtly different from the suggestion clippy failed to make: we don't move the creation of the iterator into the for loop; on subsequent passes through the loop, we re-use the partially or fully exhausted iterator. We caught this in code-review, and the subsequent discussion led to this bug report.

The original PR for this lint seemed to just detect cases of ….collect().contains(), which I think is good, always. A subsequent PR added a check for indirection, that is, to still lint that pattern even if we break up the .collect() and .contains() calls, like,

let x = <maps, filters, etc.>.collect();
// …
x.contains(…);

Which still mostly makes sense; I think the trouble is when that indirection crosses into a loop of some sort. Then, you're not just doing whatever iterator work you were doing once, you're doing it once & amortizing it across the loop. But, the lint doesn't detect that, I think.

I think the some of the relevant code is this? https://github.com/rust-lang/rust/blob/085e4170873f3e411c87ee009572f7d2b5130856/src/tools/clippy/clippy_lints/src/loops.rs#L2598-L2640

The text was updated successfully, but these errors were encountered:

flip1995 · 2020-10-12T20:57:00Z

So the things that should be improved here, are

The suggestion, so that it more clearly shows what should be done (easy)
The indirection should only apply when in the same scope (medium)

(2.) would fix most of your problems. Also it kind of makes sense. When binding before opening a new scope (for loop / if-else / just a block), the author probably has a reason for that.

yvonmanzi · 2020-10-14T07:35:09Z

Hi @flip1995 I've been learning Rust over the last few months and am looking for opportunities to contribute. I generally understand the issue here, and with you help, I can help solve it. Can I work on this?

flip1995 · 2020-10-17T16:32:45Z

Yeah sure, go ahead! If you have any questions, feel free to ask here, on Zulip or open a WIP PR.

Improve needless_collect output changelog: Improve needless_collect output Fixes #6908 Partially addresses #6164

y21 · 2023-06-16T00:44:30Z

Code in the original post seems to no longer trigger the lint: https://play.rust-lang.org/?version=stable&mode=debug&edition=2021&gist=4e58f8626b5e262501063e9e1dec55cb
Is this issue still relevant today?

roy-work added the C-bug Category: Clippy is not doing the correct thing label Oct 12, 2020

giraffate mentioned this issue Oct 13, 2020

Improve a suggestion in needless_collect with many comments #6171

Closed

phansch added the I-false-positive Issue: The lint was triggered on code it shouldn't have label Dec 18, 2020

camsteffen mentioned this issue Mar 22, 2021

needless_collect triggers inside for loop, can be questionable #6909

Open

camsteffen mentioned this issue Apr 2, 2021

Improve needless_collect output #7020

Merged

bors added a commit that referenced this issue Apr 2, 2021

Auto merge of #7020 - camsteffen:needless-collect, r=Manishearth

86fb0e8

Improve needless_collect output changelog: Improve needless_collect output Fixes #6908 Partially addresses #6164

q9f mentioned this issue Sep 30, 2021

fix stable clippy::needless_collect ChainSafe/forest#1238

Merged

J-ZhengLi added the L-nursery Lint: Currently in the nursery group label Jul 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

needless_collect: confusing suggestion & questionable transformation #6164

needless_collect: confusing suggestion & questionable transformation #6164

roy-work commented Oct 12, 2020

flip1995 commented Oct 12, 2020 •

edited

Loading

yvonmanzi commented Oct 14, 2020

flip1995 commented Oct 17, 2020

y21 commented Jun 16, 2023

needless_collect: confusing suggestion & questionable transformation #6164

needless_collect: confusing suggestion & questionable transformation #6164

Comments

roy-work commented Oct 12, 2020

The suggestion is elided

The suggestion is questionable

Meta

More meta

flip1995 commented Oct 12, 2020 • edited Loading

yvonmanzi commented Oct 14, 2020

flip1995 commented Oct 17, 2020

y21 commented Jun 16, 2023

flip1995 commented Oct 12, 2020 •

edited

Loading