
-X should batch the number of passed files to the maximum supported by the shell #410

Closed
lespea opened this issue Feb 15, 2019 · 18 comments · Fixed by #866 or #1020

@lespea commented Feb 15, 2019

It appears that running getconf ARG_MAX returns the maximum length that the command string can be. Possibly also include an option to artificially limit the number of arguments?

$ fd -IH . -tf -X wc -l
[fd error]: Problem while executing command: Argument list too long (os error 7)
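
For reference, a minimal sketch of querying this limit from Rust on POSIX systems (assuming the libc crate; equivalent to getconf ARG_MAX):

use libc::{sysconf, _SC_ARG_MAX};

/// Returns the ARG_MAX limit in bytes, or None if it is indeterminate.
fn arg_max() -> Option<i64> {
    // sysconf returns -1 if the limit is not defined.
    let limit = unsafe { sysconf(_SC_ARG_MAX) };
    if limit < 0 {
        None
    } else {
        Some(limit as i64)
    }
}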
@sharkdp (Owner) commented Feb 16, 2019

Thank you for reporting this. That was a known limitation when we first implemented --exec-batch, but we should definitely try to fix this.

Thank you for the information about getconf ARG_MAX. Looks like this should work on all POSIX systems. We will have to check how to get that information on Windows.

@tavianator (Collaborator)

Note that actually counting the size of your arguments is not straightforward: https://github.com/tavianator/bfs/blob/master/exec.c#L61

On Linux at least, you have to count the total length of the command line arguments and environment variables (and auxiliary vector), including NUL terminators, plus the length of the argv/environ pointer arrays themselves (including the final NULL elements). The kernel actually allocates these a page at a time, so you have to round up to the nearest page size. And POSIX recommends you leave an additional 2048 bytes just in case.
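
A rough sketch of this accounting (an illustration of the rules just described, not bfs's actual code; Unix-only):

use std::ffi::OsStr;
use std::mem::size_of;
use std::os::raw::c_char;
use std::os::unix::ffi::OsStrExt;

/// Bytes consumed by one argument: the string itself, its NUL
/// terminator, and the pointer slot it occupies in the argv array.
fn arg_size(arg: &OsStr) -> usize {
    arg.as_bytes().len() + 1 + size_of::<*const c_char>()
}

/// Round a total up to the page granularity the kernel allocates in.
fn page_align(total: usize, page_size: usize) -> usize {
    (total + page_size - 1) / page_size * page_size
}

/// Headroom POSIX recommends leaving below ARG_MAX.
const POSIX_HEADROOM: usize = 2048;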

This is all finicky enough that I also implemented code in bfs to detect and recover from E2BIG by trying fewer and fewer arguments until it works, just in case the ARG_MAX accounting is wrong or some other platform counts them differently. I don't know if that's feasible in Rust.
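
A hedged sketch of that fallback strategy in Rust (assuming the libc crate; not a port of the bfs code, and run_batch is an illustrative helper hard-coded to the wc -l example above):

use std::io;
use std::path::PathBuf;
use std::process::Command;

// Illustrative helper: run one batch and surface any exec failure,
// including E2BIG, as an io::Error.
fn run_batch(batch: &[PathBuf]) -> io::Result<()> {
    Command::new("wc").arg("-l").args(batch).status()?;
    Ok(())
}

/// Execute all paths, halving the batch size whenever E2BIG is hit.
fn exec_all(paths: &[PathBuf]) -> io::Result<()> {
    let mut rest = paths;
    let mut batch = rest.len().max(1);
    while !rest.is_empty() {
        let n = batch.min(rest.len());
        match run_batch(&rest[..n]) {
            Ok(()) => rest = &rest[n..],
            // Argument list still too long: retry with fewer arguments.
            Err(e) if e.raw_os_error() == Some(libc::E2BIG) && n > 1 => {
                batch = n / 2;
            }
            Err(e) => return Err(e),
        }
    }
    Ok(())
}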

@Artoria2e5

It could be easier to implement on Windows, as the accounting only involves the size of the lpCommandLine parameter. However, it is still non-trivial, since the escaping performed by the Rust standard library will increase the size of the command line. Since we are going to break the std::process::Command abstraction either way, it might make sense to just ask rust-lang to make such a thing available.
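
For context, CreateProcessW documents a hard limit of 32,767 UTF-16 code units (including the terminating NUL) for lpCommandLine, so the accounting would have to measure the command line in UTF-16 units after quoting. A minimal sketch of that measurement (illustrative only, not fd's code):

#[cfg(windows)]
fn utf16_units(s: &std::ffi::OsStr) -> usize {
    use std::os::windows::ffi::OsStrExt;
    // Length in UTF-16 code units, which is what the lpCommandLine
    // limit is measured in (quoting/escaping must be applied first).
    s.encode_wide().count()
}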

@tavianator (Collaborator)

See rust-lang/rust#40384

@sharkdp (Owner) commented Aug 7, 2021

In #768, @BurritoBurrato suggests adding a --batch-size argument, possibly in addition to an automatically computed (maximum) batch size. This would probably be much easier to implement. And it might be better to have this option than to have nothing (which inevitably causes "Argument list too long" errors).

This makes me wonder: is there a reasonable (sub-optimal) limit that we could set for the batch size that would work on most platforms/environments? It wouldn't be ideal, but it would be better than the current situation.

@tmccombs (Collaborator) commented Aug 8, 2021

Is there a reasonable (sub-optimal) limit that we could set for the batch size

I'm not sure. On Linux at least, the limit covers the combination of all environment variables and all command-line arguments. So if it is run with an unusually large environment, the space remaining for arguments may be unusually small.

If it's helpful, the output of xargs --show-limits on my Linux system is:

POSIX upper limit on argument length (this system): 2091895
POSIX smallest allowable upper limit on argument length (all systems): 4096
Maximum length of command we could actually use: 2088686
Size of command buffer we are actually using: 131072

I'm not sure where xargs gets the buffer size of 131072 from.

@devonhollowood (Contributor) commented Oct 20, 2021

I can take a shot at implementing a --batch-size option (as discussed above). I figure that gives a good workaround, and leaves open the option of later adding a default that is something like "however many arguments fit". Potentially --batch-size 0 would be equivalent to "no limit". Sound good?

@tmccombs (Collaborator)

Sounds good to me.

@tavianator (Collaborator)

I don't really think this is fixed yet. --batch-size is a useful workaround, but it requires you to guess a limit. Eventually we should properly respect ARG_MAX.

@tavianator reopened this Oct 23, 2021
@tmccombs (Collaborator)

Agreed.

@sharkdp (Owner) commented Oct 24, 2021

Eventually we should properly respect ARG_MAX

I started to implement this. I took the liberty of doing a 1:1 translation of your implementation in bfs, @tavianator. I hope you are okay with this (I haven't put a license on the project so far). The idea is to have a std::process::Command wrapper with a try_arg function that returns false if another argument cannot be added. This is far from finished.

Limitations:

  • Linux only
  • No additional env. variables can be added for the child process
  • Only a part of the process::Command API is provided

That being said, I was able to get the exact same numbers as the ones in the bfs -D exec … debug output (which was a bit tricky to compare, as the environment is not exactly the same due to things like the _ environment variable).

The library also contains a test that determines the limit experimentally (via a binary search on echo foo foo foo …). And I can confirm that the "bfs"-computed limit is below (but not too much below) the experimental limit.

https://github.com/sharkdp/argmax
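
A sketch of the try_arg pattern this enables, using a hypothetical LimitedCommand wrapper with a fixed byte budget (illustration only; the real argmax::Command derives its limit from ARG_MAX and the current environment, and its exact signatures may differ):

use std::ffi::{OsStr, OsString};
use std::mem::size_of;

// Hypothetical wrapper with a fixed byte budget.
struct LimitedCommand {
    program: OsString,
    args: Vec<OsString>,
    remaining: usize, // bytes still available for further arguments
}

impl LimitedCommand {
    fn new(program: &OsStr, budget: usize) -> Self {
        Self {
            program: program.to_owned(),
            args: Vec::new(),
            remaining: budget,
        }
    }

    /// Returns false if the argument no longer fits in the budget.
    fn try_arg(&mut self, arg: &OsStr) -> bool {
        // String bytes + NUL terminator + argv pointer slot.
        let cost = arg.len() + 1 + size_of::<*const u8>();
        if cost > self.remaining {
            return false;
        }
        self.remaining -= cost;
        self.args.push(arg.to_owned());
        true
    }

    fn status(&mut self) -> std::io::Result<std::process::ExitStatus> {
        std::process::Command::new(&self.program)
            .args(&self.args)
            .status()
    }
}

A caller would keep calling try_arg until it returns false, run the accumulated batch, and start a fresh command, which is exactly the batching loop fd needs.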

@tavianator (Collaborator)

Oh awesome! I was actually going to do that today too but didn't get around to it.

I'm definitely okay with it, and pretty much any licence is compatible with bfs's.

I think the implementation is probably good enough for any unix, not just Linux.

@sharkdp (Owner) commented Nov 16, 2021

I got some time today to continue working on this. There is more work to be done.

  • fix compile errors on all fd-supported platforms
    • i686-pc-windows-msvc
    • i686-unknown-linux-gnu
    • i686-unknown-linux-musl
    • x86_64-apple-darwin
    • x86_64-pc-windows-gnu
    • x86_64-pc-windows-msvc
    • x86_64-unknown-linux-gnu
    • x86_64-unknown-linux-musl
  • fix unit test on all platforms
    • i686-pc-windows-msvc
    • i686-unknown-linux-gnu
    • i686-unknown-linux-musl
    • x86_64-apple-darwin
    • x86_64-pc-windows-gnu
    • x86_64-pc-windows-msvc
    • x86_64-unknown-linux-gnu
    • x86_64-unknown-linux-musl
  • Make sure that the enforced limit is reasonably close* to the actual (experimental) limit
    • i686-pc-windows-msvc (1020 vs 2045)
    • i686-unknown-linux-gnu (test fails)
    • i686-unknown-linux-musl (15,459 vs 349,409)
    • x86_64-apple-darwin (20,585 vs 21,096)
    • x86_64-pc-windows-gnu (679 vs 2045)
    • x86_64-pc-windows-msvc (679 vs 2045)
    • x86_64-unknown-linux-gnu (348,329 vs 348,841)
    • x86_64-unknown-linux-musl (10,294 vs 349,408)

Ref: current CI output: https://github.com/sharkdp/argmax/actions/runs/1469073321

I also found out that there is another limit that needs to be enforced (though it is not relevant for fd): there is a maximum length for a single argument. On Linux, this is 32 * PAGE_SIZE, which is smaller than the maximum command-line length. I added an additional unit test for this.
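
A sketch of that per-argument limit (Linux-specific; assumes the libc crate; the kernel calls this MAX_ARG_STRLEN, i.e. 32 * PAGE_SIZE = 131072 bytes with 4 KiB pages):

/// Maximum length of a single argument (or environment string) on
/// Linux: MAX_ARG_STRLEN = 32 * PAGE_SIZE.
fn max_single_arg_len() -> usize {
    let page_size = unsafe { libc::sysconf(libc::_SC_PAGESIZE) } as usize;
    32 * page_size
}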

Any help would be very much appreciated.

*reasonably close: it shouldn't be an order of magnitude smaller than the actual limit.

@tavianator (Collaborator) commented Nov 16, 2021

For i686-unknown-linux-gnu, I wonder if the issue is that the (presumably) 64-bit kernel and the 32-bit test binary disagree about sizeof(char *)?

Edit: seems like that is what's happening. This workaround fixes the failure for me:

diff --git a/src/unix.rs b/src/unix.rs
index 9f4c419..04c7501 100644
--- a/src/unix.rs
+++ b/src/unix.rs
@@ -48,7 +48,7 @@ fn size_of_environment() -> i64 {
 /// Required size to store a single ARG argument and the corresponding
 /// pointer in argv**.
 pub(crate) fn arg_size<O: AsRef<OsStr>>(arg: O) -> i64 {
-    size_of::<*const c_char>() as i64 // size for the pointer in argv**
+    size_of::<u64>() as i64 // size for the pointer in argv**
       + arg.as_ref().len() as i64     // size for argument string
       + 1 // terminating NULL
 }

I don't know what the best fix is. To be perfectly accurate we'd have to know whether the binary we're executing is a 32- or 64-bit binary. Maybe it's okay to conservatively assume that pointers are always 8 bytes.

@sharkdp (Owner) commented Jan 23, 2022

I released a first version of the argmax crate with a pretty minimalistic API. But it might be enough to test the integration into fd if someone wants to work on this.

@sambacha

Having this issue on macOS 10.15 as well.

@tavianator self-assigned this May 19, 2022
@tavianator (Collaborator)

I am working on this, but the details are pretty awkward. Since fd supports things like

$ fd -X echo before {} after

I have to support adding arguments after the paths. But that's pretty awkward with try_arg(), since if try_arg("after") fails, I have to somehow back up and remove a path or something.

What I think we want is something like

if cmd.has_room_for([arg, "after", ...]) {
    assert!(cmd.try_arg(arg));
} else {
    assert!(cmd.try_args(["after", ...]));
    cmd.spawn();
    cmd = Command::new(...);
}
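
One way to avoid the back-up problem (a sketch with a hypothetical cost model matching the accounting discussed earlier, not fd's implementation): reserve the cost of the fixed prefix and suffix up front, so a batch can always be flushed with the trailing arguments still guaranteed to fit.

use std::path::PathBuf;
use std::process::Command;

// Hypothetical cost model: string bytes + NUL + argv pointer slot.
fn cost(s: &std::ffi::OsStr) -> usize {
    s.len() + 1 + std::mem::size_of::<*const u8>()
}

fn run_batches(
    program: &str,
    before: &[&str],
    paths: &[PathBuf],
    after: &[&str],
    budget: usize,
) -> std::io::Result<()> {
    // Cost of everything that appears in every invocation.
    let fixed: usize = cost(program.as_ref())
        + before.iter().map(|s| cost(s.as_ref())).sum::<usize>()
        + after.iter().map(|s| cost(s.as_ref())).sum::<usize>();

    let mut batch: Vec<&PathBuf> = Vec::new();
    let mut used = fixed;
    for path in paths {
        let c = cost(path.as_ref());
        if used + c > budget && !batch.is_empty() {
            // The suffix always fits because its cost was reserved up front.
            run(program, before, &batch, after)?;
            batch.clear();
            used = fixed;
        }
        // Note: a single path that exceeds the budget on its own is not
        // handled here; a real implementation would report an error.
        used += c;
        batch.push(path);
    }
    if !batch.is_empty() {
        run(program, before, &batch, after)?;
    }
    Ok(())
}

fn run(program: &str, before: &[&str], paths: &[&PathBuf], after: &[&str]) -> std::io::Result<()> {
    Command::new(program).args(before).args(paths).args(after).status()?;
    Ok(())
}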

@sharkdp (Owner) commented May 19, 2022

Right. I was afraid that try_arg would not be the most convenient API we could come up with. argmax certainly needs some polishing, and I am happy to integrate a more user-friendly interface or some convenience methods.
