clar: use internal functions instead of /bin/cp and /bin/rm #5528

ethomson · 2020-05-23T15:10:02Z

clar has historically shelled out to /bin/cp to copy test fixtures into a sandbox and /bin/rm to clean them up. This has two deficiencies:

1. /bin/cp is slower than simply opening the source and destination and copying them in a read/write loop. On my Mac, the /bin/cp based approach takes ~2:40 for a full test pass. Using a read/write loop to copy the files ourselves takes ~1:50. Similarly, /bin/rm is slower than doing this internally. Moving to an internal traversal shaves another 20 seconds off.

   These numbers are less impressive on Linux, but using sendfile there makes a notable improvement.

2. It's noisy. Since the leak detector follows fork/exec, we'll end up running the leak detector on /bin/cp and /bin/rm. This would be fine, except that the leak detector spams the console on startup and shutdown, so it adds a lot of additional information to the test runs that is useless. By not forking and using this internal system, we see much less output.
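
For illustration, the read/write loop mentioned above could look roughly like the sketch below. `copy_file_rw` is a hypothetical name, not the function added in this PR, and the buffer size and destination mode are assumptions.

```c
#include <assert.h>
#include <fcntl.h>
#include <sys/types.h>
#include <unistd.h>

/*
 * Hypothetical helper (not the PR's code): copy `src` to `dst` with a
 * plain read/write loop. Returns 0 on success, -1 on failure.
 */
static int copy_file_rw(const char *src, const char *dst)
{
	char buf[65536]; /* assumed chunk size */
	ssize_t nread;
	int in, out, error = 0;

	if ((in = open(src, O_RDONLY)) < 0)
		return -1;

	if ((out = open(dst, O_WRONLY | O_CREAT | O_TRUNC, 0644)) < 0) {
		close(in);
		return -1;
	}

	while (error == 0 && (nread = read(in, buf, sizeof(buf))) > 0) {
		const char *p = buf;
		size_t len = (size_t)nread;

		/* write() may be partial, so loop until the chunk is flushed */
		while (len > 0) {
			ssize_t nwritten = write(out, p, len);

			if (nwritten < 0) {
				error = -1;
				break;
			}

			assert(nwritten <= (ssize_t)len);
			p += nwritten;
			len -= (size_t)nwritten;
		}
	}

	/* read() returning a negative value means the read itself failed */
	if (error == 0 && nread < 0)
		error = -1;

	close(in);
	if (close(out) < 0)
		error = -1;

	return error;
}
```

A caller would use it as `copy_file_rw("fixture", "sandbox/fixture")`. No process is spawned, so there is nothing for a fork/exec-following leak detector to attach to.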

Since this repo actually has tests and fixtures, I'm opening this PR here and will port it to https://github.com/clar-test/clar once approved.

pks-t

I had created a similar implementation multiple years ago which used our own internal futil helpers to do this, so I'm very much in favor of doing the conversion! Your approach definitely makes more sense, though, as it's also upstreamable.

There are a few small remarks, but overall this looks good to me!

pks-t · 2020-06-01T13:20:52Z

tests/clar/fs.h

+				cl_assert(ret <= (ssize_t)len);
+				len -= ret;
+			}
+			cl_assert(ret >= 0);


Nit: this can be cl_assert(ret == 0)

ethomson · 2020-06-02

Writing the last chunk of the buffer returns a positive ret (the number of bytes written). That is subtracted from len, which becomes 0, so the loop exits with ret still positive. In this case we should stop the loop successfully, so the assertion has to be ret >= 0 rather than ret == 0.
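
To make the exit condition concrete, here is a minimal sketch of a loop with this shape. `write_all` is a hypothetical name, and the loop structure is an assumption based on the diff excerpt above, not the code under review.

```c
#include <assert.h>
#include <sys/types.h>
#include <unistd.h>

/*
 * Hypothetical helper: write all `len` bytes of `buf` to `fd` and
 * return the result of the last write() call.
 */
static ssize_t write_all(int fd, const char *buf, size_t len)
{
	ssize_t ret = 0;

	while (len > 0 && (ret = write(fd, buf, len)) > 0) {
		assert(ret <= (ssize_t)len);
		buf += ret;
		len -= (size_t)ret;
	}

	/*
	 * After the final chunk, len is 0 but ret still holds that chunk's
	 * positive byte count, so asserting ret == 0 here would fail on
	 * every successful non-empty write. Only a negative ret is an error.
	 */
	assert(ret >= 0);

	return ret;
}
```

Writing five bytes to a pipe with `write_all(fd, "abcde", 5)` leaves the loop with `ret == 5` and `len == 0`, which is the successful case described in the reply.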

pks-t · 2020-06-01T13:21:01Z

tests/clar/fs.h

+
+	close(in);
+	close(out);
+


Nit: trailing newline

pks-t · 2020-06-01T13:23:42Z

tests/clar/fs.h

+		const char *base;
+		int base_len;
+
+		/* Target exists; append basename */


I think this is the case where we copy a file into a pre-existing directory, right? In that case, we should also assert S_ISDIR(dest_st.st_mode) so that one gets a proper error message if it isn't.
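
A rough sketch of what this suggestion amounts to follows. `resolve_dest` is a hypothetical helper, not part of the PR, and treating any stat() failure as "target does not exist" is a simplifying assumption.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>
#include <sys/stat.h>

/*
 * Hypothetical helper: compute the final destination path for copying
 * `src` to `dest`. Returns 0 on success, -1 if `out` is too small.
 */
static int resolve_dest(char *out, size_t out_size,
	const char *src, const char *dest)
{
	struct stat dest_st;
	const char *base;
	int n;

	if (stat(dest, &dest_st) != 0) {
		/* Assumed: target does not exist; use the path verbatim */
		n = snprintf(out, out_size, "%s", dest);
	} else {
		/* Target exists; it has to be a directory to copy into */
		assert(S_ISDIR(dest_st.st_mode));

		/* Append the basename of the source */
		base = strrchr(src, '/');
		base = base ? base + 1 : src;
		n = snprintf(out, out_size, "%s/%s", dest, base);
	}

	return (n < 0 || (size_t)n >= out_size) ? -1 : 0;
}
```

With the assertion in place, copying onto an existing regular file fails at the check itself rather than later, when opening a path beneath a non-directory would produce a less obvious error.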

pks-t · 2020-06-01T13:26:13Z

> It's noisy. Since the leak detector follows fork/exec, we'll end up running the leak detector on /bin/cp and /bin/rm. This would be fine, except that the leak detector spams the console on startup and shutdown, so it adds a lot of additional information to the test runs that is useless. By not forking and using this internal system, we see much less output.

Ah, interesting. I always wondered where all those messages on macOS came from, but that makes a lot of sense to me!

clar has historically shelled out to `/bin/cp` to copy test fixtures into a sandbox. This has two deficiencies:

1. It's slower than simply opening the source and destination and copying them in a read/write loop. On my Mac, the `/bin/cp` based approach takes ~2:40 for a full test pass. Using a read/write loop to copy the files ourselves takes ~1:50.

2. It's noisy. Since the leak detector follows fork/exec, we'll end up running the leak detector on `/bin/cp`. This would be fine, except that the leak detector spams the console on startup and shutdown, so it adds a _lot_ of additional information to the test runs that is useless. By not forking and using this internal system, we see much less output.

ethomson force-pushed the ethomson/clar_internal branch from 467d2a9 to fdfa742 on May 23, 2020 15:23

pks-t reviewed Jun 1, 2020

ethomson added 4 commits June 2, 2020 09:03

clar: copy files with sendfile on linux

d03fd33
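
As a hedged illustration of this commit's approach: on Linux, `sendfile(2)` copies between two file descriptors inside the kernel, avoiding the userspace buffer. `copy_file_sendfile` is a hypothetical name and the sketch is not the commit's actual code. It assumes Linux 2.6.33 or later, where the output descriptor may be a regular file.

```c
#include <assert.h>
#include <fcntl.h>
#include <sys/stat.h>
#include <sys/types.h>
#include <unistd.h>
#ifdef __linux__
# include <sys/sendfile.h>
#endif

/*
 * Hypothetical helper: copy `src` to `dst` using sendfile(2).
 * Returns 0 on success, -1 on failure or on non-Linux platforms.
 */
static int copy_file_sendfile(const char *src, const char *dst)
{
#ifdef __linux__
	struct stat st;
	off_t offset = 0;
	int in, out, error = 0;

	if ((in = open(src, O_RDONLY)) < 0)
		return -1;

	if (fstat(in, &st) < 0) {
		close(in);
		return -1;
	}

	out = open(dst, O_WRONLY | O_CREAT | O_TRUNC, st.st_mode & 0777);
	if (out < 0) {
		close(in);
		return -1;
	}

	/* sendfile() advances `offset` and may transfer fewer bytes than asked */
	while (offset < st.st_size) {
		ssize_t ret = sendfile(out, in, &offset,
			(size_t)(st.st_size - offset));

		/* 0 means the source shrank under us; treat it as an error */
		if (ret <= 0) {
			error = -1;
			break;
		}

		assert(offset <= st.st_size);
	}

	close(in);
	if (close(out) < 0)
		error = -1;

	return error;
#else
	/* This sketch has no fallback; a real caller would use read/write */
	(void)src;
	(void)dst;
	return -1;
#endif
}
```

On other platforms the function simply reports failure, so a caller would fall back to a read/write loop there.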

clar: remove files internally instead of /bin/rm

ee9e916

Similar to how clar has used `/bin/cp` to copy files, it's used `/bin/rm` to remove them. This has similar deficiencies: the `leaks` output is noisy and shelling out is slow. Move removal to an internal function.
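
An internal removal of this kind can be sketched as a recursive directory traversal. `remove_tree` is a hypothetical name, and the fixed path buffer is an assumption made to keep the example short. This is not the commit's implementation.

```c
#include <assert.h> /* for callers that assert on the result */
#include <dirent.h>
#include <stdio.h>
#include <string.h>
#include <sys/stat.h>
#include <unistd.h>

/*
 * Hypothetical helper: remove `path` and everything beneath it.
 * Returns 0 on success, -1 on failure.
 */
static int remove_tree(const char *path)
{
	struct stat st;
	struct dirent *ent;
	DIR *dir;
	int error = 0;

	/* lstat() so that symlinks are removed rather than followed */
	if (lstat(path, &st) != 0)
		return -1;

	if (!S_ISDIR(st.st_mode))
		return unlink(path);

	if ((dir = opendir(path)) == NULL)
		return -1;

	while (error == 0 && (ent = readdir(dir)) != NULL) {
		char child[4096]; /* assumed maximum path length */
		int n;

		if (!strcmp(ent->d_name, ".") || !strcmp(ent->d_name, ".."))
			continue;

		n = snprintf(child, sizeof(child), "%s/%s", path, ent->d_name);

		if (n < 0 || (size_t)n >= sizeof(child))
			error = -1;
		else
			error = remove_tree(child);
	}

	closedir(dir);

	/* The directory is empty now (or we failed along the way) */
	return error ? error : rmdir(path);
}
```

A test harness could then clean up with `assert(remove_tree("sandbox") == 0)` instead of spawning `/bin/rm -rf`.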

clar: remove unused shell_out function

2a2c5b4

ethomson force-pushed the ethomson/clar_internal branch from fdfa742 to 2a2c5b4 on June 2, 2020 08:06

ethomson merged commit d4b953f into master Jun 2, 2020

ethomson deleted the ethomson/clar_internal branch June 3, 2020 09:06
