Releases · sosy-lab/benchexec

A complete rewrite of the HTML tables produced by table-generator.
The tables are now based on React, load much faster, and provide features like pagination, sorting, and more intuitive filters. More information can be found in PR #477.
Thanks @bschor for this!
Note that the tables are not usable without JavaScript anymore.
The old kind of HTML tables can still be produced with the command-line flag --static-table, but this is deprecated and will be removed in BenchExec 3.0 in January 2020 (cf. #479).
Recursively clean up cgroups after a run.
This enables nesting runexec in itself, but only if --full-access-dir /sys/fs/cgroup is passed to the outer runexec, which means that the processes in the outer container have full access to the cgroup hierarchy and could use this to circumvent resource limits.
benchexec filters the tasks to execute depending on the expected verdict, if <propertyfile expectedverdict="..."> in used the benchmark definition.
BenchExec now stores a timestamp for the start time of each run, and timestamps for start and end of reach run set.
benchexec will store arbitrary user-defined text as benchmark description together with the results if specified with benchexec --description-file ....
Support for execution on Python 3.8.
Fix crash in runexec if the tool's stdout/stderr contain invalid UTF-8.
Fix hanging benchexec in container mode if tool cannot be executed (e.g., if executable is missing).
New tool-info modules and updates for SV-COMP'20 and Test-Comp'20.

Assets 12

27 Sep 08:42

PhilippWendler

2.2

905d743

Release 2.2

This release fixes two security issues, all users are encouraged to update:

Since BenchExec 2.1, the setup of the container for the tool-info module (which was added in BenchExec 1.20) could silently fail, for example if user namespaces are disabled on the system. In this case the tool-info module would be executed outside of the container.
Run execution was not affected.
The kernel offers a keyring feature for storage of keys related to features like Kerberos and ecryptfs. Before Linux 5.2, there existed one keyring per user, and BenchExec did not prevent access from the tool inside the container to the kernel keyring of the user who started BenchExec.
Now such accesses are forbidden (on all kernel versions) using seccomp if libseccomp2 is installed, which should the case on any standard distribution.
Note that seccomp filters do have a slight performance impact and could prevent some binaries on exotic architectures from working. In such a case please file a bug report.

Assets 12

17 Sep 13:32

PhilippWendler

2.1

bb19118

Release 2.1

benchexec can now partition the Level 3 cache of the CPU for parallel runs and measure cache usage and memory bandwidth, at least on some Intel CPUs and if the pqos and pqos_wrapper are installed. More information is in the documentation.

Furthermore, some error messages for systems without container support were improved.

Assets 12

18 Jul 13:16

PhilippWendler

2.0

151db25

Release 2.0

This release does not add new features compared to BenchExec 1.22, but removes several deprecated features and brings several other backwards-incompatible changes to make BenchExec more consistent and user-friendly:

Support for Python 3.2 and 3.3 is removed, the minimal Python version is now 3.4.
Additionally, runexec/RunExecutor continue to support Python 2.7 until end of 2019.
Support for running benchmarks as a different user with sudo is removed (parameters --user/--users).
Use container mode as better method for isolating runs.
Container mode is enabled by default.
It can be disabled with --no-container, but this decreases reliability of benchmarking.
If the cpuacct cgroup is not available, CPU-time measurements and limits are not supported.
Either container mode or the freezer cgroup are required to ensure protection against fork bombs.
Niceness of benchmarked process is not changed, previously it was increased by 5.
Changes to input of benchexec:
- The memory limit given to benchexec requires an explicitly specified unit.
- Support for <test> tags, <sourcefiles> tags, and variables named ${sourcefile_*} removed from benchmark definitions.
  Use <rundefinition>, <tasks>, and ${inputfile_*} instead.
- Variables named ${taskdef_*} are defined only if task-definition files are used, and variables named ${inputfile_*} only otherwise.
Changes to table-generator:
- A column named memUsage is automatically renamed to memory.
- A column named memory is automatically converted to Megabytes.
  Both conversions are only applied if no <column> tags are used.
Changes to run-result data:
- In case of aborted or failed runs, no dummy results (e.g., cputime of 0s) are present.
- The memory results of benchexec are named memory, not memUsage.
- Memory results have the unit B explicitly specified.
  Furthermore, units are present in all attributes of the result XML files where they were still missing.
- Result item exitcode is removed, only RunExecutor.execute_run() still returns it, but as an object instance instead of an int.
  Use returnvalue and exitsignal instead.
Module benchexec.test_tool_wrapper is removed, use benchexec.test_tool_info instead.
BenchExec (both benchexec and runexec) terminates itself cleanly after aborting all runs if it receives one of the signals SIGTERM, SIGINT (Ctrl+C), or SIGQUIT.

Additionally, this release adds a fix for the container that is used since BenchExec 1.20 for the tool-info module. In this container, the environment variable HOME did not point to /home/benchexec as expected but to the user's real home directory. This broke tools like Ultimate if the /home was configured to be hidden or read-only.

Furthermore, we declare the following features deprecated and plan on removing them for BenchExec 3.0, which is expected to be released in January 2020:

Support for Python 2.7 and 3.4 (cf. #438)
Support for checking correctness of run results and computing scores if task-definition files are not used (cf. #439)

Please respond in the respective issue if one of these deprecations is a problem for you.

Assets 12

11 Jul 16:49

PhilippWendler

1.22

6a811d0

Release 1.22

More robust handling of Ctrl+C in benchexec.
For example, output files are now always fully written, whereas previously pressing Ctrl+C at the wrong time could result in truncated files. A side effect of this is that if you call benchexec.benchexec.BenchExec().start() in own Python code, you must now add a signal handler for SIGINT. The same was already true for users of RunExecutor, this is now documented.
Fix Ctrl+C for benchexec in container mode.
In BenchExec 1.21, one would need to press Ctrl+C twice to stop benchexec.
Fix unreliable container mode on Python 3.7.
Some robustness improvements and fixes of rare deadlocks.
Decreased overhead of benchexec while runs are executing.

Assets 12

25 Jun 08:56

PhilippWendler

1.21

20f06bd

Release 1.21

This release contains only a few bug fixes:

Forwarding signals to the benchmarked process (and thus, stopping runs via Ctrl+C), was broken on Python 2.
If the freezer cgroup was available but mounted in a separate hierarchy, it was not used reliably as protection against fork bombs when killing processes.
Since BenchExec 1.19, an exception would occur if a non-existing command was started in container mode.
Since BenchExec 1.19, copying output files from a container would occur while subprocesses are still running and would be counted towards the walltime limit. This is fixed, although subprocesses will still be running if the freezer cgroup is not available (cf. #433).

Assets 12

13 Jun 13:26

PhilippWendler

1.20

7c05590

Release 1.20

If benchexec --container is used, all code that is part of the tool-info module (as well as all processes started by it) are now run in a separate container with the same layout and restrictions as the run container.
Note, however, that it is not the same container, so any modifications made by the tool-info module to files on disk are not visible in the runs!
The test_tool_info utility also has gained a parameter --container for testing how a tool-info module behaves in a container.
Nested containers are now supported.
Due to a change to the internal implementation of the container mode, commands like the following succeed now:
containerexec -- containerexec --hidden-dir /sys -- /bin/bash.
(Some parts of /sys need to be excluded because of kernel limitations.)
Note that nesting runexec or benchexec is still not supported, because nested cgroups are not implemented, so any cgroup-related features (resource limitations and measurements) are missing. But nesting containerexec and runexec --container (or vice-versa) now works.
/etc/hostname in container now also shows the container's host name that exists since BenchExec 1.19.
Change how CPUs with several NUMA nodes per CPU are handled:
BenchExec will now treat each NUMA node like a separate CPU package and avoid creating runs that span several NUMA nodes. Thanks @alohamora!

Assets 12

06 Jun 12:25

PhilippWendler

1.19

1351a25

Release 1.19

In container mode, all temp directories are now on a tmpfs "RAM disk".
This affects everything written to directories in the hidden or overlay modes. Files written there are now included in the memory measurements and the memory limit! The advantage is that performance should be more deterministic, especially if several runs use much I/O in parallel. This feature can be disabled with --no-tmpfs.
/dev/shm and /run/shm are now available inside the container and provide a tmpfs instance (even with --no-tmpfs) as required by some tools for shared memory.
Container mode now recommends LXCFS and automatically uses it if available for a better container isolation (e.g., uptime measures container uptime, not host uptime). On Debian/Ubuntu, just use sudo apt install lxcfs.
Several small bug fixes and other improvements of isolation for container mode (e.g., host name in container is no longer the real host name).
Add benchexec --no-hyperthreading, which restricts core assignments to a single virtual core per physical CPU core (all other sibling cores will stay unused). Thanks @alohamora!

Assets 12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Releases: sosy-lab/benchexec

Release 2.5.1

Release 2.5

Release 2.3

Release 2.2

Release 2.1

Release 2.0

Release 1.22

Release 1.21

Release 1.20

Release 1.19