torte is a declarative workbench for reproducible experiments in feature-model analysis research.
Why torte? Take your pick:
- "Tseitin or not Tseitin?" Evaluator
- CNF Transformation Workbench
- KConfig Extractor that Tackles Evolution
- Towards Reproducible Feature-Model Transformation and Extraction
- That's an Obviously Reverse-Engineered Tool Name
- KConfig = 🍰 config ∧ 🍰 = torte ∎
torte can be used to
- extract feature models from KConfig-based configurable software systems (e.g., the Linux kernel),
- transform feature models between various formats (e.g., FeatureIDE, UVL, and DIMACS), and
- analyze feature models with solvers to evaluate the extraction and transformation impact,
all in a fully declarative and reproducible fashion backed by reusable Docker containers. This way, you can
- draft experiments for selected feature models first, then generalize them to a larger corpus later,
- execute experiments on a remote machine without having to bother with technical setup,
- distribute fully-automated reproduction packages when an experiment is ready for publication, and
- adapt and update existing experiments without needing to resort to clone-and-own practices.
This one-liner will get you started with the default experiment (Docker required).
curl -s https://ekuiter.github.io/torte/ | sh
Read on if you want to know more details.
To run torte, you need:
- an
x86_64
orarm64
system 1 with Linux, macOS, or Windows with WSL - Git, curl, GNU tools (bash, coreutils, make, grep, and sed)
- Docker (preferably in rootless mode on Linux)
Experiment files in torte are self-executing - so, you can just create or download an experiment file (e.g., from the experiments
directory) and run it.
The following instructions will get you started on a fresh system.
By default, each of these instruction sets will install torte into the torte
directory.
All experiment data will then be stored in the directories input
and output
in your working directory.
# install and set up dependencies
sudo apt-get update
sudo apt-get install -y curl git make uidmap dbus-user-session
# install Docker (see https://docs.docker.com/desktop/install/linux-install/)
curl -fsSL https://get.docker.com | sh
dockerd-rootless-setuptool.sh install
# download and run the default experiment
curl -s https://ekuiter.github.io/torte/ | sh
# install and set up dependencies (this will replace macOS' built-in bash with a newer version)
/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"
(echo; echo 'eval "$(/opt/homebrew/bin/brew shellenv)"') >> $HOME/.zprofile
eval "$(/opt/homebrew/bin/brew shellenv)"
brew install bash coreutils gnu-sed grep
# install Docker (see https://docs.docker.com/desktop/install/mac-install/)
curl -o Docker.dmg https://desktop.docker.com/mac/main/arm64/149282/Docker.dmg
sudo hdiutil attach Docker.dmg
sudo /Volumes/Docker/Docker.app/Contents/MacOS/install --accept-license
sudo hdiutil detach /Volumes/Docker
rm Docker.dmg
open /Applications/Docker.app
# download and run the default experiment
curl -s https://ekuiter.github.io/torte/ | sh
# install WSL (see https://learn.microsoft.com/windows/wsl/install)
powershell
wsl --install
# install Docker (see https://docs.docker.com/desktop/install/windows-install/)
Invoke-WebRequest https://desktop.docker.com/win/main/amd64/149282/Docker%20Desktop%20Installer.exe -OutFile Docker.exe
Start-Process Docker.exe -Wait -ArgumentList 'install', '--accept-license'
Remove-Item Docker.exe
# restart your computer, start Docker, then install and set up dependencies
wsl
sudo apt-get update
sudo apt-get install -y curl git make
# download and run the default experiment
curl -s https://ekuiter.github.io/torte/ | sh
Above, we run the default experiment, which extracts, transforms, and analyzes the feature model of BusyBox 1.36.0 as a demonstration.
To execute another experiment, run curl -s https://ekuiter.github.io/torte/ | sh -s - <experiment>
(a list of predefined experiments is available here).
You can also write your own experiments by adapting an existing experiment file.
Further Tips
- As an alternative to the self-extracting installer shown above, you can clone this repository and run experiments with
./torte.sh <experiment-file>
. - A running experiment can be stopped with
Ctrl+C
. If this does not respond, tryCtrl+Z
, then./torte.sh stop
. - Run
./torte.sh help
to get further usage information (e.g., running an experiment over SSH and im-/export of Docker containers). - Developers are recommended to use ShellCheck to improve code quality.
- If Docker is running in rootless mode, experiments must not be run as
sudo
. Otherwise, experiments must be run assudo
. - The first execution of torte can take a while (~30 minutes), as several complex Docker containers need to be built.
This can be avoided by loading a reproduction package that includes Docker images (built by
./torte.sh export
).
This is a list of all subject systems for which feature-model extraction has been tested and confirmed to work for at least one extraction tool.
Other systems or revisions may also be supported.
Detailed system-specific information on potential threats to validity is available in the scripts/subjects
directory.
System | Revisions | Notes |
---|---|---|
axtls | 1.0.0 - 2.0.0 | |
buildroot | 2009.02 - 2024.05 | |
busybox | 1.3.0 - 1.36.0 | |
embtoolkit | 1.0.0 - 1.8.0 | |
fiasco | 5eed420 (2023-04-18) | 2 |
freetz-ng | d57a38e (2023-04-18) | 2 |
linux | 2.5.45 - 6.11 | 3 4 5 6 |
toybox | 0.4.5 - 0.8.9 | 7 |
uclibc-ng | 1.0.2 - 1.0.47 |
The following tools are bundled with torte and can be used in experiments for extracting, transforming, and analyzing feature models.
Most tools are not included in this repository, but cloned and built with tool-specific Docker files in the docker
directory.
The bundled solvers are listed in a separate table below.
For transparency, we document the changes we make to these tools and known limitations. There are also some general known limitations of torte. 8
Tool | Version | Date | Notes |
---|---|---|---|
arminbiere/cadiback | 2e912fb | 2023-07-21 | |
ckaestne/kconfigreader | 913bf31 | 2016-07-01 | 9 10 11 12 13 14 |
ekuiter/clausy | 6b816a9 | 2024-01-15 | |
ekuiter/SATGraf | 2677015 | 2023-04-05 | 15 |
FeatureIDE/FeatJAR | e27aea7 | 2023-04-11 | 16 17 |
FeatureIDE/FeatureIDE | 3.9.1 | 2022-12-06 | 18 19 17 |
paulgazz/kmax | 4.5.2 | 2023-12-20 | 10 11 20 21 14 |
Z3Prover/z3 | 4.11.2 | 2022-09-04 | 22 |
The following solvers are bundled with torte and can be used in experiments for analyzing feature-model formulas.
The bundled solver binaries are available in the docker/solver
directory.
Solvers are grouped in collections to allow several versions of the same solver to be used.
In addition to the solvers listed below, z3
(already listed above) can be used as a satisfiability and SMT solver.
These #SAT solvers (available here) were used in the evaluations of several papers:
- Evaluating State-of-the-Art #SAT Solvers on Industrial Configuration Spaces (EMSE 2023)
- Tseitin or not Tseitin? The Impact of CNF Transformations on Feature-Model Analyses (ASE 2022)
The #SAT solvers from the collection model-counting-competition-2022
should be preferred for new experiments.
Solver | Version | Date | Notes |
---|---|---|---|
countAntom | 1.0 | 2015-05-11 | 23 |
d4 | ? | ? | |
dSharp | ? | ? | 24 |
Ganak | ? | ? | |
sharpSAT | ? | ? |
These #SAT solvers (available here) were used in the model-counting competition 2022. Not all evaluated solvers are included here, as some solver binaries (i.e., for MTMC and ExactMC) have not been disclosed.
Solver | Notes |
---|---|
c2d | |
d4 | |
DPMC | |
gpmc | |
TwG | 25 |
SharpSAT-TD | 23 |
SharpSAT-td+Arjun | 23 26 |
These are miscellaneous solvers from various sources.
Solver | Version | Date | Class | Notes |
---|---|---|---|---|
ApproxMC | 4.1.9 | 2023-02-22 | Approximate #SAT Solver | |
backbone_kissat.py | - | - | Backbone Extractor | |
d4v2 | c1f6842 | 2023-02-15 | #SAT Solver, d-DNNF compiler, PMC | |
kissat_MAB-HyWalk | ? | ? | SAT Solver | |
SAT4J | 2.3.6 | 2020-12-14 | SAT Solver |
A subset of these SAT solvers was used in the evaluation of the paper Tseitin or not Tseitin? The Impact of CNF Transformations on Feature-Model Analyses (ASE 2022). Each solver is the gold medal winner in the main track (SAT+UNSAT) of the SAT competition in the year encoded in its file name. These binaries were obtained from the SAT competition, SAT heritage, and SAT museum initiatives.
Year | Solver |
---|---|
2002 | zchaff |
2003 | Forklift |
2004 | zchaff |
2005 | SatELiteGTI |
2006 | MiniSat |
2007 | RSat |
2008 | MiniSat |
2009 | precosat |
2010 | CryptoMiniSat |
2011 | glucose |
2012 | glucose |
2013 | lingeling-aqw |
2014 | lingeling-ayv |
2015 | abcdSAT |
2016 | MapleCOMSPS_DRUP |
2017 | Maple_LCM_Dist |
2018 | MapleLCMDistChronoBT |
2019 | MapleLCMDiscChronoBT-DL-v3 |
2020 | Kissat-sc2020-sat |
2021 | Kissat_MAB |
2022 | Kissat_MAB-HyWalk |
2023 | sbva_cadical |
This is a list of all predefined experiments in the experiments
directory and their purposes.
Please create a pull request if you want to publish your own experiment.
Experiments starting with draft-
are experimental.
Experiment | Purpose |
---|---|
busybox-history-full |
Extraction of all feature models of BusyBox (for every commit that touches the feature model) 27 |
default |
"Hello-world" experiment that extracts and transforms a single feature model |
feature-model-collection |
Extraction, transformation, and analysis of several feature-model histories |
feature-model-collection-learning |
Learning from feature-model histories |
feature-model-differences |
Extraction and comparison of all feature models of several feature-model histories |
linux-history-releases |
Extraction, transformation, and analysis of a history of Linux feature models |
linux-history-weekly |
Extraction of a weekly history of Linux feature models |
linux-recent-release |
Extraction and transformation of a recent Linux feature model |
prepare-linux-fork |
Clones and rewrites the Linux Git repository to avoid issues with case-insensitive file systems |
tseitin-or-not-tseitin |
Evaluation for the paper Tseitin or not Tseitin? The Impact of CNF Transformations on Feature-Model Analyses (ASE 2022) |
This project has evolved through several stages and intends to replace them all:
kmax-vm > feature-model-repository-pipeline > tseitin-or-not-tseitin > torte
- kmax-vm was intended to provide an easy-to-use environment for integrating kmax with PCLocator in a virtual machine using Vagrant/VirtualBox. It is now obsolete due to our Docker integration of kmax.
- feature-model-repository-pipeline extended kmax-vm and could be used to extract feature models from Kconfig-based software systems with kconfigreader and kmax. The results were stored in the feature-model-repository. Its functionality is completely subsumed by torte and more efficient and reliable due to our Docker integration.
- tseitin-or-not-tseitin extended the feature-model-repository-pipeline to allow for transformation and analysis of feature models. It was mostly intended as a reproduction package for a single academic paper. Its functionality is almost completely subsumed by torte, which can be used to create reproduction packages for many different experiments.
If you are looking for a curated collection of feature models from various domains, have a look at our feature-model-benchmark.
If you have any feedback, please contact me at kuiter@ovgu.de. New issues, pull requests, or any other kinds of feedback are always welcome.
The source code of this project is released under the LGPL v3 license. To ensure reproducibility, we also provide binaries (e.g., for solvers) in this repository. These binaries have been collected or compiled from public sources. Their usage is subject to each binaries' respective license - please contact me if you perceive any licensing issues.
Footnotes
-
On
arm64
systems (e.g., Windows tablets and Apple Silicon Macs), torte cross-compiles some Docker images to ensure that precompiled binaries (e.g., JavaSMT, Z3, and all solvers) function correctly. This may negatively impact performance on some systems (e.g., ARM-based Windows tablets), although recent Macs should not be affected due to Rosetta. (If you encounter errors like this one, try to disable "Use Rosetta for x86_64/amd64 emulation on Apple Silicon" in the Docker settings. This setting can be re-enabled after the Docker images have been built.) Executing torte from within a virtual machine has only been confirmed to work with Linux guest systems onx86_64
host systems. Despite our efforts, some functionality involving precompiled binaries is still known to cause problems onarm64
systems. If such functionality is required, the easiest solution is to switch to anx86_64
system (e.g., with SSH). ↩ -
This system does not regularly release tagged revisions, so only a single revision has been tested. ↩ ↩2
-
Most revisions and architectures of Linux (since the introduction of KConfig) can be extracted successfully. The user-mode architecture
um
is currently not supported, as it requires setting an additional sub-architecture. ↩ -
Due to extractor limitations, we ignore the more recently introduced KConfig constructs defined in Linux'
scripts/Kconfig.include
. Most of these only add machine specific-default values or dependencies (affecting about 100 features in the kernel's history up tov6.3
). However, these constructs do not affect our feature-model extraction, as we want to ignore machine-dependent restrictions. ↩ -
Currently, we use the KConfig parser of Linux 2.6.9 for all revisions of Linux up to Linux 2.6.9, as older versions of the parser cannot be compiled. However, our experiments showed that the chosen parser version typically does not seem to affect the extracted formula, should it succeed in extracting a formula. ↩
-
For Linux, specifying arbitrary commit hashes is not enabled by default, because we must perform a complete Git history rewrite (resetting the commit hashes in the process) in order to ensure that checking out the repository also succeeds cross-platform on case-insensitive file systems (e.g., APFS). To specify arbitrary and up-to-date commit hashes, use
LINUX_CLONE_MODE=original|filter
(seescripts/subject/linux.sh#post-clone-hook-linux
:original
only works on case-sensitive file systems, whilefilter
is cross-platform, but takes several hours to run). This does not affect typical use cases that involve tag and branch identifiers. ↩ -
Feature models for this system are currently likely to be incomplete due to an inaccurate extraction. ↩
-
Currently, non-Boolean variability (e.g., constraints on numerical features) is only partially supported (e.g., encoded naively into Boolean constraints). It is recommended to check manually whether non-Boolean variability is represented as desired in generated files. ↩
-
We added the class
TransformIntoDIMACS.scala
to kconfigreader to decouple the extraction and transformation of feature models, so kconfigreader can also transform feature models extracted with other tools (e.g., kmax). ↩ -
We majorly revised the native C bindings
dumpconf.c
(kconfigreader) andkextractor.c
(kmax), which are intended to be compiled against a system's Kconfig parser to get accurate feature models. Our improved versions adapt to the KConfig constructs actually used in a system, which is important to extract evolution histories with evolving KConfig parsers. Our changes are generalizations of the original versions ofdumpconf.c
andkextractor.c
and should pose no threat to validity. Specifically, we added support forE_CHOICE
(treated asE_LIST
),P_IMPLY
(treated asP_SELECT
, see smba/kconfigreader), andE_NONE
,E_LTH
,E_LEQ
,E_GTH
,E_GEQ
(ignored). ↩ ↩2 -
Compiling the native C bindings of kconfigreader and kmax is not possible for all KConfig-based systems (e.g., if the Python-based Kconfiglib parser is used). In that case, you can try to reuse a C binding from an existing system with similar KConfig files; however, this may limit the extracted model's accuracy. ↩ ↩2
-
The DIMACS files produced by kconfigreader may contain additional variables due to Plaisted-Greenbaum transformation (i.e., satisfiability is preserved, model counts are not). Currently, this behavior is not configurable. ↩
-
Feature models and formulas produced by kconfigreader have nondeterministic clause order. This does not impact semantics, but it possibly influences the efficiency of solvers. ↩
-
The formulas produced by kconfigreader and kmax do not explicitly mention unconstrained features (i.e., features that do not occur in any constraints). However, for many analyses that depend on knowing the entire feature set (e.g., simply listing all configurable features or calculating model counts), this is a threat to validity. We do not modify the extracted formulas, to preserve the original output of kconfigreader and kmax. To address this threat, we instead offer the transformation stage
transform-into-unconstrained-features
, which explicitly computes these features. ↩ ↩2 -
We forked the original SATGraf tool and migrated it to Gradle. We also added a new feature for exporting the community structure visualization as a JPG file, avoiding the graphical user interface. ↩
-
FeatJAR is still in an experimental stage and its results should generally be cross-validated with FeatureIDE. ↩
-
DIMACS files produced by FeatJAR and FeatureIDE do not contain additional variables (i.e., equivalence is preserved). Currently, this behavior is not configurable. ↩ ↩2
-
We perform all transformations with FeatureIDE from within a FeatJAR instance, which does not affect the results. ↩
-
Transformations with FeatureIDE into XML and UVL currently only encode a flat feature hierarchy, no feature-modeling notation is reverse-engineered. ↩
-
We added the script
kclause2model.py
to kmax to translate kclause's pickle files into the kconfigreader's feature-model format. This file translates Boolean variability correctly, but non-Boolean variability is not supported. ↩ -
We do not use kmax's
kclause_to_dimacs.py
script for CNF transformation, as it has had some issues in the past. Instead, we have a separate Docker container for Z3. ↩ -
The DIMACS files produced by Z3 may contain additional variables due to Tseitin transformation (i.e., satisfiability and model counts are preserved). Currently, this behavior is not configurable. ↩
-
This solver currently crashes on some or all inputs. ↩ ↩2 ↩3
-
This version of dSharp is known to produce inaccurate results for some inputs, so use it with caution. ↩
-
For TwG, two configurations were provided by the model-counting competition (
TwG1
andTwG2
). As there was no indication as to which configuration was used in the competition, we arbitrarily choseTwG1
. ↩ -
For SharpSAT-td+Arjun, two configurations were provided by the model-counting competition (
conf1
andconf2
). As only the second configuration actually runs SharpSAT-td, we choseconf2
(conf1
probably implements the approximate counterSharpSAT-td-Arjun+ApproxMC
). ↩ -
As noted by Kröher et al. 2023, the feature model of BusyBox is scattered across its
.c
source code files in special comments and therefore not trivial to extract as a full history (because we need to detect changes in any KConfig files to identify relevant commits). We solve this problem by iterating over all commits to generate all feature models, committing them to a newbusybox-models
repository, in which each commit represents one version of the feature model. ↩