- Advanced Usage
The Kmax Tool Suite (kmax) contains a set of tools for performing automated reasoning on Kconfig and Kbuild constraints. It consists of the following tools:
klocalizer
takes the name of a compilation unit and automatically generates a.config
file that, when compiled, will include the given compilation unit. It uses the logical models fromkclause
andkmax
. The configuration localization problem is described in an SPLC 2018 challenge paper (preprint).kclause
"compiles" Kconfig constraints into logical formulas.kextractor
uses Linux's own Kconfig parser to perform an extraction of the syntax of configuration option names, types, dependencies, etc. It is described in our ESEC/FSE 2021 research paper (preprint).kmax
collects configuration information from Kbuild Makefiles. It determines, for each compilation unit, a symbolic Boolean expression that represents the conditions under which the file gets compiled and linked into the final program. Its algorithm is described in our ESEC/FSE 2017 research paper (preprint) and the original implementation is in version 1.0.kismet
does verification-based static analysis to find unmet dependency bugs and generates test cases. It is described in our ESEC/FSE 2021 research paper (preprint).koverage
checks whether a Linux configuration file includes a set of (sourcefile,line) for compilation. It utilizes the Linux build system to determine coverage.
Install the prerequisites
sudo apt install -y python3-setuptools python3-dev
Clone and install kmax
git clone https://github.com/paulgazz/kmax.git
cd kmax
python3 -m venv ~/kmax_env/
source ~/kmax_env/bin/activate
pip3 install ./
Alternatively, installing for development to obviate the need to reinstall when making changes to the code
pip3 install -e ./
klocalizer --repair
can repair existing configuration files to include given source code directories, files, and lines from Linux kernel source code. For instance, let's use it to repair the minimal allnoconfig
so that it includes specific lines of kernel/bpf/cgroup.c
. allnoconfig
will not even include the source file,
cd ~/linux-5.16/
make allnoconfig
make kernel/bpf/cgroup.o
results in an error: make[2]: *** No rule to make target 'kernel/bpf/cgroup.o'. Stop.
We can ensure that the relevant configuration options are modified in allnoconfig
in order to include this file and any specified lines:
make allnoconfig # configuration file stored in .config
klocalizer --repair .config --include-mutex kernel/bpf/cgroup.o:[1354,1357-1359]
This produces a new version of the .config
file in 0-x86_64.config
. To build with it, install it with olddefconfig
and attempt to build the source file:
cp 0-x86_64.config .config
make olddefconfig
make kernel/bpf/cgroup.o
This time, the source file is successfully built: CC kernel/bpf/cgroup.o
. Any number of --include-mutex
constraints may be added. If there is mutual-exclusion among source files, klocalizer
will as many configuration files needed to cover all constraints. Always on or off constraints can be added with --include
or --exclude
. See the documentation on klocalizer
and krepair
for more usage information.
git clone git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git
cd linux/
git checkout -f 775da83005cb; git diff HEAD~~ > patch.diff; make defconfig; cp .config defconfig; klocalizer -v -a x86_64 --repair defconfig --include-mutex patch.diff --formulas ../formulacache; rm -rf koverage_files/; koverage -v --arch x86_64 --config 0-x86_64.config --check-patch patch.diff -o out;
commit=1fc31638eb79eff8b1fb0e1bfcd5f9dbddbf16e2; git checkout -f $commit; git show > patch.diff; make defconfig; cp .config defconfig; klocalizer -v --arch x86_64 --repair defconfig --include-mutex patch.diff; rm -rf koverage_files/; koverage -v --config 0-x86_64.config --check-patch patch.diff -o out1 --arch x86_64;
commit=64ba2eb35fa076d5914e3a3e374898ca31c29e84; git checkout -f $commit; git show > patch.diff; make defconfig; cp .config defconfig; klocalizer -v -a x86_64 --repair defconfig --include-mutex patch.diff; rm -rf koverage_files/; koverage -v --config 0-x86_64.config --check-patch patch.diff -o out2 --arch x86_64
- kmax failed, superc failed
- unconstrained
- headers not included
Try running java superc.SuperC -sourcelinePC /dev/null /dev/null
. If the class is not found, double-check the SuperC installation and your CLASSPATH.
If you get a java.lang.NoSuchMethodError: 'com.microsoft.z3.BoolExpr com.microsoft.z3.Context.mkAnd(com.microsoft.z3.BoolExpr[])'
error, then there is a mismatch between the z3 java library. Please be sure that your superc.jar
and z3-4.8.12-x64-glibc-2.31/bin/com.microsoft.z3.jar
jarfiles downloaded according to the SuperC installation directions. Or try building SuperC from scratch.
To repair allnoconfig
to include changed lines from a range of commits, first get the diff file, checkout the version having the patch(es) applied, then call repair:
cd ~/
git clone git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git
cd linux
# create a patchset from a range of commits
git diff 44779a4b85abd..6fc88c354f3af > patchset.diff
# be sure to checkout the last commit so that krepair can see the full patched source code
git checkout 6fc88c354f3af
# get allnoconfig
make ARCH=x86_64 allnoconfig
# repair allnoconfig, excluding specific directories
klocalizer --repair .config -a x86_64 --include-mutex patchset.diff --exclude drivers/dma
# build the affected source files using the repaired configuration file
KCONFIG_CONFIG=./0-x86_64.config make.cross ARCH=x86_64 olddefconfig clean kernel/bpf/cgroup.o net/ipv4/af_inet.o net/ipv4/udp.o net/ipv6/af_inet6.o net/ipv6/udp.o net/unix/af_unix.o
klocalizer
will also take patches ending in .patch
, e.g., from git format-patch
in addition to .diff
as long as the file is in the unified diff format. Be sure to run klocalizer
from the already-patched source tree, since this contains the constraints of the resulting code from the patch.
kismet
's routine consists of a three step analyses pipeline, each at different precisions.
After the identification of the select constructs, each construct is seen as a potential unmet direct dependency case, thus, marked to raise unmet dependency alarm. The subsequent stages works on the alarms of the previous stage, trying to rule out alarms as unmet dependency safe, thus, increasing the precision. While any stage is at least as precise as the stages before it, the earlier stages helps to increase the performance.
Given the fact that a select construct is unmet dependency safe if the selectee has no direct dependency, this stage marks such constructs as unmet safe. This is very fast, and does not involve any SAT solvers but only syntactical analysis of the constructs.
For a select construct to be unmet dependency free, the kclause constraints for the architecture together with the additional unmet dependency free constraints (which we call optimized constraints) must be satisfiable. We name this whole set of constraints as precise constraints. Given the observation that the optimized constraints are much smaller than the precise constraints, SAT solvers can decide the optimized constraints much faster. In this stage, kismet
discharges the optimized constraints to a SAT solver to rule out more constructs as unmet dependency safe.
In this stage, kismet
discharges the whole set of constraints (optimized and kclause architecture constraints, i.e., precise constraints) to a SAT solver to analyse remaining alarms and rule them as safe. This is the last stage of the static analysis pipeline, where any construct could not be ruled to be unmet safe raise unmet alarms.
Upon completing the analysis, kismet
writes a detailed summary in CSV format to the standard output.
This includes the following columns:
selectee
: The selectee of the select construct.selector
: The selector of the select construct.visib_id
: The visibility id of the select construct. One (selectee, selector) pair might appear multiple times on different visibility constraints (e.g., withinCONFIG_SELECTOR
, there might be two entries, such as:select SELECTEE if VISIB1
,select SELECTEE if VISIB2
). Such constructs are distinguished with this visibility id.constraint_type
: There are possibly multiple ways to satisfy the optimized constraints. If exploring the whole unmet space is enabled (disabled by default, use'--explore-whole-unmet-space
to enable), all expressions to satisfy the optimized constraints are explored individually, called as SAT options. On the other hand, some random solution to precise constraints is called generic option, which suffices to verify alarms. In the summary, generic option hasconstraint_type
0 while exhaustively explored SAT options haveconstraint_type
ids starting from 1.analysis_result
: one of:UNMET_ALARM
,UNMET_SAFE_SYNTACTIC_PASS
,UNMET_SAFE_OPTIMIZED_PASS
,UNMET_SAFE_PRECISE_PASS
.verified
: If the analysis result is an alarm, includes whether the generated test case for the related construct verifies the alarm.forced_target_udd_only
: If the analysis result is an alarm, includes whether forcing the target unmet dependency only was successful.testcase
: The path to the generated test case, i.e., sample Kconfig config file.
koverage
checks whether a Linux configuration file includes a set of
(source/header file, line) for compilation. It utilizes the Linux build system
to determine code coverage.
cd ~/linux-5.16/
make.cross ARCH=x86_64 allyesconfig
koverage --config .config --arch x86_64 --check kernel/fork.c:[259,261] -o coverage_results.json
This will check whether lines 256 and 261 of kernel/fork.c
are included for compilation by Linux v5.16 allyesconfig.
The coverage results, coverage_results.json
, will have:
{
"headerfile_loc": {},
"sourcefile_loc": {
"kernel/fork.c": [
[259,"INCLUDED"],
[261,"LINE_EXCLUDED_FILE_INCLUDED"]
]
}
}
Looking at kernel/fork.c
, lines 259 and 261 are encapsulated by two different branches of a conditional preprocessor directive (#ifdef
).
koverage
reports that the line from the first branch (line 259) is included for compilation while line from the other branch (line 261) is excluded, as expected since the configuration file enables the first branch.
koverage
will also take patch files in unified diff format, and check patch coverage of a .config
file.
cd ~/linux-5.16/ #< assuming this is a git clone
# create the patch file for commit 7471e1afabf8 in unified diff format
git diff 7471e1afabf8~..7471e1afabf8 > patch.diff
# be sure to checkout to the patched source code
git checkout 7471e1afabf8
make.cross ARCH=x86_64 allyesconfig
koverage --config .config --arch x86_64 --check-patch patch.diff -o coverage_results.json
koverage
will determine a set of coverage requirements for covering the input patch, and check whether these are satisfied. Below is the content of output coverage_results.json
file, showing all modified
(file:line) pairs from the patch are included for compilation by the .config
file.
{
"headerfile_loc": {
"include/trace/events/io_uring.h": [
[293,"INCLUDED"],
[297,"INCLUDED"],
[298,"INCLUDED"],
[299,"INCLUDED"],
[300,"INCLUDED"],
[306,"INCLUDED"],
[313,"INCLUDED"],
[315,"INCLUDED"],
[316,"INCLUDED"],
[317,"INCLUDED"],
[318,"INCLUDED"]
]
},
"sourcefile_loc": {
"fs/io_uring.c": [
[1509,"INCLUDED"],
[1510,"INCLUDED"]
]
}
}
Any number of --check
targets may be added, and mixed with --check-patch
and --check-covreq
.
The results may one of three values for each line for both headers and source files (compilation units), and should be interpreted as follows:
INCLUDED
: (file,line) is included.LINE_EXCLUDED_FILE_INCLUDED
- If compilation unit, unit is successfully preprocessed but line is not included.
- If header, header file is included by some compilation unit specified in the coverage requirements, but line is not. However, line might be included by some unseen compilation unit.
FILE_EXCLUDED
- If compilation unit, file is not included (preprocessing failed).
However, this might be a false alarm due to build issues (i.e.,
a compiler error that prevents preprocessing while
.config
file has constraints to include (file,line)). - If header, file is not included in any of the compilation units preprocessed. However, (file,line) might be included by some unseen compilation unit.
- If compilation unit, file is not included (preprocessing failed).
However, this might be a false alarm due to build issues (i.e.,
a compiler error that prevents preprocessing while
TIMEOUT_MAKE_OLDDEFCONFIG
: Runningmake olddefconfig
on the input configuration file timed out.TIMEOUT_MAKE
: Runningmake
on the build target timed out.
Run klocalizer for a given compilation unit, e.g.,
klocalizer drivers/usb/storage/alauda.o
This will search each architecture for constraint satisfiability,
stopping once one is found (or no architecture's constraints are
satisfiable). klocalizer
writes this configuration to .config
and
prints the architectures, e.g., x86_64
, to standard out.
To build the compilation unit using the generated .config
, use
make.cross.
First set any defaults for the .config
file:
make.cross ARCH=x86_64 olddefconfig
Then build the compilation unit:
make.cross ARCH=x86_64 drivers/usb/storage/alauda.o
If you cannot get a configuration or it is still not buildable, see the Troubleshooting section.
While allyesconfig
strives to enable all options, some have conflicting dependencies or are mutually exclusive choices. For instance, fs/squashfs/decompressor_multi.o
is not compiled when using allyesconfig
:
make allyesconfig
make fs/squashfs/decompressor_multi.o
make
fails:
make[3]: *** No rule to make target 'fs/squashfs/decompressor_multi.o'. Stop.
Let us take a look at the unit's Kbuild dependencies:
klocalizer --view fs/squashfs/decompressor_multi.o
The output in part is
fs/squashfs/decompressor_multi.o
[And(CONFIG_SQUASHFS, CONFIG_SQUASHFS_DECOMP_MULTI)]
The unit is not included in allyesconfig
because it on both CONFIG_SQUASHFS
and CONFIG_SQUASHFS_DECOMP_MULTI
. The latter is disabled by default, being mutually exclusive with SQUASHFS_DECOMP_SINGLE
which is selected by allyesconfig:
make allyesconfig
egrep "(CONFIG_SQUASHFS|CONFIG_SQUASHFS_DECOMP_SINGLE|CONFIG_SQUASHFS_DECOMP_MULTI)( |=)" .config
grep
shows us the relevant settings:
CONFIG_SQUASHFS=y
CONFIG_SQUASHFS_DECOMP_SINGLE=y
# CONFIG_SQUASHFS_DECOMP_MULTI is not set
klocalizer
can find a configuration that includes the unit:
klocalizer fs/squashfs/decompressor_multi.o
egrep "(CONFIG_SQUASHFS|CONFIG_SQUASHFS_DECOMP_SINGLE|CONFIG_SQUASHFS_DECOMP_MULTI)( |=)" .config
grep
shows us what klocalizer
set:
CONFIG_SQUASHFS=y
CONFIG_SQUASHFS_DECOMP_MULTI=y
# CONFIG_SQUASHFS_DECOMP_SINGLE is not set
Finally, building the configuration
make olddefconfig
make fs/squashfs/decompressor_multi.o
gives us
CC fs/squashfs/decompressor_multi.o
A kernel user or developer may want a smaller kernel that includes a specific compilation unit, rather than having to build allyesconfig
. For instance, drivers/infiniband/core/cgroup.o
is not built by default:
make defconfig
make drivers/infiniband/core/cgroup.o
The output contains
make[2]: *** No rule to make target 'drivers/infiniband/core/cgroup.o'. Stop.
klocalizer
can look for a configuration containing the compilation unit that closely matches a given configuration without it by successively removing conflicting constraints until the configuration is valid:
make defconfig
klocalizer --approximate .config drivers/infiniband/core/cgroup.o
Now when building the configuration, the compilation unit is included:
make olddefconfig
make drivers/infiniband/core/cgroup.o
The output contains:
CC drivers/infiniband/core/cgroup.o
Sometimes a compilation unit is only available for certain architectures. Compiling drivers/block/ps3disk.o
won't compile on an x86
machine.
make allyesconfig
klocalizer drivers/block/ps3disk.o
Its output contains
make[3]: *** No rule to make target 'drivers/block/ps3disk.o'. Stop.
klocalizer --view drivers/block/ps3disk.o
shows us that it depends on CONFIG_PS3_DISK
. It turns out that this configuration option in turn depends on, among others options, the powerpc architecture.
klocalizer
can try the constraints from each architecture:
klocalizer drivers/block/ps3disk.o
It tells us that powerpc
is a satisfying architecture. We can use make.cross
to cross-compile for powerpc
.
make.cross ARCH=powerpc olddefconfig
make.cross ARCH=powerpc drivers/block/ps3disk.o
Its output contains
CC drivers/block/ps3disk.o
We can combine several klocalizer
features to build an allnoconfig
kernel that adds in the ps3disk.o
compilation unit and sets all tristate
options to modules.
make.cross ARCH=powerpc allnoconfig
klocalizer -a powerpc --match .config --modules --define CONFIG_MODULES drivers/block/ps3disk.o
make.cross ARCH=powerpc olddefconfig
make.cross ARCH=powerpc drivers/block/ps3disk.o
Its output contains
CC [M] drivers/block/ps3disk.o
./scripts/kconfig/merge_config.sh -n partialconfigfile > mergeout
By default, klocalizer
checks each architecture's Kconfig
constraints against the Kbuild constraints for the given compilation
unit. The following are examples of how to customize this process.
-
Controlling the search of architectures
Use
-a
to only search a specific architecture.klocalizer -a x86_64 drivers/usb/storage/alauda.o
Specify multiple
-a
arguments to search the given architectures in given order.klocalizer -a x86_64 -a sparc drivers/watchdog/cpwd.o
Specify
-a
and-all
to search all architectures, trying the ones given in-a
first.klocalizer -a x86_64 -a arm --all drivers/watchdog/cpwd.o
-
Generating an arbitrary configuration for an architecture
Pass a single architecture name without the compilation unit to generate an arbitrary configuration for that architecture. Passing multiple architectures is not supported.
klocalizer -a x86_64 drivers/watchdog/cpwd.o
-
Finding all architectures in which the compilation can be configured
klocalizer --report-all
-
Setting additional configuration options
Multiple
--define
and--undefine
arguments can be used to force configurations on or off when searching for constraints.klocalizer --define CONFIG_USB --define CONFIG_FS --undefine CONFIG_SOUND drivers/usb/storage/alauda.o
Note that this can prevent finding a valid configuration.
klocalizer -a x86_64 --undefine CONFIG_USB drivers/usb/storage/alauda.o # no configuration possible because alauda depends on USB
-
Investigating unsatisfied constraints
Use
--show-unsat-core
to see what constraints are causing the issue:klocalizer --show-unsat-core -a x86_64 --undefine CONFIG_USB drivers/usb/storage/alauda.o # no configuration possible because alauda depends on USB
-
Closely match a given configuration
Klocalizer will attempt to match a given configuration, while still
maintaing the configuration options necessary to build the given
compilation unit. This works by passing it an existing configuration,
e.g., allnoconfig
, with the --approximate
flag.
make allnoconfig
mv .config allnoconfig
klocalizer --approximate allnoconfig drivers/usb/storage/alauda.o
klocalizer with specific file
-
Viewing the Kbuild constraints
View the Kbuild constraints for a compilation unit and each of its subdirectories with
klocalizer --view-kbuild drivers/usb/storage/alauda.o
-
Building as modules instead of built-in
Use the
--modules
flag to set any tristate options tom
instead ofy
. Be sure to enable theCONFIG_MODULES
option as well.klocalizer --modules --define CONFIG_MODULES drivers/usb/storage/alauda.o make olddefconfig make drivers/block/ps3disk.o
-
Using new formulas
Override the default formulas with the following:
klocalizer --kmax-formula kmax --kclause-formulas kclause drivers/watchdog/cpwd.o
-
Generating multiple configurations
klocalizer -a x86_64 --random-seed 7849 --sample 8 --sample-prefix config
-
If you see a message like
ERROR:Kextract failed
, it likely means the Kconfig parser is out-of-date (or klocalizer cannot figure out what version of the Kconfig parser to use.) Please submit an issues to have the Kconfig parser updated. -
Use the
CONFIG_
prefix on variables when referring to them in user constraints. -
Use the
.o
ending for compilation units (thoughklocalizer
will change it automatically.) -
The extracted formulas may not be exact. No resulting configuration is a sign that the formulas are overconstrained. A resulting configuration that does not include the desired compilation unit may mean the formulas are underconstrained.
-
Compilation unit not buildable. There are several possible reasons:
-
The compilation unit has already been compiled. First clean with
make clean
-
While most compilation units can be built individually with make, some cannot. In these cases, build the parent directory instead, e.g.,
klocalizer drivers/char/ipmi/ipmi_devintf.o # finds it buildable in x86_64 make.cross ARCH=x86_64 olddefconfig make.cross ARCH=x86_64 drivers/char/ipmi/
-
Composites do not correspond to source files and are not built directly via
make
. Instead they are composed of other compilation units. For instance,drivers/block/zram/zram.o
is comprised ofzcomp.o
andzram_drv.o
. After finding a satisfying configuration, build the parent directory to see the source files that comprise it built.klocalizer --approximate .config drivers/block/zram/zram.o make olddefconfig make drivers/block/zram/
-
The configuration causes the unit to be built, but it has a compile-time error.
klocalizer drivers/block/amiflop.o # finds it buildable in make.cross ARCH=m68k olddefconfig make.cross ARCH=m68k drivers/block/amiflop.o # Makefile sees it, but causes compiler error.
-
Klocalizer's formulas were wrong in some cases. Please file an issue with source version number and klocalizer command used.
-
-
If the unit's configuration constraints depend on
CONFIG_BROKEN
, thenklocalizer
, by default, which detect it and stop searching, because the compilation unit may not be (easily) compilable.klocalizer drivers/watchdog/pnx833x_wdt.o # stops after finding a dependency on `CONFIG_BROKEN`
To get a configuration anyway, use
--allow-config-broken
klocalizer --allow-config-broken drivers/watchdog/pnx833x_wdt.o # finds dependency on mips make.cross ARCH=mips olddefconfig make.cross ARCH=mips drivers/watchdog/pnx833x_wdt.o # won't be included in the build, due to CONFIG_BROKEN
We provide a Dockerfile for extracting the Linux kernel model as DIMACS and SMT as well.
Refer to Docker installation site if you have not already installed Docker.
The provided script works on v3.x and newer Linux kernels and takes the Linux version as an argument for the build command.
Check out the Linux kernel archive to select the Linux version of interest.
For instance, we can use the script to generate the Linux kernel model for version 5.16
.
Make sure that your docker service is running. If you use systemd, you can use:
sudo systemctl start docker
To build the docker image, execute the following from the scripts
directory of this repository and replace <version>
by the kernel version number of interest (e.g., 5.16
):
sudo docker build -t kmax ./ --build-arg linux_version=<version>
The build process takes some time. In the end, it produced a DIMACS and SMT model inside the image. To extract it, you have to start a container:
sudo docker run -d -t kmax
You should get a container id. If not, you can additionally use the following command to retrieve the docker container id:
sudo docker container ls
Afterwards, you can copy the files using docker cp
to your current directory:
sudo docker cp <containerID>:/home/linux-<version>.smt .
sudo docker cp <containerID>:/home/linux-<version>.dimacs .
Be aware to replace <version>
by your provided version number.
Now you can find a model in DIMACS and SMT of the given linux version.
The .kmax
holds the formula cache to avoid having to regenerate
formulas on each run of klocalizer. klocalizer will also look for
prebuilt formulas to download and store in this directory. To support
caching multiple versions of the Linux source, i.e., when using a git
repo, the .kmax
folder's subfolders are Linux versions, i.e., git
tags. klocalizer will gather the current version of Linux (either via
git describe --abbrev=0 --tags
or make kernelversion
).
Get the BusyBox source:
git clone https://git.busybox.net/busybox
cd busybox
git checkout 1_28_0 # or whatever version you need
Prepare directories for formulas:
mkdir .kmax/
Generate the Config.in files:
make gen_build_files
Get the Kconfig constraint formulas:
kextract --module-version 3.19 --extract Config.in > .kmax/kextract
kclause --remove-orphaned-nonvisible < .kmax/kextract > .kmax/kclause
The number of dictionary entries will be fewer than the total number of configuration options, because this map only stores configuration options that have dependencies. Options without dependencies will not have a dictionary key (although they may be used in the dependencies of other options).
Get the Kbuild file constraint formulas:
kmaxall $(find | grep "Kbuild$" | cut -c3-) | tee .kmax/kmax
Unlike Linux, BusyBox will build a .o
with make
, even if it is not configured in, e.g.,
make clean
make allnoconfig
make coreutils/fsync.o
This will compile coreutils/fsync.o
even though it wouldn't have been built with make
, e.g.,
make clean
make allnoconfig
make
The coreutils/fsync.o
file should not exist
$ ls coreutils/fsync.o
ls: cannot access 'coreutils/fsync.o': No such file or directory
Here is how to use klocalizer
to create a config that includes coreutils/fsync.o
make clean
make allnoconfig
mv .config allnoconfig
klocalizer --approximate allnoconfig coreutils/fsync.o # produces .config file that builds fsync.o
yes "" | make oldconfig # to accept default values for other options
make
The coreutils/fsync.o
file should now be there
$ ls coreutils/fsync.o
coreutils/fsync.o
The reason for approximating allnoconfig
is to avoid adding
configuration options that may break the build on certain systems.
Using yes ""
accepts any default values of options not forced by
constraints.
This will run Kmax on the example from the paper on Kmax.
kmax tests/kbuild_tests/paper_example
This will output the list of configuration conditions for each compilation unit file in the example Kbuild file. By default, Kmax to treat configuration options as Boolean options (as opposed to Kconfig tristate options). Pass -T
for experimental support for tristate.
unit_pc tests/kbuild/fork.o 1
unit_pc tests/kbuild/probe_32.o (CONFIG_A && CONFIG_B)
unit_pc tests/kbuild/probe_64.o ((! CONFIG_A) && CONFIG_B)
The unit_pc
lines have the format of compilation unit name followed by the Boolean expression, in C-style syntax. The Boolean expression describes the constraints that must be satisfied for the compilation unit to be included. Use -z
to emit the z3 formulas in smtlib2 format.
There is a script that will run Kmax on all Kbuild Makefiles from a project, e.g., the Linux kernel source code.
First get the Linux source and prepare its build system.
wget https://cdn.kernel.org/pub/linux/kernel/v5.x/linux-5.3.11.tar.xz
tar -xvf linux-5.3.11.tar.xz
cd linux-5.3.11
make defconfig # any config will work here. it's just to setup the build system.
To try Kmax on a particular Kbuild Makefile, use the kbuildplus.py
tool:
kmax ipc/
This will run Kmax on a single Kbuild Makefile, and show the symbolic configurations for each compilation unit and subdirectory. Kmax can also recursively analyze Kbuild Makefiles by following subdirectories, use the kmaxdriver.py
which uses kbuildplus.py
to process each Kbuild Makefile and recursively process those in subdirectories. -g
means collect the symbolic constraints.
kmaxall -g net/
Kmax includes a Makefile hack to get all the top-level Linux directories. Combined with kmaxall
this command will collect the symbolic constraints for the whole (x86) source, saving them into unit_pc
. Be sure to change /path/to/kmax
to your kmax installation to get the Makefile shunt.
kmaxall -g $(make CC=cc ARCH=x86 -f /path/to/kmax/scripts/makefile_override alldirs) | tee kmax
kreader drivers/usb/storage/alauda.o kernel/trace/trace_i
kreader --kmax-formula .kmax/kclause/x86_64/kmax drivers/ | grep ".o$" | wc -l kreader --kmax-formula .kmax/kclause/arm/kmax drivers/ | grep ".o$"| wc -l
Kmax uses both BDDs (as described in the original paper) and a Z3 solver. Z3 performs better when constructing the logical formulas, but slower when checking satisfiability than BDDs. Z3 is best for displaying the resulting formula, since it doesn't require the exhaustive enumeration of the BDD, and BDD is best for checking the (un)satisfiability of formulas, which Kmax uses to trim infeasible states during analysis.
Kclause extracts a logical model from Kconfig. It works in two stages:
-
The
kextract
tool uses the Kconfig parser shipped with Linux to extract configuration variables dependencies to an intermediate language. -
The
kclause
tool takes this intermediate language and generates a z3 formula.
Then, from the root of a Linux source tree, run the following:
kextract --extract -e ARCH=x86_64 -e SRCARCH=x86 -e KERNELVERSION=kcu -e srctree=./ -e CC=cc -e LD=ld Kconfig > kextract
kclause --remove-orphaned-nonvisible < kextract > kclause
# all the configs that have a prompt condition
grep "^prompt " kconfig.kclause | cut -f2 -d\ | sort | uniq | tee visible.txt
# all the configs
grep "^config " kconfig.kclause | cut -f2 -d\ | sort | uniq | tee configs.txt
# the visibles should be a subset of the configs
diff configs.txt visible.txt | grep ">"
// whole file
file := line*
line := config_line | prompt_line | env_line | nonbool_line | clause_line | boolchoice_line | dep_line | bi_line | constraint_line
// lines
config_line := 'config' config_var type_name
prompt_line := 'prompt' config_var '(' expr ')'
env_line := 'env' config_var
def_bool_line := 'def_bool' config_var bool_value '(' expr ')'
def_nonbool_line := 'def_nonbool' config_var nonbool_value '|' '(' expr ')'
clause_line := 'clause' clause_elem+
choice_line := choice_type config_var+ '|' '(' expr ')'
choice_type := 'bool_choice' | 'tristate_choice' | 'bool_opt_choice' | 'tristate_opt_choice'
dep_line := dep_name config_var '(' expr ')'
select_line := 'select' config_var config_var '(' expr ')'
dep_name := 'dep' | 'rev_dep'
bi_line := expr '|' expr
contraint_line := expr
// expressions
type_name := 'bool' | 'string' | 'number'
bool_value := '1' | '0'
nonbool_value := string
clause_elem := '-'? config_var
expr := expr 'and' expr | expr 'or' expr | 'not' expr | config_var | nonbool_expr | '1' | '0'
config_var := string
- All
config_line
s should come before anything else related to the declared config var. - All
prompt_line
s,def_bool_line
s, anddef_non_line
s should come before dependencies. - A config with no
prompt_line
is unconditionally nonvisible to the user. - An
env_line
says that a variable can be set by the user via an environment variable. clause
s are symbolic dimacs cnf clauses, where instead of numbers, they use the string name of the variablebool_choice
is a mutually-exclusive choice between the given config vars. The entire choice can have a dependency. Depedencies on the individual choice variables' dependencies are expressed with separatedep
lines.dep
s are a kconfig dependency. selectingconfig_var
implies that theexpr
holds true. we can assume this line comes after the definition of the variable withbool_line
, etc.- the
expr
may contain non-boolean relations, which can themselves be treated as a boolean variable. 1 and 0 mean true and false respectively. select
says that the first var gets selected by the second var, under the given conditional expression. note that the given conditional expression may be a duplicate of the selecting vars dependencies and can be deduplicated. note also there may be repeatedselect
s for the same combination of selected and selecting variables, because the same config can be declared multiple times, each selecting the same variable, optionally with differing conditions.def_bool
defines (possibly multiple) defaults for a boolean variable. This is only meant for nonselectable booleans, since it will constrain a variable to that value.def_nonbool
defines (possibly multiple) defaults for variable nonboolean.bi_line
andcontraint_line
allow for converting expressions directly to the DIMACS format. Any variables used must be declared first with aconfig_line
.
- Copy scripts/kconfig to kextractors (following naming pattern)
- Copy kextractor.c, kextractor_extension.c, Makefile, and README.md from nearby version
- Update expr.h and others based on README.md
- Update kextractor_extension.c to have updated version name (e.g.,
3_19
to3_2
) - Add the extension's list of files to setup.py
- Add the extension's list variable name to ext_modules in setup.py
- Add a mapping from the version number to the module in kmax/kextractcommon.py