Initial Fuzzing Infrastructure #611

fitzgen · 2019-11-21T00:08:54Z

I plan on laying out some foundational fuzzing infrastructure for Wasmtime in the next few weeks. I'd like to use this issue as a kind of meta issue to keep track of this work. I'd also appreciate feedback on the plan from anyone with experience fuzzing or domain knowledge of a particular thing we plan on fuzzing.

Goals

Find bugs!
- Bugs that we wouldn't otherwise find until our users hit them.
- Bugs that are hard to manually write test cases for, or that you wouldn't even think of testing for.
Make bugs (fuzzer-found or otherwise) easier to debug via automatic test case reduction.

Strategy

Breadth not Depth

At least initially, let's build out a few different fuzzing approaches enough that they start identifying bugs, but not spend a ton of time building bespoke tools tailored for exactly the problems we have at hand.

My assumptions are that

we have low-hanging fruit available, since we haven't done a ton of fuzzing for a bunch of corners yet, and
different fuzzing approaches tend to uncover different sets of bugs.

Therefore, by making a bunch of different just-good-enough fuzzers, we will repeatedly discover new, unique low-hanging fruit bugs.

Additionally, this gives us a nice foundation that we can spring board off of in the future when we decide to go deeper in any particular direction.

Decouple Generators and Oracles

A generator creates test cases (usually given an RNG or a random byte stream input). An oracle determines if executing a test case uncovered a bug. In general, it is good software engineering to separate concerns, but separating these two parts specifically allows us to:

reuse oracles during automatic test case reduction (a la creduce), and
swap out existing, off-the-shelf generators with more intelligent, custom generators the future.

Implementation

In general, I recommend that we use libFuzzer to drive our fuzzing. It is coverage-guided, which means it can find interesting code paths more quickly than testing purely random inputs will. It also has a nice Rust interface in the form of cargo-fuzz.

Any custom generators we create should take libFuzzer-provided input bytes and then re-interpret that as a sequence of random values to drive choices inside the generator. This lets us combine the benefits of smart, structure-aware generators with those of coverage-guided fuzzing. We can implement this by implementing our custom generators in terms of the arbitrary crate's Arbitrary trait.

As far as test case reduction goes, when a generator is creating Wasm files, it should be relatively easy to use binaryen's wasm-reduce on the Wasm file, or use creduce on the WAT disassembly. We can, however, do some small things to make the process turnkey:

Write glue scripts for running wasm-reduce and/or creduce on a Wasm test case with any of our various oracles

For generators that are creating custom in-memory data structures by implementing the Arbitrary trait, test case reduction requires we implement some custom logic. The Arbitrary trait supports defining a custom shrink method that takes &self and returns an iterator of smaller instances of Self. We can use this to create custom test case reduction for each of our custom test case generators.

Finally, any custom generator we create (and any generator we wrap that supports turning the generation of individual test case features on/off) should support swarm testing. Swarm testing is where we randomly turn on/off the generation of various test case features (such as, should a generator create Wasm test cases that use call_indirect or not?) so that we are more likely to generate pathological test cases where bugs are more likely to be found. This is relatively easy implement and should yield

Fuzzing Wasmtime's Embedding API

This is a case where, unfortunately, we can't really use existing off-the-shelf solutions.

Generators

Build a custom generator that creates a sequence of API calls. It shouldn't perform the calls, just describe them. This generator should have some smarts about knowing how to generate valid API calls.

Oracles

Interpret API call descriptions and perform the actual API call. Find unexpected panics, assertion failures, and segfaults.

Wasm Execution Fuzzing

We should fuzz our execution of Wasm. Yes, Cranelift has some fuzzing in SpiderMonkey, but we should also make sure that all of our Wasmtime-specific JIT'ing machinery is well fuzzed, as well as our WASI implementation and sandboxing.

Generators

Use wasm-opt -ttf to generate random, valid Wasm files.
Write a custom generator that creates Wasm files that make sequences of WASI syscalls.

Oracles

More Stuff to Explore in the Future

Add support for code-coverage in Cranelift and leverage it to build equivalence-module-inputs testing and coverage-guided fuzzing for Wasmtime
- Alternatively, we could MacGyver some custom code coverage scheme via instrumenting Wasm files with Walrus instead of doing this inside Cranelift at the clif level.
Create test case generators and oracles for our Wasm interface types support? What would be involved here is not super clear to me yet.

Questions

Should the fuzzing corpus be committed into the git repo? Or perhaps should it be a separate repo that we include as a git submodule?
What work here should we prioritize?
- In particular, what variants would be most valuable to compare / most likely to uncover high-priority bugs in differential fuzzing of Wasm execution?
Is there anything here you think we should not implement?
Are there any other WASI-targeted oracles we can create? The strace idea is pretty half-baked right now. I'd appreciate some more ideas from folks more involved in the WASI side of things than I am...

The text was updated successfully, but these errors were encountered:

acfoltzer · 2019-11-21T00:24:29Z

In Lucet, I wrote a simple fuzzing script that uses Csmith-generated C programs: https://github.com/bytecodealliance/lucet/blob/master/lucet-wasi-fuzz/src/main.rs

The approach is to run each program via Lucet on WASI:

.c -[wasm32-wasi-clang]-> .wasm -[lucetc]-> .so -[lucet-wasi]-> stdout

Then compare the stdout against a native oracle:

.c -[i686-linux-clang]-> a.out -[exec]-> stdout

It's pretty bare-bones, other than the ability to run a creduce loop when a failure is found, but it should be possible to hook it up to libfuzzer and wasmtime.

kubkon · 2019-11-21T05:55:05Z

@fitzgen I haven't done a lot of fuzzing in the past, but I'll be more than happy to learn on the job and help out any way I can in testing out our WASI implementation. @acfoltzer lemme know if you need any help in potentially reusing your Lucet fuzzing harness in Wasmtime!

alexcrichton · 2019-11-21T15:42:22Z

For interface types specifically I suspect that the generator won't be too too different than what wasm generator we might have, unless we heavily base it on wasm-opt in which case we'd have to write our own fuzz case generator.

For an oracle I think our best bet will be to have someone entirely disconnected from the wasmtime interface types work to write an interpreter, and then we'd compare the two implementations against each other. I suspect we'd discover bugs in both, but I don't think we have much of an oracle otherwise right now.

acfoltzer · 2019-11-21T17:50:33Z

@acfoltzer lemme know if you need any help in potentially reusing your Lucet fuzzing harness in Wasmtime!

Thanks, @kubkon! I'm actually not going to have time to work on this for a few weeks at least, so if you're feeling eager, don't worry about jumping in and pinging me if you need any support.

sunfishcode · 2019-11-21T21:21:31Z

On the topic of oracles, the strace idea is appealing, as it doesn't require admin privileges and doesn't depend on cooperation from the VM. Ideally we'd write our own ptrace utility rather than literally using strace, so that we can catch sandbox violations when they happen, which protects the host system better and gives fuzzers a better picture of what's happening.

Another option is to use LD_PRELOAD to interpose between the application and libc, which ought to be faster than ptrace, and simpler to implement, though it would depend on applications being dynamically linked to libc.

fitzgen · 2019-11-21T22:10:57Z

Good point that there are a few different tools we have at our disposal to observe syscalls. There is probably some eBPF APIs and perf tools we could use too.

I would lean towards whatever is both

easy to implement, and
doesn't require us to blacklist each individual syscall, but instead lets us whitelist things we don't care about (that is, the default should be that we are checking things, without us having to do O(N) work to observe N different kinds of syscalls)

Unless I'm mistaken, LD_PRELOAD wouldn't work well for the latter, since we would have to manually implement overwriting a symbol for every libc API we wanted to observe.

This crate is intended to hold all of our various test case generators and oracles. The fuzz targets we have at `wasmtime/fuzz/fuzz_targets/*` will eventually be ~one-liner glue code calling into this crate. Part of bytecodealliance#611

Part of bytecodealliance#611

jfoote · 2019-11-22T21:52:57Z

Use wasm-opt -ttf to generate random, valid Wasm files.

This is a good idea. We did something similar for a cranelift fuzz target. One downside to this approach is that the fuzz target cannot be seeded with a distilled corpus of valid-ish Wasm modules (since the input is a bitstring). Likewise, corpuses that are accumulated as fuzzers run will not be readily recyclable between generators that consume bitstrings (IIUC).

These are not good reasons not to take this approach, but something to consider for future work. Overall this looks great. I like the equivalence checking idea.

fitzgen · 2019-11-25T23:27:33Z

Should the fuzzing corpus be committed into the git repo? Or perhaps should it be a separate repo that we include as a git submodule?

FYI, some discussion about this over here: rust-fuzz/cargo-fuzz#194

pventuzelo · 2019-11-26T18:46:29Z

Hi guys, i'm planning to do some fuzzing on lightbeam in the next weeks ;)

Just to give you a bit of context about me, I'm the guy behind webassembly-security.com and I'm teaching WebAssembly security and Rust security. I'm focused on fuzzing and vulnerability research on both WebAssembly (module & VM) and Rust code, so don't hesitate to ping me if needed ;)

I agree with @jfoote, regarding using binaryen translate_to_fuzz for fuzzing. Main issue will be crash replay because I think (need to be verify) binaryen is not consistent on wasm generation (i.e. same input can generate 2 different module). Also, generated wasm section are often the same in the final wasm module, meaning some part of the VM/parser will be difficult to reach

@fitzgen Regarding where to store fuzzing corpus, i would suggest a specific repo or server not link to this one to prevent user to download all those files accidentally. Also, corpus need to be minimize before being pushed in this storage repo.

In general, you should have one fuzz target per APIs and per backends since corpus will evolved differently depending of the code triggered.

fitzgen · 2019-11-26T18:51:33Z

binaryen is not consistent on wasm generation (i.e. same input can generate 2 different module).

wasm-opt -ttf will generate the same output given the same input; it is deterministic.

fitzgen · 2019-11-27T00:01:41Z

I've set up a repo for the libFuzzer corpora here: https://github.com/bytecodealliance/wasmtime-libfuzzer-corpus

pventuzelo · 2019-12-03T10:47:31Z

binaryen is not consistent on wasm generation (i.e. same input can generate 2 different module).

wasm-opt -ttf will generate the same output given the same input; it is deterministic.

Right ;)

Regarding the libfuzzer corpus, have you evaluate the actual code coverage?

fitzgen · 2019-12-03T18:25:45Z

Right ;)

Right.

I've never seen the same input to wasm-opt -ttf generate different outputs. There may be bugs somewhere, but I've never hit them. If you know of bugs, I'm sure that they would love to have bug reports.

Regarding the libfuzzer corpus, have you evaluate the actual code coverage?

I have not. So far, I haven't been focused on doing the fuzzing itself so much as setting up the infrastructure, implementing oracles, etc.

jfoote · 2019-12-17T14:50:29Z

Hello all. I looked into using oss-fuzz for continuous fuzzing of libFuzzer/cargo fuzz/libfuzzer-sys fuzz targets. oss-fuzz is appealing since it supplies significant free computational resources for fuzzing open source projects, supplies a private bug tracker and surrounding policy/process for coordination, provides a useful source code coverage mapping web UI, etc.

Here are my notes on a basic few gaps that would need to be addressed to integrate with oss-fuzz as-is:

builds should use oss-fuzz-supplied feedback-coverage instrumentation flags
- oss-fuzz supplies CC/CXX coverage instrumentation for building fuzz targets via CFLAGS/CXXFLAGS, e.g. -fsanitize=fuzzer and -fsanitizer=fuzzer-no-link (ref)
- cargo fuzz uses a statically defined set of flags. These may be compatible with the set used by oss-fuzz/-fsanitize=fuzzer today, I did not check
builds must use oss-fuzz-supplied sanitizer instrumentation
- cargo fuzz and libfuzzer-sys support the ASAN and UBSAN, the default sanitizers used by oss-fuzz. The syntax for passing them via the command line varies ofc.
builds should statically link oss-fuzz's version of libfuzzer (i.e. libFuzzingEngine) into the fuzz target
- libfuzzer-sys uses a vendored copy of libfuzzer that is updated periodically by the maintainers
builds must statically link the fuzz target binary to copy out to clusterfuzz
- also, the binary must support libfuzzer-compatible command flags
- cargo fuzz already builds a standalone fuzz target binary as part of cargo fuzz run, but building does not appear to be exposed as a standalone step
builds should support clang coverage builds
- oss-fuzz uses clang source-based coverage to generate precise coverage data for its source code coverage-mapping UI
- there is discussion of supporting an analogous feature in Rust, with some recent activity

Last year there was some discussion in the oss-fuzz project of supporting Rust targets directly, where a maintainer (kcc) mentioned deviating from the norm and not supporting coverage builds, etc. If we want to pursue oss-fuzz for fuzzing Rust targets directly we could engage with the team to see if we might be able to do something less than ideal to get started, or if they are planning to change the interface to support cargo fuzz/libfuzzer-sys fuzz targets. There are alternatives to oss-fuzz available as well.

This seemed like the right place to share and discuss this; if I am off-topic here just let me know (and please pardon me!).

fitzgen · 2019-12-18T00:45:23Z

Thanks for looking into this @jfoote!!

There are a couple projects already that use cargo in their build.sh so I suspect that we can make something work here.

builds should statically link oss-fuzz's version of libfuzzer (i.e. libFuzzingEngine) into the fuzz target

libfuzzer-sys uses a vendored copy of libfuzzer that is updated periodically by the maintainers

I think we can work around this in the build.sh via

export CUSTOM_LIBFUZZER_PATH="$LIB_FUZZING_ENGINE"

See https://github.com/rust-fuzz/libfuzzer-sys/blob/master/build.rs#L2 for details.

cargo fuzz already builds a standalone fuzz target binary as part of cargo fuzz run, but building does not appear to be exposed as a standalone step

Yep, we should fix this issue by adding a new build subcommand to cargo fuzz. In fact, it is something that's been asked for before: rust-fuzz/cargo-fuzz#175

Overall, for our next steps, I think it makes sense to

add a build subcommand to cargo fuzz (see above), and then
get a docker image, project.yaml, and build.sh set up for oss-fuzz that works but maybe doesn't exactly check all the boxes due to them not having a lot of Rust projects, and finally
open a PR to oss-fuzz with a disclaimer of what boxes aren't fully checked and why, opening the discussion up with them.

Sound like a plan?

I can take the first bullet point, and also continue working on the other bits mentioned in this issue. Can you take on the last two bullet points @jfoote?

jfoote · 2019-12-18T16:23:17Z

There are a couple projects already that use cargo in their build.sh so I suspect that we can make something work here.

At first blush my sense was these projects might be using cargo to build non-instrumented dependencies that are linked into the fuzz targets. I didn't dive into them though.

I think we can work around this in the build.sh via export CUSTOM_LIBFUZZER_PATH

Excellent, TIL.

Sound like a plan?

SGTM. Even if the compile/instrumentation flags are not passed as expected I think a basic PR will be a good way to get the conversation started with the oss-fuzz team.

Can you take on the last two bullet points @jfoote?

Sure thing. I am in a pre-US-holiday crunch right now so there might be a little delay, but I will get to this ASAP.

fitzgen · 2019-12-18T18:46:01Z

Great -- thanks! I don't think there is any giant rush here, so if this gets bumped to after the holidays, that seems 100% OK with me :)

fitzgen · 2019-12-20T18:53:03Z

Yep, we should fix this issue by adding a new build subcommand to cargo fuzz. In fact, it is something that's been asked for before: rust-fuzz/cargo-fuzz#175

This is done, and part of the new cargo fuzz 0.6.0 release.

jfoote · 2020-01-16T17:48:55Z

Quick update here: I was able to link the oss-fuzz build environment libfuzzer library (libFuzzingEngine.a) into the wasmtime/fuzz compile fuzz target after patching rust-fuzz/libfuzzer to select c++ std lib based on an env var. Executing the binary for a few seconds yields the expected results; it seems to work.

Building with asan (the default) is OK, but specifying sanitizer=memory yields a linking error. I fiddled with the bug a little and suspect an incompatibility in the instrumenting/linking used in libFuzzingEngine.a and what rustc/libfuzzer-sys are using, but I did not root-cause it.

The other sanitizer that oss-fuzz can optionally build with is ubsan, but it is not supported by our toolchain here at this time AFAIK.

My recommendation (and plan at this point, unless directed otherwise) is to ignore the sanitizer flag supplied by oss-fuzz, set the fuzz target configs to use only asan for good measure, and proceed to write a build script for the wasmtime/fuzz targets. I'll then make a PR to oss-fuzz after rust-fuzz/libfuzzer#56 is merged to get the conversation started.

Part of bytecodealliance#611

To support oss-fuzz PoC, see bytecodealliance#611

jfoote · 2020-01-20T16:50:28Z

Hello @fitzgen! I have the strawman PR for the wasmtime oss-fuzz integration staged. Before we move forward with that, can you take a look at the project acceptance PR diff (jfoote/oss-fuzz@06542db) and see if it looks OK to you?

Basically I set myself as the maintainer for now and added an email alias for you as well as [email protected]. Those addresses are used to get notifications when the fuzzers find something or the build breaks. Note that if aliases listed there have associated google accounts they will get access to the oss-fuzz dashboard and bug tracker. Should we add anyone else initially?

I have the strawman integration PR WIP staged here: jfoote/oss-fuzz@c1ae8ea
- Once Update libfuzzer-sys dependency version number #840 lands I'll change the wasmtime clone back to upstream
And here is a draft of the text I plan to include with the initial project acceptance PR once we have it settled (note for onlookers that I may delete this gist later/after we submit the PR)

fitzgen · 2020-01-21T17:25:34Z

@jfoote looks great! 👍 I left a couple comments on the draft text. Everything else looks ready to go!

jfoote · 2020-02-07T18:29:42Z

Quick update for posterity and onlookers: we've successfully integrated the wasmtime fuzz targets with oss-fuzz, with the caveats outlined in the comments and referenced PRs above. Thanks to @fitzgen and @alexcrichton for making this happen!

Fix bytecodealliance#611

Hyperion101010 · 2020-04-04T06:21:02Z

@fitzgen sir this was a gsoc2020 project idea, I worked in the application period and submitted a proposal. Given the time I had at I hand i wasn't able to get complete idea about the different vulnerabilities like ABI abstractions, Heap and Stack safety. I want to voluntarily contribute for the idea, but couldn't do the same before I clear out some doubts.
I would like to start understanding the fuzzing process more closely and contributing by writing fuzzers perhaps. During the application process I wrote mails for the project details, but I never got any reply which is completely fine given the situation we have now.
Is there any way we can do a conversation for the doubts I have, I see that there used to be a IRC channel for wasmtime one year ago, but now they migrated to Matrix which unfortunately doesn't has any such channel. If you are available on any channel of Mozilla/(other open source org) please let me know.
Good day!

bjorn3 · 2020-04-04T07:14:15Z

https://bytecodealliance.zulipchat.com/ is the primary discussion channel.

bjorn3 · 2021-02-03T20:43:56Z

I think this can be closed.

kubkon mentioned this issue Nov 21, 2019

testsuite now requires installing wasm32-wasi target #595

Closed

fitzgen added a commit to fitzgen/wasmtime that referenced this issue Nov 21, 2019

Split our existing fuzz targets into separate generators and oracles

ea50877

Part of bytecodealliance#611

fitzgen mentioned this issue Nov 21, 2019

Introduce the wasmtime-fuzzing crate #619

Merged

fitzgen added a commit to fitzgen/wasmtime that referenced this issue Nov 21, 2019

Split our existing fuzz targets into separate generators and oracles

6e3a8ce

Part of bytecodealliance#611

fitzgen added a commit to fitzgen/wasmtime that referenced this issue Nov 21, 2019

Split our existing fuzz targets into separate generators and oracles

58ba066

Part of bytecodealliance#611

fitzgen mentioned this issue Nov 27, 2019

Remove in-repo fuzz corpus #643

Merged

This was referenced Dec 3, 2019

fuzzing: Provide dummy imports for instantion oracle #660

Merged

Run our fuzz targets on our corpora in CI #662

Merged

fuzzing: Add initial API call fuzzer #685

Merged

jfoote mentioned this issue Jan 15, 2020

Add env var to select c++ std lib rust-fuzz/libfuzzer#56

Merged

fitzgen added a commit to fitzgen/wasmtime that referenced this issue Jan 17, 2020

Add initial differential fuzzing

8a49584

Part of bytecodealliance#611

fitzgen mentioned this issue Jan 17, 2020

Add initial differential fuzzing #833

Merged

fitzgen added a commit to fitzgen/wasmtime that referenced this issue Jan 18, 2020

Add initial differential fuzzing

1917eb5

Part of bytecodealliance#611

fitzgen added a commit to fitzgen/wasmtime that referenced this issue Jan 18, 2020

Add initial differential fuzzing

1bf8de3

Part of bytecodealliance#611

jfoote added a commit to jfoote/wasmtime that referenced this issue Jan 20, 2020

Update libfuzzer-sys dependency version number

47f8c1e

To support oss-fuzz PoC, see bytecodealliance#611

jfoote mentioned this issue Jan 20, 2020

Update libfuzzer-sys dependency version number #840

Merged

This was referenced Jan 23, 2020

Acceptance for wasmtime project and Rust fuzz targets google/oss-fuzz#3285

Merged

[wasmtime] initial integration google/oss-fuzz#3292

Merged

arkpar pushed a commit to paritytech/wasmtime that referenced this issue Mar 4, 2020

Fix cranelift_preopt panic

4ee2747

Fix bytecodealliance#611

cfallin closed this as completed Feb 3, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Initial Fuzzing Infrastructure #611

Initial Fuzzing Infrastructure #611

fitzgen commented Nov 21, 2019 •

edited

Loading

acfoltzer commented Nov 21, 2019

kubkon commented Nov 21, 2019

alexcrichton commented Nov 21, 2019

acfoltzer commented Nov 21, 2019

sunfishcode commented Nov 21, 2019

fitzgen commented Nov 21, 2019

jfoote commented Nov 22, 2019 •

edited

Loading

fitzgen commented Nov 25, 2019

pventuzelo commented Nov 26, 2019

fitzgen commented Nov 26, 2019

fitzgen commented Nov 27, 2019

pventuzelo commented Dec 3, 2019

fitzgen commented Dec 3, 2019

jfoote commented Dec 17, 2019 •

edited

Loading

fitzgen commented Dec 18, 2019 •

edited

Loading

jfoote commented Dec 18, 2019

fitzgen commented Dec 18, 2019

fitzgen commented Dec 20, 2019

jfoote commented Jan 16, 2020

jfoote commented Jan 20, 2020

fitzgen commented Jan 21, 2020

jfoote commented Feb 7, 2020

Hyperion101010 commented Apr 4, 2020

bjorn3 commented Apr 4, 2020

bjorn3 commented Feb 3, 2021

Initial Fuzzing Infrastructure #611

Initial Fuzzing Infrastructure #611

Comments

fitzgen commented Nov 21, 2019 • edited Loading

Goals

Strategy

Breadth not Depth

Decouple Generators and Oracles

Implementation

Fuzzing Wasmtime's Embedding API

Generators

Oracles

Wasm Execution Fuzzing

Generators

Oracles

More Stuff to Explore in the Future

Questions

acfoltzer commented Nov 21, 2019

kubkon commented Nov 21, 2019

alexcrichton commented Nov 21, 2019

acfoltzer commented Nov 21, 2019

sunfishcode commented Nov 21, 2019

fitzgen commented Nov 21, 2019

jfoote commented Nov 22, 2019 • edited Loading

fitzgen commented Nov 25, 2019

pventuzelo commented Nov 26, 2019

fitzgen commented Nov 26, 2019

fitzgen commented Nov 27, 2019

pventuzelo commented Dec 3, 2019

fitzgen commented Dec 3, 2019

jfoote commented Dec 17, 2019 • edited Loading

fitzgen commented Dec 18, 2019 • edited Loading

jfoote commented Dec 18, 2019

fitzgen commented Dec 18, 2019

fitzgen commented Dec 20, 2019

jfoote commented Jan 16, 2020

jfoote commented Jan 20, 2020

fitzgen commented Jan 21, 2020

jfoote commented Feb 7, 2020

Hyperion101010 commented Apr 4, 2020

bjorn3 commented Apr 4, 2020

bjorn3 commented Feb 3, 2021

fitzgen commented Nov 21, 2019 •

edited

Loading

jfoote commented Nov 22, 2019 •

edited

Loading

jfoote commented Dec 17, 2019 •

edited

Loading

fitzgen commented Dec 18, 2019 •

edited

Loading