Investigate deeper fuzzing #975

turbolent · 2021-06-03T15:31:06Z

Issue To Be Solved

We currently support fuzzing the parser and checker using go-fuzz. This helped during the development of the new parser last year.

Extend the fuzzing support to get deeper coverage and also support fuzzing the interpreter.

Currently, input is random, so the coverage for the checker is small, as only few inputs are valid syntactically. By providing syntactically, and further also semantically correct inputs, the coverage for the checker and interpreter can be improved.

Inspiration

https://blog.trailofbits.com/2021/03/23/a-year-in-the-life-of-a-compiler-fuzzing-campaign/

bluesign · 2021-06-04T09:17:28Z

Also native fuzzing for golang[0] can be useful I guess

[0] https://blog.golang.org/fuzz-beta

fxamacker · 2021-06-04T12:57:32Z

Maybe we can use https://github.com/thepudds/fzgo until Go's proposed fuzzer is released. It's basically a prototype of Go's proposed fuzzer but it's already being used by some non-hobby projects. I might use it to stress test storage PoC.

I'd love to have custom mutators as an option in go-fuzz. 😆 I'd also love for go-fuzz to display cover as a percent during fuzzing.

A slightly customized version of dvyukov/go-fuzz@fca3906 was used to test fxamacker/cbor v2.3.0 release last week and I wish go-fuzz had the missing features.

Curated Lists of Fuzzing Resources

turbolent · 2021-06-04T18:07:45Z

@bluesign wow, what are the odds! I had seen the GitHub issue tracking the development of that, and now they finally announced it just in time 🙂 We should definitely check that out!

@fxamacker Oh I hadn't seen this, thanks for sharing! Yes, we should also check that out and see how we can get good mutation for out use-case 😄 💯

robert-e-davidson3 · 2021-12-14T05:05:37Z

Discussed intent and scope with @turbolent and @j1010001. Here's the summary:

The goal of fuzzing is to quickly increase test coverage. Concretely this means fuzzing can run for a long time (say, a week) without finding any new bugs.

Our original fuzzing was quick and dirty: feed some bytes to the parser and fail on panics. The search space of this approach is so broad that it cannot run fast enough to provide much test coverage - almost every input it gives is syntactically invalid. This code lives in onflow/cadence and can be ignored.

We needed a grammar-based fuzzer to cut down the search space to just syntactically-valid inputs. The first idea was to write Cadence's grammar into the EBNF format so we could leverage existing tools. Unfortunately Cadence doesn't fit into EBNF cleanly enough to satisfy those tools.

Joe had a solution: modify the Cadence parser so that, at any point, it can give a list of valid tokens that can follow. This "parser-based fuzzer" is an implicit grammar-based fuzzer. That code lives in onflow/fuzzer as an old fork of onflow-cadence and was run on FuzzBuzz.io.

The next step is to evaluate the status of the onflow/fuzzer code and how it integrates with FuzzBuzz.io. Once that's established, rebase the latest Cadence code from onflow/cadence:master to onflow/fuzzer:master and make adjustments so the fuzzer still works.

That might be sufficient. If it isn't then we need to investigate alternatives such as go-fuzz or the built-in fuzzing support in Go 1.18. The approach here is to use mainnet contracts and transactions as the seed corpus. That will require maintenance because as the Cadence language is developed and so diverges from the deployed contracts.

If that isn't sufficient then we need to investigate writing and integrating our own mutation algorithm. Most likely this would merge both prior methods, mutating existing contracts and transactions randomly but based on the existing contracts.

j1010001 · 2021-12-14T19:57:04Z

Hey team! Please add your planning poker estimate with ZenHub @dsainati1 @robert-e-davidson3 @SupunS @turbolent

robert-e-davidson3 · 2021-12-15T01:27:13Z

Deprecating this ticket in favor of epic #1309.

turbolent added Feature Feedback labels Jun 3, 2021

turbolent assigned turbolent and unassigned turbolent Jun 3, 2021

turbolent added Improvement and removed Feature Feedback labels Jun 3, 2021

turbolent added the Epic label Jul 12, 2021

j1010001 assigned robert-e-davidson3 Dec 7, 2021

robert-e-davidson3 mentioned this issue Dec 15, 2021

Fuzzing Testing #1309

Closed

robert-e-davidson3 closed this as completed Dec 15, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Investigate deeper fuzzing #975

Investigate deeper fuzzing #975

turbolent commented Jun 3, 2021 •

edited

Loading

bluesign commented Jun 4, 2021

fxamacker commented Jun 4, 2021

turbolent commented Jun 4, 2021

robert-e-davidson3 commented Dec 14, 2021

j1010001 commented Dec 14, 2021

robert-e-davidson3 commented Dec 15, 2021

Investigate deeper fuzzing #975

Investigate deeper fuzzing #975

Comments

turbolent commented Jun 3, 2021 • edited Loading

Issue To Be Solved

Suggested Solution

Stages

Inspiration

bluesign commented Jun 4, 2021

fxamacker commented Jun 4, 2021

Curated Lists of Fuzzing Resources

turbolent commented Jun 4, 2021

robert-e-davidson3 commented Dec 14, 2021

j1010001 commented Dec 14, 2021

robert-e-davidson3 commented Dec 15, 2021

turbolent commented Jun 3, 2021 •

edited

Loading