feat(forge): isolated execution #7186

klkvr · 2024-02-20T10:52:26Z

Motivation

aka authentic execution described in #6910

The idea is to execute all test/script-level calls as separate transactions, each initialized with an empty journaled state and triggering all pre/post-transaction actions and clean-ups: charging for calldata and the base 21k gas cost, clearing transient storage, correctly processing selfdestructs, etc.

Solution

All calls at depth 1 are caught in InspectorStack and executed as follows:

1. Commit all changes from the journal to the db.
2. Create a separate EVMImpl with a TxEnv for the current call/create context.
3. Transact.
4. Collect the state changeset.
5. Merge the changeset into the existing journal.
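
The commit, transact, and merge flow can be sketched with toy types. This is only an illustration under simplifying assumptions: `Executor`, `State`, `transact_isolated`, and `transfer` are hypothetical names, state is reduced to an address-to-balance map, and none of this is the actual revm or `InspectorStack` API.

```rust
use std::collections::HashMap;

/// Toy account state: address -> balance. Stands in for the real db and journal.
type State = HashMap<u8, u64>;

/// Hypothetical stand-in for the executor; not foundry's `InspectorStack`.
#[derive(Default)]
struct Executor {
    /// Committed state (the "db").
    db: State,
    /// Pending changes of the outer, test-level execution (the "journal").
    journal: State,
}

impl Executor {
    /// Reads a balance, preferring pending journal entries over the db.
    fn balance(&self, addr: u8) -> u64 {
        self.journal
            .get(&addr)
            .or_else(|| self.db.get(&addr))
            .copied()
            .unwrap_or(0)
    }

    /// Runs `tx` as an isolated transaction and returns whether it succeeded.
    fn transact_isolated(&mut self, tx: impl FnOnce(&State) -> Option<State>) -> bool {
        // Step 1: commit all changes from the journal to the db.
        self.db.extend(self.journal.drain());

        // Steps 2-4: execute against the committed db only; a successful
        // transaction yields a changeset, a failed one yields nothing.
        match tx(&self.db) {
            Some(changeset) => {
                // Step 5: merge the changeset into the outer journal.
                self.journal.extend(changeset);
                true
            }
            None => false,
        }
    }
}

/// Example transaction: moves `amount` from account 1 to account 2.
fn transfer(db: &State, amount: u64) -> Option<State> {
    let from = db.get(&1).copied().unwrap_or(0);
    let to = db.get(&2).copied().unwrap_or(0);
    // Insufficient balance makes the transaction fail without a changeset.
    let remaining = from.checked_sub(amount)?;
    Some(HashMap::from([(1, remaining), (2, to + amount)]))
}
```

Calling `transact_isolated(|db| transfer(db, 30))` after putting a balance of 100 for account 1 into the journal first moves that entry into `db`, then leaves the post-transfer balances in the journal, which is why later reads in the outer context still observe the result.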

When we are transacting the inner EVM, the call depth decreases from 1 to 0, and because of that, I've added logic to adjust journaled_state.depth which inspectors receive if we are in inner EVM context.

revm will also call InspectorStack hooks for the second time for the call that's being delegated (first time in main context with depth 1, second time for inner context with depth 0), but we want it to be processed only in the main context.

Open questions

I've been testing it on several codebases with a lot of various tests and it seems to work, however, there are some issues with that approach (and ci will probably show more):

Performance. Earlier we could run some tests without ever requiring mutable access to the db, because everything was kept in the journal. Now we have to commit changes and then clone, which is more expensive. I've observed a 40-50% test execution time increase in some cases.
Pranks. Currently, prank cheatcodes make it possible to craft a test-level contract interaction where tx.origin != msg.sender. That is not possible when the call is a top-level transaction. Some foundry and forge-std tests break because of that.
Gas reports. Right now we collect gas usage data for function calls at all depths. With this impl, test-level calls will cost more because of the 21k base cost plus other additional costs, so we should probably only use the gas cost of those calls.
Committing. Right now my impl always commits changes in between calls and it seems to be working fine. But I am wondering whether we have any logic relying on the assumption that InspectorStack does not commit?
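
To make the "21k + other additional costs" concrete, here is a rough sketch of the intrinsic gas a plain call transaction pays before any opcode runs. It assumes the calldata pricing in force at the time of this PR (4 gas per zero byte, 16 per non-zero byte after EIP-2028) and ignores contract creation and access-list costs; `intrinsic_gas` is a hypothetical helper, not a foundry or revm function.

```rust
/// Base cost charged for every transaction.
const TX_BASE_GAS: u64 = 21_000;
/// Calldata cost per zero byte.
const ZERO_BYTE_GAS: u64 = 4;
/// Calldata cost per non-zero byte (since EIP-2028).
const NONZERO_BYTE_GAS: u64 = 16;

/// Gas charged up front for a plain call transaction with the given calldata.
/// Contract creation and access lists add further costs that are not modelled here.
fn intrinsic_gas(calldata: &[u8]) -> u64 {
    let zeros = calldata.iter().filter(|b| **b == 0).count() as u64;
    let nonzeros = calldata.len() as u64 - zeros;
    TX_BASE_GAS + zeros * ZERO_BYTE_GAS + nonzeros * NONZERO_BYTE_GAS
}
```

Under these assumptions a call with no calldata costs exactly 21,000 gas more in isolation than the same call executed inside another transaction, and every calldata byte adds to that difference.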

Philogy · 2024-02-21T03:49:42Z

This solution seems overly complicated, why clone and merge states? Wouldn't it be simpler to maintain two separate VMs: the test runner & the actual environment. Then you map calls from the test environment as transactions in the actual and vice versa map results/receipts as return values.

klkvr · 2024-02-21T13:50:15Z

@Philogy we'd have to clone VMs if we want to commit between calls

I agree that approach with test EVM + N other EVMs might feel more intuitive, but I don't think that it's less complicated

for example, if we wouldn't merge states, we'd have to figure out workarounds for correct processing of BALANCE, DELEGATECALL, EXTCODESIZE, etc opcodes on the level of test env as changes to actual EVM won't appear in its journal

atm this PR is a draft I'm experimenting with to find out all implicit assumptions we and users have about foundry EVM behavior. once that's figured out, this impl will probably change a lot and might become closer to the approach you've mentioned

roninjin10 · 2024-02-21T16:38:58Z

Thanks for kicking off this work klkvr. Commenting so I am subscribed

klkvr · 2024-02-21T23:01:21Z

testdata/repros/Issue3653.t.sol

@@ -11,13 +11,13 @@ contract Issue3653Test is DSTest {
    Token token;

    constructor() {
-        fork = vm.createSelectFork("rpcAlias", 10);
+        fork = vm.createSelectFork("rpcAlias", 1000000);


block 10 had gas limit of 5000 which is not enough to deploy a contract in isolated mode

klkvr · 2024-02-22T10:04:47Z

So right now CI passes except for 1 test from sablier v2, which I believe makes incorrect assumptions about the foundry EVM. On the latest nightly that call reverts with MemoryOOG, which I believe is unrelated to the gas limit. With isolated execution the call executes without halts and reaches a revert in the contract with a custom error type, causing a revert reason mismatch in expectRevert.

The current impl has several workarounds to make its behavior closer to how non-isolated EVM works now. I believe that it was important to create a PoC showing that we can switch to isolated mode without breaking changes, however, not sure if all semantics of non-isolated mode should be kept:

Nonces. When executing CALL as a separate transaction, it increases sender nonce, which does not happen when CALL is performed in scope of another transaction. This was breaking some external tests hardcoding addresses.
tx.origin. Right now tx.origin is patched to match the tx.origin which we had before starting isolated call. This is needed for tx.origin pranks to work correctly and for some external tests with hardcoded tx.origin assertions (optimism has some)
Gas limits. Not sure how we should treat tests using .transfer and .send, or simply a hardcoded .call{gas: XXX}(), which rely on the default 21K cost not being present. Right now the tx gas limit is manually increased by 21K so that all our tests pass, but this does not feel like a decent fix.

foundry/crates/evm/evm/src/inspectors/stack.rs

Line 440 in 9f10bae

data.env.tx.gas_limit = std::cmp::min(gas_limit + 21000, data.env.block.gas_limit.to());
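
The adjustment on that line can be restated as a small standalone function. This is a sketch on plain `u64` values rather than revm's env types; `isolated_tx_gas_limit` is a hypothetical name, and the saturating addition is my own choice to keep the toy version overflow-free.

```rust
/// Base transaction cost that the isolated call would otherwise have to pay
/// out of its own gas limit.
const TX_BASE_GAS: u64 = 21_000;

/// Gas limit for the isolated transaction: the call's own limit plus the base
/// cost, capped at the block gas limit.
fn isolated_tx_gas_limit(call_gas_limit: u64, block_gas_limit: u64) -> u64 {
    call_gas_limit
        .saturating_add(TX_BASE_GAS)
        .min(block_gas_limit)
}
```

For a call forwarding only the 2300 gas stipend, the isolated transaction gets a limit of 23,300, so the base cost does not eat the stipend. Once the sum exceeds the block gas limit, the cap takes over and the call effectively has less gas than it asked for.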

In general, the final impl of inspector call chain looks like this:

-> root call/create hook is called (depth 0)
-> inspectors are called
    -> test-level call/create hook is called (depth 1)
        -> inspectors are called
        -> journaled state is committed to db
        -> tx env is adjusted and transaction is initiated
            -> call/create hook is called for test-level call again, this time in scope of isolated tx (depth 0 is adjusted to 1)
                -> inspectors are not called, because we already called them for depth 1 outside of isolation
                -> tx.origin and sender_nonce are patched to match values outside of isolation
                    ... deeper calls are processed as usual, with only difference being depth adjustment
                -> call_end/create_end hook is called for test-level call
                    -> inspectors are not called, we simply skip this inspector invocation as we want to process it outside of isolation
        -> after transaction is completed, we merge new isolated journaled state into previous state and return tx result (Halt/Success/Revert) from the inspector's `call/create` hook
    -> test-level call_end/create_end hook is called (depth 1)
        -> here we notify inspectors about the end of the call, at this point we already have updated gas data received from call/create hook which executed tx in isolation
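
The depth adjustment and the skipped inspector invocation from the chain above can be modelled in a few lines. This is a toy model only: `Stack`, `in_inner_context`, `on_call`, and `isolate` are hypothetical names, and the real hooks receive revm data structures rather than bare depths.

```rust
/// Hypothetical model of the hook dispatch; not the actual `InspectorStack` API.
#[derive(Default)]
struct Stack {
    /// True while the isolated (inner) transaction is executing.
    in_inner_context: bool,
    /// Depths at which the wrapped inspectors were notified, in order.
    notified: Vec<u64>,
}

impl Stack {
    /// Depth as inspectors should see it: the inner EVM starts counting at 0,
    /// but its root call corresponds to depth 1 of the outer execution.
    fn adjusted_depth(&self, evm_depth: u64) -> u64 {
        if self.in_inner_context {
            evm_depth + 1
        } else {
            evm_depth
        }
    }

    /// `call`/`create` hook as invoked by the EVM at `evm_depth`.
    fn on_call(&mut self, evm_depth: u64) {
        if self.in_inner_context && evm_depth == 0 {
            // The delegated call itself: inspectors already saw it at depth 1
            // in the outer context, so this second invocation is skipped.
            return;
        }
        let depth = self.adjusted_depth(evm_depth);
        self.notified.push(depth);
    }

    /// Runs a test-level call in isolation. `inner_depths` lists the depths the
    /// inner EVM reports for its hooks, starting with 0 for the call itself.
    fn isolate(&mut self, inner_depths: &[u64]) {
        self.in_inner_context = true;
        for &depth in inner_depths {
            self.on_call(depth);
        }
        self.in_inner_context = false;
    }
}
```

With the root call at depth 0, a test-level call at depth 1, and an isolated run reporting depths `[0, 1, 2, 1]`, inspectors in this model are notified at depths 0, 1, 2, 3, 2, which is the sequence they would see without isolation.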
       

klkvr · 2024-02-22T10:07:45Z

I will do more experiments with optimism codebase as there are some tests that are failing, and I am still not sure why.

After that, I plan to make this an opt-in, look into performance and gas metering

upd: seems like the failing optimism tests are caused by hardcoded gas limits

klkvr · 2024-02-22T17:39:02Z

Performance

[Screenshots: test run timings on latest nightly vs. this PR for snekmate, solady, and optimism (with failing tests excluded)]

So we are looking at a 20-30% execution time increase.

Gas reports

I've made isolated execution opt-in; currently it is enabled only when running tests via --gas-report. Also, when constructing the gas report we now only keep traces at depth 1, so that deeper calls, which are executed in a non-isolated manner, do not affect the report.

klkvr · 2024-02-23T12:43:30Z

I've added a couple tests for tstore/tload and selfdestructs which are only passing with isolation mode.

It's now possible to enable isolation either by passing the --isolate flag to forge test or forge script, or by setting isolate = true in the config.

Isolation is enabled automatically when gas report is requested for tests via --gas-report

mattsse

only have a few pedantic nits,

imo this is still fairly readable

ptal @DaniPopes

crates/common/src/evm.rs

crates/evm/evm/src/inspectors/stack.rs

onbjerg

lgtm

gakonst · 2024-02-27T23:40:29Z

crates/cheatcodes/src/inspector.rs

+                            data.journaled_state.state().get_mut(&broadcast.new_origin).unwrap();
+
+                        account.info.nonce += 1;
+                    }


what is this change for?

currently we are increasing nonce before broadcasting CALL to simulate nonce increase during on-chain transaction

this is incorrect for --isolate because we need the nonce to be up to date at the point when we are creating a transaction

so the change is to increase nonce after the CALL

however, thinking of it now, with the new workaround where we explicitly decrease nonces in isolation, this is not really needed as long as we touch the account when we pre-increase its nonce, updated in 33814e5

yeah can we also doc that

gakonst · 2024-02-27T23:54:36Z

crates/forge/src/gas_report.rs

+        // Only include top-level calls which account for calldata and base (21.000) cost.
+        // Only include Calls and Creates as only these calls are isolated in inspector.
+        if trace.depth != 1 &&
+            (trace.kind == CallKind::Call ||
+                trace.kind == CallKind::Create ||
+                trace.kind == CallKind::Create2)
+        {
+            return;
+        }
+
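
For readers skimming the diff, the early-return condition can be restated as a standalone predicate. This is a sketch with toy types: `Trace`, the reduced `CallKind` enum, and `included_in_gas_report` are hypothetical stand-ins, and the predicate only mirrors the condition exactly as quoted above.

```rust
/// Reduced stand-in for the call kinds that appear in traces.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum CallKind {
    Call,
    StaticCall,
    DelegateCall,
    Create,
    Create2,
}

/// Minimal trace node: only the fields the filter looks at.
struct Trace {
    depth: u64,
    kind: CallKind,
}

/// Returns `false` exactly when the quoted condition would `return` early,
/// that is, for Call/Create/Create2 traces that are not at depth 1.
fn included_in_gas_report(trace: &Trace) -> bool {
    let isolated_kind = matches!(
        trace.kind,
        CallKind::Call | CallKind::Create | CallKind::Create2
    );
    !(trace.depth != 1 && isolated_kind)
}
```

Read literally, the condition drops calls and creates below the test level, while other call kinds are not affected by this particular check.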


gakonst · 2024-02-28T00:03:00Z

crates/evm/evm/src/inspectors/stack.rs

+    fn transact_inner<DB: DatabaseExt + DatabaseCommit>(
+        &mut self,
+        data: &mut EVMData<'_, DB>,
+        transact_to: TransactTo,
+        caller: Address,
+        input: Bytes,
+        gas_limit: u64,
+        value: U256,
+    ) -> (InstructionResult, Option<Address>, Gas, Bytes) {


would love if we unit tested this with a simple regression test, but can do in follow up

we don't really run any tests for isolation rn besides two I've added for selfdestruct/tstore, not sure how we can address this without running all tests for both modes

Should we add a flag to run these on CI perhaps every 24hrs on --isolate? Something like that?

yeah, that could work, we'd have to filter some tests out for such runs because we have some fixtures with hardcoded gas usage which doesn't match for --isolate and also that one failing sablier-v2 test

frontier159 · 2024-02-29T23:14:11Z

@klkvr When running tests with --gas-report, I think I've hit a 30mm gas limit in setUp(), which I wasn't before:

    ├─ [30000000] → new <unknown>@0x5615dEB798BB3E4dFa0139dFa1b3D433Cc23b72f
    │   └─ ← 0 bytes of code

There are a bunch of contracts being forked and setup

Adding forge test --gas-report --block-gas-limit 10000000000000000000 didn't help.

Any ideas?

klkvr · 2024-03-01T00:03:14Z

So this can be reproduced with the following test:

import "forge-std/Test.sol";

contract C is Test {}

contract GasWaster {
    function waste() public {
        for (uint256 i = 0; i < 100; i++) {
            new C();
        }
    }
}

contract GasLimitTest is Test {
    function test() public {
        vm.createSelectFork("mainnet");
        
        GasWaster waster = new GasWaster();
        waster.waste();
    }
}

What's happening here is that we are setting the block gas limit to the real gas limit of the forked chain:

foundry/crates/evm/core/src/fork/init.rs

Line 68 in 4a91072

gas_limit: block.header.gas_limit,

However, without isolation we are never really validating gas usage because such checks are performed in revm::EVMImpl::preverify_transaction_inner.

With isolation, we are capping transaction gas limit at block gas limit value to not hit CallerGasLimitMoreThanBlock:

foundry/crates/evm/evm/src/inspectors/stack.rs

Line 472 in 4a91072

    
data.env.tx.gas_limit = std::cmp::min(gas_limit + 21000, data.env.block.gas_limit.to());

IMO, revert here is a correct behavior, that one failing sablier v2 test tries to test exactly this (tx exceeding block gas limit)

However, users might need to disable those checks because it's pretty easy to go beyond 30M with various utility testing contracts.

I can see 3 approaches for this:

1. Expose CfgEnv::disable_block_gas_limit flag to allow disabling block gas limit checks.
2. Set CfgEnv::disable_block_gas_limit to true by default to fully avoid any reverts due to block gas limit violations. Allow enabling those checks for tests which rely on that via cheatcode/config.
3. Always override block gas limit with --block-gas-limit value, if provided.

All approaches are pretty similar, it will be easier to decide once we figure out if we want this to become a breaking change or keep backwards compatibility
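
What the first two approaches would toggle can be sketched as a single check. The types below are toy stand-ins: `Cfg` and `validate_gas_limit` are hypothetical, and only the names `disable_block_gas_limit` and `CallerGasLimitMoreThanBlock` are taken from the discussion above; the real validation in revm covers much more than this.

```rust
/// Toy version of the validation error mentioned above.
#[derive(Debug, PartialEq, Eq)]
enum InvalidTransaction {
    CallerGasLimitMoreThanBlock,
}

/// Toy config carrying only the flag under discussion.
struct Cfg {
    disable_block_gas_limit: bool,
}

/// Rejects a transaction whose gas limit exceeds the block gas limit,
/// unless the check is disabled in the config.
fn validate_gas_limit(
    tx_gas_limit: u64,
    block_gas_limit: u64,
    cfg: &Cfg,
) -> Result<(), InvalidTransaction> {
    if !cfg.disable_block_gas_limit && tx_gas_limit > block_gas_limit {
        return Err(InvalidTransaction::CallerGasLimitMoreThanBlock);
    }
    Ok(())
}
```

Approaches 1 and 2 differ only in the default value of the flag. Approach 3 leaves the check in place and instead changes the `block_gas_limit` it compares against.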

@mattsse wdyt?

frontier159 · 2024-03-01T00:10:07Z

Thanks for the simple reproduction. imo this is already a breaking change in master since isolated-execution is on by default with --gas-report (this broke some other tests in my repo where I wanted to explicitly check gaslimit() was under a threshold)

Evalir · 2024-03-01T00:15:36Z

fwiw @frontier159 foundry is not stable and breaking changes can happen, although we try hard not to rip the bandaid off unless necessary. But we do understand the frustration 😄

I think adding a flag to disable the block gas limit here might be the move @klkvr. I'm also for the opposite (disable it by default, enable with flag) if we want to "remove" the breaking change.

frontier159 · 2024-03-01T00:17:00Z

Allg - I enjoy living my namesake and on the Frontier. Appreciate the work you kings are doing here

* [wip] feat(forge): isolated execution * small fixes * don't panic on transaction error + fixture fix * stricter call scheme check * refactor and more fixes * wip * fix * wip * wip * rm cheatcodes check * clippy * update commit logic * opt-in * enable in gas reports * --isolate * isolation tests * smaller diff * fmt * simplify logic * docs * fmt * enable isolation properly for --gas-report * change nonce incrementing * document why we touch

[wip] feat(forge): isolated execution

f771edc

klkvr requested review from Evalir, mattsse and DaniPopes as code owners February 20, 2024 10:52

klkvr marked this pull request as draft February 20, 2024 10:52

klkvr added 4 commits February 20, 2024 15:09

small fixes

3bf3b3c

don't panic on transaction error + fixture fix

12ae30b

stricter call scheme check

2f94204

refactor and more fixes

759a721

klkvr added 3 commits February 21, 2024 18:32

wip

75899e6

Merge branch 'master' into klkvr/isolated-execution

b59b450

fix

2633134

klkvr added 2 commits February 21, 2024 23:59

wip

986cd10

wip

9f10bae

rm cheatcodes check

408fb7f

clippy

f8f5107

klkvr added 3 commits February 22, 2024 18:50

update commit logic

130b298

opt-in

934fb03

enable in gas reports

af052a5

klkvr added 3 commits February 23, 2024 14:06

--isolate

fb64f4f

isolation tests

2258409

Merge branch 'master' into klkvr/isolated-execution

129470c

klkvr changed the title ~~[wip] feat(forge): isolated execution~~ feat(forge): isolated execution Feb 23, 2024

klkvr added 2 commits February 23, 2024 16:57

smaller diff

87032a0

fmt

71a4d74

mattsse requested changes Feb 26, 2024

klkvr added 2 commits February 26, 2024 17:29

simplify logic

5c4a191

docs

313a57f

klkvr requested a review from mattsse February 26, 2024 15:03

fmt

1869209

onbjerg approved these changes Feb 27, 2024

enable isolation properly for --gas-report

18acadd

gakonst approved these changes Feb 28, 2024

klkvr added 2 commits February 28, 2024 04:51

change nonce incrementing

33814e5

document why we touch

96b9688

gakonst merged commit 551bcb5 into foundry-rs:master Feb 28, 2024
19 checks passed

holic mentioned this pull request Feb 28, 2024

feat(gas-report): run gas report with --isolate latticexyz/mud#2331

Merged

frontier159 mentioned this pull request Feb 28, 2024

fix: forge gas tests after upstream update to accuracy TempleDAO/temple#975

Merged

klkvr mentioned this pull request Mar 1, 2024

feat(forge): --disable-block-gas-limit flag #7287

Merged

alex0207s mentioned this pull request Mar 11, 2024

fix: activate isolate mode in toml consenlabs/tokenlon-contracts#305

Merged

cruzdanilo mentioned this pull request Apr 5, 2024

Can the team create a cheat code to test sending multiple transactions from an EOA in a single test function? #7571

Closed

This was referenced Apr 26, 2024

Reset to upstream phylaxsystems/phoundry#12

Merged

feat/forge bin as lib phylaxsystems/phoundry#13

Merged

This was referenced Jun 26, 2024

Gas cost discrepancies when testing calls through a proxy #2503

Closed

feat: vm.finalize #2844

Closed

JoseMiguelHerrera mentioned this pull request Jun 20, 2024

docs: document best practices of measuring gas when using transient storage foundry-rs/book#1289

Open

2 tasks
