adr: draft ADR for stateful precompiled contracts #1131

yihuang · 2022-06-16T09:59:25Z

ref: #1116

Description

Example implementation: https://github.com/yihuang/ethermint/tree/precompiled

IBC example:

ref: evmos#1116

docs/architecture/adr-003-stateful-precompiles.md

facs95 · 2022-09-01T21:24:13Z

This is awesome! But it collides a bit with the design proposed in #1272, which is a more modular proposition. We should try to decide on one model and work together towards it. If we agree the other proposal is the way to go, we could either close this or adapt it to meet the modular design. Open for discussion 🙏

yihuang · 2022-09-02T01:12:37Z

> This is awesome! But it collides a bit with the design proposed in #1272, which is a more modular proposition. We should try to decide on one model and work together towards it. If we agree the other proposal is the way to go, we could either close this or adapt it to meet the modular design. Open for discussion 🙏

I think the difference is mainly stateful vs stateless, we can rework this one after the stateless part is done.

loredanacirstea · 2022-09-15T08:17:40Z

This draft ADR is similar to the work I did back in February 2022:

go-ethereum

loredanacirstea/go-ethereum@abd62c3

1. Changes for making precompiles dynamic:

add a getPrecompiles(chainRules params.Rules) map[common.Address]PrecompiledContract function, used to instantiate the precompiles in NewEVM

2. Changes for making precompiles stateful:

add evm *EVM and caller ContractRef information when running a precompile - see RunPrecompiledContract

Changes 1 and 2 were mostly isolated in loredanacirstea/go-ethereum@abd62c3.
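
The two go-ethereum changes listed above can be sketched in Go as follows. This is only an illustration under stated assumptions: Address, Rules, CallContext, StatefulPrecompile, counter and runPrecompile are hypothetical stand-ins defined locally, not the types or signatures from go-ethereum or from the linked commits. It shows a precompile set built from chain rules (dynamic) and a Run method that receives caller and state information (stateful).

```go
package main

import (
	"errors"
	"fmt"
)

// Address stands in for go-ethereum's common.Address (20 bytes).
type Address [20]byte

// Rules stands in for params.Rules; only one hypothetical fork flag is modelled.
type Rules struct{ IsCustomFork bool }

// CallContext is a hypothetical bundle of what a stateful precompile receives
// besides its input: the caller and a handle on mutable state.
type CallContext struct {
	Caller Address
	State  map[string][]byte
}

// StatefulPrecompile is a hypothetical interface (not go-ethereum's
// PrecompiledContract): Run receives the call context as well as the input.
type StatefulPrecompile interface {
	RequiredGas(input []byte) uint64
	Run(ctx *CallContext, input []byte) ([]byte, error)
}

// counter is a toy precompile that stores a one-byte call count per caller.
type counter struct{}

func (counter) RequiredGas(input []byte) uint64 { return 100 }

func (counter) Run(ctx *CallContext, input []byte) ([]byte, error) {
	key := fmt.Sprintf("count/%x", ctx.Caller)
	n := byte(0)
	if v, ok := ctx.State[key]; ok && len(v) > 0 {
		n = v[0]
	}
	ctx.State[key] = []byte{n + 1}
	return ctx.State[key], nil
}

// getPrecompiles builds the precompile set from the chain rules instead of
// reading it from a fixed global table.
func getPrecompiles(rules Rules) map[Address]StatefulPrecompile {
	m := map[Address]StatefulPrecompile{}
	if rules.IsCustomFork {
		m[Address{19: 0x64}] = counter{} // address 0x...64 is an arbitrary choice
	}
	return m
}

// ErrOutOfGas is returned when the supplied gas does not cover RequiredGas.
var ErrOutOfGas = errors.New("out of gas")

// runPrecompile charges gas, then runs the precompile with its call context.
func runPrecompile(p StatefulPrecompile, ctx *CallContext, input []byte, gas uint64) ([]byte, uint64, error) {
	cost := p.RequiredGas(input)
	if gas < cost {
		return nil, 0, ErrOutOfGas
	}
	out, err := p.Run(ctx, input)
	return out, gas - cost, err
}
```

Calling runPrecompile twice with the same CallContext returns 1 and then 2 for the same caller, which is behaviour a precompile that only sees its input bytes cannot provide.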

Other changes for advanced precompiles that are original work

CallWithState, RunWithInitialState - for the interpreter precompile
giving access to the internal chain state, for introspection precompile

ethermint

Then, ethermint could just use the NewEVM constructor with custom precompiles: loredanacirstea@5a8c3a7

But I think @fedekunze's approach is superior because it does not require go-ethereum changes. So, we don't need to expend effort in supporting a geth fork or in convincing the geth team to change their source.

yihuang · 2022-09-15T08:21:25Z

> This draft ADR is similar to the work I did back in February 2022:

Yeah, it's directly inspired by your work. The main new development is making stateful precompiles work with statedb snapshot and revert.

loredanacirstea · 2022-09-15T08:30:14Z

@yihuang We should define what stateless and stateful precompiles are. Because I see several categories:

Stateless - as in they do not need to keep any precompile-specific state, nor do they need other state than what they receive as direct input. The equivalent of pure functions in Solidity.
State-viewing - precompiles do not have their own state, but they use internal state (e.g. introspection precompile, where you can get info about transactions/blocks/events). E.g. view functions in Solidity
Stateful - precompiles that need to keep their own internal state (even if temporary)
State-changing - precompiles that change the global state (EVM sstore, etc.). E.g. mutable/non-payable/payable functions in Solidity
[update] transaction initiator - precompiles that initiate transactions (e.g. EVM transactions through the account abstraction precompile that I developed)

yihuang · 2022-09-15T08:36:25Z

> @yihuang We should define what stateless and stateful precompiles are. Because I see several categories:
>
> Stateless - as in they do not need to keep any precompile-specific state, nor do they need other state than what they receive as direct input. The equivalent of pure functions in Solidity.
>
> State-viewing - precompiles do not have their own state, but they use internal state (e.g. introspection precompile, where you can get info about transactions/blocks/events). E.g. view functions in Solidity
>
> Stateful - precompiles that need to keep their own internal state (even if temporary)
>
> State-changing - precompiles that change the global state (EVM sstore, etc.). E.g. mutable/non-payable/payable functions in Solidity

Hmm, what I mean here is that the contract modifies the global state, specifically the cosmos-sdk state, for example bank transfers or sending IBC packets.

itsdevbear · 2022-09-22T16:18:59Z

@yihuang @loredanacirstea All of the above makes sense. I've been doing some research into getting state-changing precompiles working and am ultimately realizing that the StateDB is the problem.

IMO we need to find a way to make it so that an EVM txn reverting reverts the Cosmos txn as well. The design decision to make a reverting EVM transaction a successful cosmos transaction is a huge antipattern in my opinion, and is the root of the StateDB commit() revert() issue.

yihuang · 2022-09-22T16:22:09Z

> @yihuang @loredanacirstea All of the above makes sense. I've been doing some research into getting state-changing precompiles working and am ultimately realizing that the StateDB is the problem.
>
> IMO we need to find a way to make it so that an EVM txn reverting reverts the Cosmos txn as well. The design decision to make a reverting EVM transaction a successful cosmos transaction is a huge antipattern in my opinion, and is the root of the StateDB commit() revert() issue.

This ADR proposes a solution to this problem, with examples.

itsdevbear · 2022-09-22T16:53:12Z

@yihuang correct. But the issue is that we can't call arbitrary module logic in the way that we can pass CosmosMsg's from the CosmWasm VM to the module directly. Going this route would require duplicating logic that is possibly already defined in the module's keeper.

yihuang · 2022-09-22T17:01:51Z

> @yihuang correct. But the issue is that we can't call arbitrary module logic in the way that we can pass CosmosMsg's from the CosmWasm VM to the module directly. Going this route would require duplicating logic that is possibly already defined in the module's keeper.

Yes, we have to carefully manage temporary state in memory.
cosmos-sdk does have a CacheContext, but I don't think it plays well with StateDB, and deeply nested CacheContexts perform very badly.

itsdevbear · 2022-09-22T17:03:13Z

Seems like bad developer UX, there has to be a way to get it to work like how CosmWasm works. That should be the end goal for Stateful Precompiles IMO.

itsdevbear · 2022-09-22T17:03:45Z

Having an EthTx revert, revert the whole Cosmos Txn would resolve this issue, no?

yihuang · 2022-09-22T17:05:32Z

> Seems like bad developer UX, there has to be a way to get it to work like how CosmWasm works. That should be the end goal for Stateful Precompiles IMO.

I think CosmWasm calls native functionality by passing async messages, similar to how we currently do the log hooks.

yihuang · 2022-09-22T17:06:26Z

> Having an EthTx revert, revert the whole Cosmos Txn would resolve this issue, no?

The exception revert can happen in a nested way.

itsdevbear · 2022-09-22T17:09:23Z

Needless to say, I think the approach should be to attempt to get the ability to call Cosmos SDK modules synchronously from the EVM. Having to duplicate all the logic seems like a massive way to introduce issues.

dreamer-zq · 2022-10-31T02:21:00Z

> @yihuang @dreamer-zq I've proposed an approach to use CacheContext and still handle nested reverts. Essentially, by implementing the cache contexts in the statedb journals, cache contexts can follow the snapshot/revert logic properly, even in nested contract calls/reverts! Please take a look at this doc, which provides further explanation on this approach.
>
> A first attempt Proof of Concept implementation can be found in this PR, notable files include: x/evm/statedb/interfaces.go, x/evm/statedb/statedb.go, x/evm/vm/geth/geth.go, and x/evm/types/precompiles.go. Note: this PR works on the latest version of geth without modifications, but requires shadowing many methods.

@calbera I have two questions:

1. evm: When executing an opcode, the evm object injected into EVMInterpreter is still the go-ethereum implementation, so how does execution jump to the Call method implemented in ethermint? My understanding is that the opcode's execution logic is only overridden if the evm implemented in ethermint is injected into the EVMInterpreter. I may have misunderstood the execution chain of EVMInterpreter.
2. statedb: When executing the commit method, why is only the last ExtJournalEntry committed? Why doesn't the state generated in between need to be committed manually?

itsdevbear · 2022-10-31T13:10:59Z

@yihuang personally I'm not against a very mild fork with an extremely small diff. Though I know @fedekunze is quite against.

Why would we need nonce?

An invariant that is checked post call to ensure nonce hasn't been altered by a native call seems reasonable.
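
A post-call nonce invariant of the kind suggested here could look roughly like the sketch below. Account and callNativeChecked are hypothetical names chosen for illustration; a real implementation would presumably also revert the call's state changes when the check fails, which this sketch leaves out.

```go
package main

import "fmt"

// Account is a stand-in for the EVM account fields relevant to the check.
type Account struct {
	Nonce   uint64
	Balance uint64
}

// callNativeChecked runs a native call and then verifies that the account
// nonce is unchanged. The native callback and its signature are hypothetical.
func callNativeChecked(acc *Account, native func(*Account) error) error {
	before := acc.Nonce
	if err := native(acc); err != nil {
		return err
	}
	if acc.Nonce != before {
		return fmt.Errorf("nonce invariant violated: %d -> %d", before, acc.Nonce)
	}
	return nil
}
```

A native call that only changes the balance passes the check, while one that increments the nonce returns an error.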

calbera · 2022-10-31T14:58:45Z

> > @yihuang @dreamer-zq I've proposed an approach to use CacheContext and still handle nested reverts. Essentially, by implementing the cache contexts in the statedb journals, cache contexts can follow the snapshot/revert logic properly, even in nested contract calls/reverts! Please take a look at this doc, which provides further explanation on this approach.
> > A first attempt Proof of Concept implementation can be found in this PR, notable files include: x/evm/statedb/interfaces.go, x/evm/statedb/statedb.go, x/evm/vm/geth/geth.go, and x/evm/types/precompiles.go. Note: this PR works on the latest version of geth without modifications, but requires shadowing many methods.
>
> @calbera I have two questions:
>
> 1. evm: When executing an opcode, the evm object injected into EVMInterpreter is still the go-ethereum implementation, so how does execution jump to the Call method implemented in ethermint? My understanding is that the opcode's execution logic is only overridden if the evm implemented in ethermint is injected into the EVMInterpreter. I may have misunderstood the execution chain of EVMInterpreter.
>
> 2. statedb: When executing the commit method, why is only the last ExtJournalEntry committed? Why doesn't the state generated in between need to be committed manually?

That's right. The EVMInterpreter currently comes from go-ethereum, so it will not call the modified version of Call. This will require a significant amount of shadowing, which includes rewriting the opcode functions (opCall, opCallCode, etc.) and the jump table to point to these new functions.
Essentially, each ExtJournalEntry holds a cache context. A subsequent ExtJournalEntry adds state by simply appending to the previous cache context. So in effect each cache context holds all state changes before it plus the current state changes, and therefore only the last cache context needs to be committed.
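
The journal idea described above can be sketched as follows. This is a simplified model, not the code from the linked PR: plain Go maps stand in for Cosmos SDK cache contexts, each entry copies the previous layer instead of wrapping it, and store, extJournalEntry, journalDB and NativeWrite are names chosen for illustration only. It shows why reverting is just truncating the journal and why committing the last entry is enough in this model.

```go
package main

// store is a stand-in for the committed Cosmos SDK state.
type store map[string]string

// extJournalEntry is a hypothetical journal entry carrying a cache layer.
// Here a layer is a full copy of all pending writes made so far.
type extJournalEntry struct {
	pending store
}

// journalDB pairs the committed store with a journal of cache layers.
type journalDB struct {
	base    store
	journal []extJournalEntry
}

// top returns the pending writes of the newest entry, or nil if none exist.
func (db *journalDB) top() store {
	if len(db.journal) == 0 {
		return nil
	}
	return db.journal[len(db.journal)-1].pending
}

// NativeWrite records a write made by native code as a new journal entry
// whose layer extends the previous one.
func (db *journalDB) NativeWrite(key, value string) {
	next := store{}
	for k, v := range db.top() {
		next[k] = v
	}
	next[key] = value
	db.journal = append(db.journal, extJournalEntry{pending: next})
}

// Get reads through the newest layer first, then the committed store.
func (db *journalDB) Get(key string) string {
	if v, ok := db.top()[key]; ok {
		return v
	}
	return db.base[key]
}

// Snapshot returns a revision id, which is the current journal length.
func (db *journalDB) Snapshot() int { return len(db.journal) }

// RevertToSnapshot drops every entry recorded after the snapshot was taken.
func (db *journalDB) RevertToSnapshot(id int) { db.journal = db.journal[:id] }

// Commit flushes only the newest entry, which already contains every
// earlier pending write, and then clears the journal.
func (db *journalDB) Commit() {
	for k, v := range db.top() {
		db.base[k] = v
	}
	db.journal = nil
}
```

Copying the whole layer on every write is a deliberate simplification that keeps the sketch short; it says nothing about how the actual proposal manages memory or performance.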

yihuang · 2022-11-01T01:46:05Z

> Essentially, each ExtJournalEntry holds a cache context. A subsequent ExtJournalEntry will add state by simply appending to the previous cache context. So in effect each cache context holds all state changes before it and the current state changes, and therefore only the last cache context needs to be committed.

But you'll need to hold the whole stack of cache contexts to be able to commit all the way back to the top level; a single commit only commits to the layer directly above.
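
The concern about stacked cache contexts can be illustrated with a small model. Assumptions: cacheLayer, newLayer and commitStack are hypothetical names, and a map stands in for a Cosmos SDK cache store. In this model Write flushes a layer only into its direct parent, so reaching the top level means writing every layer in turn, and a read that misses walks the whole parent chain.

```go
package main

// cacheLayer is a stand-in for one level of a nested cache store: it buffers
// writes and falls back to its parent on reads.
type cacheLayer struct {
	parent *cacheLayer
	data   map[string]string
}

// newLayer creates an empty layer on top of parent (nil for the root).
func newLayer(parent *cacheLayer) *cacheLayer {
	return &cacheLayer{parent: parent, data: map[string]string{}}
}

// Set buffers a write in this layer only.
func (l *cacheLayer) Set(k, v string) { l.data[k] = v }

// Get walks up the parent chain and also reports how many layers it
// visited, which makes the cost of deep nesting visible.
func (l *cacheLayer) Get(k string) (value string, hops int) {
	for cur := l; cur != nil; cur = cur.parent {
		hops++
		if v, ok := cur.data[k]; ok {
			return v, hops
		}
	}
	return "", hops
}

// Write flushes the buffered writes into the direct parent only.
func (l *cacheLayer) Write() {
	if l.parent == nil {
		return
	}
	for k, v := range l.data {
		l.parent.data[k] = v
	}
	l.data = map[string]string{}
}

// commitStack writes every layer into its parent, innermost first, so the
// changes end up in the root layer.
func commitStack(innermost *cacheLayer) {
	for cur := innermost; cur != nil && cur.parent != nil; cur = cur.parent {
		cur.Write()
	}
}
```

With three layers stacked on a root, a single Write on the innermost layer leaves the root untouched, and a read of a root key from the innermost layer visits four layers.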

dreamer-zq · 2022-11-01T01:46:28Z

> > > @yihuang @dreamer-zq I've proposed an approach to use CacheContext and still handle nested reverts. Essentially, by implementing the cache contexts in the statedb journals, cache contexts can follow the snapshot/revert logic properly, even in nested contract calls/reverts! Please take a look at this doc, which provides further explanation on this approach.
> > > A first attempt Proof of Concept implementation can be found in this PR, notable files include: x/evm/statedb/interfaces.go, x/evm/statedb/statedb.go, x/evm/vm/geth/geth.go, and x/evm/types/precompiles.go. Note: this PR works on the latest version of geth without modifications, but requires shadowing many methods.
> >
> > @calbera I have two questions:
> >
> > 1. evm: When executing an opcode, the evm object injected into EVMInterpreter is still the go-ethereum implementation, so how does execution jump to the Call method implemented in ethermint? My understanding is that the opcode's execution logic is only overridden if the evm implemented in ethermint is injected into the EVMInterpreter. I may have misunderstood the execution chain of EVMInterpreter.
> >
> > 2. statedb: When executing the commit method, why is only the last ExtJournalEntry committed? Why doesn't the state generated in between need to be committed manually?
>
> That's right. The EVMInterpreter currently comes from go-ethereum, so it will not call the modified version of Call. This will require a significant amount of shadowing, which includes rewriting the opcode functions (opCall, opCallCode, etc.) and the jump table to point to these new functions.
>
> Essentially, each ExtJournalEntry holds a cache context. A subsequent ExtJournalEntry adds state by simply appending to the previous cache context. So in effect each cache context holds all state changes before it plus the current state changes, and therefore only the last cache context needs to be committed.

Thank you for your reply

loredanacirstea · 2022-11-05T22:00:08Z

I added Cosmos sdk.Context and vm.EVM as context to precompiles: #1433
And if this PR ethereum/go-ethereum#26119 is merged, it will also provide current call/subcall context.

itsdevbear · 2022-11-06T00:22:17Z

@loredanacirstea very similar to the arch @calbera and I are starting to come to, this is awesome.

The geth PR is sweet too. One issue we faced when getting this going without a fork was that we ended up having to pull much of the EVM out into ethermint to shadow it.

loredanacirstea · 2022-11-06T09:51:08Z

@itsdevbear
Yes, I know. This is based on work I did back in February and I returned to simplify it.
Would be good to thumbs up the go-ethereum PR.

yihuang · 2022-11-16T03:32:57Z

cosmos/cosmos-sdk#13881

If we fix the slowness of nested cache context, we might reconsider the nested cache context solution, which would be perfect for precompiles.

dreamer-zq · 2022-12-01T03:12:21Z

@loredanacirstea I was thinking: what if we customize an opcode, define the evm in the interpreter as an interface, and then suggest that the Ethereum team export the interpreter as a public variable? That way the modification might be smaller and easier for them to accept. Of course, this is just a simple idea; there may be things I haven't considered.

loredanacirstea · 2022-12-10T11:01:31Z

I have a full working quasar precompile (cosmos sdk messages & queries from inside the EVM). This means stateful precompiles, solved call context nesting, efficient state sync between EVM StateDB & Cosmos context, gas metering of the mixed context! It is publicly verifiable on the Mythos chain. And demoed here, with more details: https://youtu.be/COu5Olszhtg. If you want to see the example contract used for testing, see min 7:09.

yihuang · 2022-12-10T13:22:50Z

> I have a full working quasar precompile (cosmos sdk messages & queries from inside the EVM). This means stateful precompiles, solved call context nesting, efficient state sync between EVM StateDB & Cosmos context, gas metering of the mixed context! It is publicly verifiable on the Mythos chain. And demoed here, with more details: https://youtu.be/COu5Olszhtg. If you want to see the example contract used for testing, see min 7:09.

I'm curious how you do the context nesting and statedb thing.

loredanacirstea · 2022-12-10T19:02:29Z

> I'm curious how you do the context nesting and statedb thing.

Of course.
But I hope you know that my effort until now has been volunteer effort (under The Laurel Project, unpaid). Recent events have forced me to change my approach. I will be continuing my innovations on Mythos and if any chain is interested to also have them, I invite them there (link to Discord in the video description).

itsdevbear · 2022-12-10T20:08:04Z

Sad to see Ethermint going down this path, but @loredanacirstea I totally understand your rationale, given the state of the project. We recently had to go this route as well and it totally sucks. If you ever want to collaborate on this topic please reach out to either myself or @calbera.

Godspeed 🙏

fedekunze · 2022-12-11T13:08:30Z

> Sad to see Ethermint going down this path, but @loredanacirstea I totally understand your rationale, given the state of the project. We recently had to go this route as well and it totally sucks.

@itsdevbear care to elaborate on why you wrote this comment?

yihuang · 2022-12-16T03:09:28Z

I still only see one way to support precompiles with CacheContext, which is to revert to the context stack solution used before the StateDB refactoring. That's the only way the state observed by precompiles and by EVM contracts is consistent.
With the recent merge of PR cosmos/cosmos-sdk#13881, I think we should give it a try, WDYT @fedekunze? Some degree of performance regression is expected, but hopefully not too much.

loredanacirstea · 2022-12-17T13:04:51Z

@fedekunze please clarify if this is the official stance of Tharsis, as cited in my comment here https://commonwealth.im/evmos/discussion/8496-bring-quasar-to-evmos-execute-cosmos-transactions-queries-from-the-evm-from-any-smart-contract?comment=41558

> As the Evmos Core Development team, we won’t be using the Quasar codebase that you have ended up writing. The Evmos Core Development team will be proposing its own solution for this.

github-actions · 2023-02-01T00:23:27Z

This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 7 days-before-close if no further activity occurs.

yihuang · 2023-02-01T01:32:48Z

This PR is not relevant now, @mmsqe is taking a different approach on this issue, based on the cache store refactoring here: cosmos/cosmos-sdk#14444, we can expect an implementation PR directly soon.

0xadu · 2023-03-01T07:18:14Z

> > Ohh @yihuang I see what you mean, specifically the evm_denom balances, this should be easily solvable as well thankfully!
>
> evm_denom balances, account nonces, I don't know how to do that, would be exciting to know how you guys did it. If you wrote those into ctx, it basically goes back to the approach before journal logs, we'll have one nested cache layer for each message call. If we can modify go-ethereum to call statedb on message call return, we can flatten the cache layer there.

@yihuang thanks for your insightful discussions.
Does this mean that we need to access the dirty data that's not committed when we make a .call?

yihuang · 2023-03-01T07:35:33Z

@0xadu

The ideas in this doc are outdated; you should check out the new implementation here: #1650

0xadu · 2023-03-01T07:37:35Z

> @0xadu
>
> The ideas in this doc are outdated; you should check out the new implementation here: #1650

Thanks, I see.
I'm just curious why the original journal-based statedb approach does not work.

yihuang · 2023-03-01T07:40:22Z

> > @0xadu
> > The ideas in this doc are outdated; you should check out the new implementation here: #1650
>
> Thanks, I see. I'm just curious why the original journal-based statedb approach does not work.

Do you mean wrapping the existing cache store in a journal entry? Performance is very bad with deeply nested cache stores.

0xadu · 2023-03-01T07:47:14Z

I mean AFAIK the statedb has been refactored to replace nested cachestore with journals, but you seem to imply that we need to go back to the original cachestore approach to support stateful precompiled contracts. I'm not clear about the motivation. Correct me if I'm wrong.

yihuang · 2023-03-01T07:48:38Z

> I mean AFAIK the statedb has been refactored to replace nested cachestore with journals, but you seem to imply that we need to go back to the original cachestore approach to support stateful precompiled contracts. I'm not clear about the motivation. Correct me if I'm wrong.

Because native code doesn't have access to the dirty cache in statedb, for example the current balance of the EVM denom.
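
A toy model of the problem, assuming a journal-based StateDB that buffers balance changes in memory until Commit. The names committed, journalStateDB and nativeBalance are hypothetical and only stand in for the bank store and for native module code reading through sdk.Context.

```go
package main

// committed is a stand-in for the bank module's store as native code sees it.
type committed map[string]uint64

// journalStateDB buffers EVM balance changes in memory until Commit.
type journalStateDB struct {
	store committed
	dirty map[string]uint64
}

// SetBalance records a balance change in the dirty cache only.
func (s *journalStateDB) SetBalance(addr string, amt uint64) { s.dirty[addr] = amt }

// GetBalance is what EVM code observes: dirty values first, then the store.
func (s *journalStateDB) GetBalance(addr string) uint64 {
	if v, ok := s.dirty[addr]; ok {
		return v
	}
	return s.store[addr]
}

// Commit flushes the dirty cache into the underlying store.
func (s *journalStateDB) Commit() {
	for a, v := range s.dirty {
		s.store[a] = v
	}
	s.dirty = map[string]uint64{}
}

// nativeBalance models native module code, which reads the store directly
// and therefore never sees the dirty cache.
func nativeBalance(store committed, addr string) uint64 { return store[addr] }
```

After SetBalance("alice", 40) on a store holding 100, the EVM side reads 40 while nativeBalance still reads 100 until Commit runs, which is the inconsistency a precompile would run into mid-transaction.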

yihuang requested review from fedekunze, khoslaventures and jolube as code owners June 16, 2022 09:59

yihuang marked this pull request as draft June 16, 2022 10:09

yihuang added 3 commits July 6, 2022 11:16

draft ADR for stateful precompiled contracts

a894263

ref: evmos#1116

fixes

5439680

add example

57b6ad3

yihuang force-pushed the adr-precompiled branch from 9907a5a to 57b6ad3 Compare July 6, 2022 03:28

yihuang marked this pull request as ready for review July 6, 2022 03:28

Merge branch 'main' into adr-precompiled

bd70ce5

yihuang requested review from facs95 and danburck as code owners August 4, 2022 01:29

yihuang commented Sep 1, 2022

docs/architecture/adr-003-stateful-precompiles.md

Update docs/architecture/adr-003-stateful-precompiles.md

84abc60

loredanacirstea mentioned this pull request Sep 15, 2022

imp(evm): stateless custom precompiles #1272

Merged

11 tasks

This was referenced Nov 5, 2022

core/vm: add call context (for access in precompiles) ethereum/go-ethereum#26119

Closed

Add sdk.Context and vm.EVM to precompiles #1433

Closed

github-actions bot added the Status: Stale label Feb 1, 2023

mmsqe mentioned this pull request Feb 1, 2023

feat: support stateful precompiled contracts #1650

Closed

11 tasks

yihuang closed this Feb 1, 2023
