Add better validation around matching response with request information #3140

fselmo · 2023-11-02T20:47:48Z

What was wrong?

Add better validation that the cached request information really matches the response we are trying to process. This is done by passing the actual response_id for the matching request once we go back through the middleware onion and the response is present. This area of the code hadn't been iterated on since some of the earlier commits in the building out of the provider. This is a very important and necessary improvement on id matching.

This resolves an issue reported on Discord where await asyncio.gather(one_request, second_request) was guessing the wrong request id for each request since they are basically being triggered / yielding to each other in a shared context.

Steps to reproduce

async def reproducible_example():
    async with AsyncWeb3.persistent_websocket(WebsocketProviderV2(PROVIDER_URI)) as w3:
        (
            latest,
            chain_id,
            block_num,
            pending,
            chain_id2,
            chain_id3,
            finalized,
            balance,
        ) = await asyncio.gather(
            w3.eth.get_block("latest"),
            w3.eth.chain_id,
            w3.eth.block_number,
            w3.eth.get_block("pending"),
            w3.eth.chain_id,
            w3.eth.chain_id,
            w3.eth.get_block("finalized"),
            w3.eth.get_balance("shaq.eth")
        )
        print(f"block num from latest block: {latest['number']}\n")
        print(f"chain_id: {chain_id}\n")
        print(f"block_num: {block_num}\n")
        print(f"block num from pending block: {pending['number']}\n")
        print(f"chain_id2: {chain_id2}\n")
        print(f"chain_id3: {chain_id3}\n")
        print(f"block num from finalized block: {finalized['number']}\n")
        print(f"balance: {balance}\n")

asyncio.run(reproducible_example())

Todo:

Add entry to the release notes

Cute Animal Picture

- Add better validation that the cached request information really matches the response we are trying to process. This is done by passing the response id to get the cached request information for the matching id. This resolves an issue reported on Discord where ``await asyncio.gather(one_request, second_request)`` was guessing the wrong request id for each request since they are basically being triggered / yielding to each other in a shared context.

- asyncio.gather() will run all tasks, yielding to each other. This makes for a good test case to test that each request information that is cached is matched appropriately to the response that is received and its id.

reedsa · 2023-11-02T22:39:20Z

web3/providers/websocket/request_processor.py

-        request_id = next(copy(self._provider.request_counter)) - 1
-        cache_key = generate_cache_key(request_id)
-        current_request_cached_info: RequestInformation = (
+        cache_key = generate_cache_key(response_id)


In the examples we were walking through earlier, I think there was a sticky point that some responses didnt appear to include an id. If that happens, generate_cache_key will raise. Should we swallow that exception and just skip the processors here instead?

That's a good point. We could still return the raw response and try to not halt things here. I think passing in the response to this method would allow us to retrieve the id if it's there or debug log the response that doesn't have an id and keep moving.

Yeah that sounds like it would work great!

reedsa

Lgtm

fselmo force-pushed the search-radius branch from f26bef8 to fb2bfd3 Compare November 2, 2023 21:04

fselmo added a commit to fselmo/web3.py that referenced this pull request Nov 2, 2023

add relevant newsfragment for ethereum#3140

89728cb

fselmo marked this pull request as ready for review November 2, 2023 22:00

fselmo added a commit to fselmo/web3.py that referenced this pull request Nov 2, 2023

add relevant newsfragment for ethereum#3140

3dbe951

fselmo force-pushed the search-radius branch from 89728cb to 3dbe951 Compare November 2, 2023 22:08

fselmo requested review from pacrob and reedsa November 2, 2023 22:11

fselmo added 2 commits November 2, 2023 16:24

Add a test for 'concurrent' async middleware response processing

f3083d6

- asyncio.gather() will run all tasks, yielding to each other. This makes for a good test case to test that each request information that is cached is matched appropriately to the response that is received and its id.

add relevant newsfragment for ethereum#3140

ccfdef2

fselmo force-pushed the search-radius branch from 3dbe951 to ccfdef2 Compare November 2, 2023 22:24

reedsa reviewed Nov 2, 2023

View reviewed changes

fselmo added a commit to fselmo/web3.py that referenced this pull request Nov 2, 2023

tweaks based on comments on PR ethereum#3140

514827b

fselmo requested a review from reedsa November 3, 2023 17:48

reedsa approved these changes Nov 3, 2023

View reviewed changes

tweaks based on comments on PR ethereum#3140

9af8aa4

fselmo force-pushed the search-radius branch from 514827b to 9af8aa4 Compare November 3, 2023 19:15

fselmo merged commit 3728df8 into ethereum:main Nov 3, 2023
3 of 89 checks passed

fselmo added a commit that referenced this pull request Nov 3, 2023

add relevant newsfragment for #3140

6679afe

fselmo deleted the search-radius branch November 3, 2023 19:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add better validation around matching response with request information #3140

Add better validation around matching response with request information #3140

fselmo commented Nov 2, 2023 •

edited

Loading

reedsa Nov 2, 2023

fselmo Nov 2, 2023

reedsa Nov 3, 2023

reedsa left a comment

Add better validation around matching response with request information #3140

Add better validation around matching response with request information #3140

Conversation

fselmo commented Nov 2, 2023 • edited Loading

What was wrong?

Steps to reproduce

Todo:

Cute Animal Picture

reedsa Nov 2, 2023

Choose a reason for hiding this comment

fselmo Nov 2, 2023

Choose a reason for hiding this comment

reedsa Nov 3, 2023

Choose a reason for hiding this comment

reedsa left a comment

Choose a reason for hiding this comment

fselmo commented Nov 2, 2023 •

edited

Loading