Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

BlockService: AddBlock & AddBlocks write twice to datastore when used with Bitswap #7956

Closed
hannahhoward opened this issue Mar 2, 2021 · 18 comments · Fixed by ipfs/go-bitswap#571 or #9154
Assignees
Labels
effort/days Estimated to take multiple days, but less than a week exp/wizard Extensive knowledge (implications, ramifications) required kind/bug A bug in existing code (including security flaws) P2 Medium: Good to have, but can wait until someone steps up status/ready Ready to be worked

Comments

@hannahhoward
Copy link
Contributor

Version information:

current

Description:

Calling AddBlock or AddBlocks on the block service appears to write to the data store twice. The reason is that the blockservice calls Put here:
https://github.com/ipfs/go-blockservice/blob/master/blockservice.go#L146
And then calls exchange.HasBlock here:
https://github.com/ipfs/go-blockservice/blob/master/blockservice.go#L153
HasBlock eventually calls PutMany here:
https://github.com/ipfs/go-bitswap/blob/master/bitswap.go#L372

I produces a test to repro the issue (it's pretty ugly but should demonstrate that I'm not just reading it wrong): https://github.com/ipfs/go-blockservice/blob/test/prove_double_write/test/blocks_test.go#L108

I believe you could fix it by changing https://github.com/ipfs/go-bitswap/blob/master/bitswap.go#L371 from:

if len(wanted) > 0 {

to

if len(wanted) > 0 && from != "" {

but I dunno what side effects that might cause.

I just was reading through the code trying to figure some stuff out about designing new ipld-prime interfaces and I saw that. I dunno what the penalty is for double writing but when adding a big DAG I can imagine it might be large.

@hannahhoward hannahhoward added kind/bug A bug in existing code (including security flaws) need/triage Needs initial labeling and prioritization labels Mar 2, 2021
@Stebalien
Copy link
Member

This is why we end up calling Has here: https://github.com/ipfs/go-ipfs-blockstore/blob/fb07d7bc5aece18c62603f36ac02db2e853cadfa/blockstore.go#L172-L190.

But I'd really like to get rid of the second write as it's kind of racy and we can end up calling:

  1. Thread 1: PutMany, Has (returns false)
  2. Thread 2: PutMany, Has (returns false)
  3. Thread 1: Actually put.
  4. Thread 2: Actually put.

@Stebalien
Copy link
Member

But yeah. I think the concern is that if we don't write the block there, the caller may also not have written the block yet and then we'll think we have the block when we don't.

@aschmahmann
Copy link
Contributor

Is this something that gets resolved automatically when we finally convert the blockstore to be keyed on multihashes?

Is this important enough to bother worrying about before we do that conversion?

@Stebalien
Copy link
Member

Stebalien commented Mar 2, 2021 via email

@aschmahmann aschmahmann added status/ready Ready to be worked P2 Medium: Good to have, but can wait until someone steps up effort/days Estimated to take multiple days, but less than a week exp/wizard Extensive knowledge (implications, ramifications) required and removed need/triage Needs initial labeling and prioritization labels Mar 15, 2021
@schomatis schomatis self-assigned this Dec 23, 2021
@schomatis
Copy link
Contributor

From what I'm reading here the issue boils down to the undocumented HasBlock API: does it expect the block to be already in the Blockstore or not?

According to that answer either AddBlock or BitSwap's inmplementation of HasBlock should be adapted accordingly. I don't have enough background to make that call so will need a tech lead for this (cc @Stebalien or @aschmahmann). Is there any test or design consideration to help guide this decision?

As for the original performance consideration I think we need to clarify what we're trying to avoid: "write twice to datastore" is not the same as calling both Blockstore's Put and PutMany (or any combination of the two) since we always check with Has if the block is there before attempting the Datastore's Put, so at most the performance concern here would be calling Has more than needed, not Put (the latter case would seem very unlikely for it to be a performance concern here).

@schomatis
Copy link
Contributor

(A case might be argued in the Batching scenario where the Blockstore's PutMany breaks the abstraction by accessing directly the underlying Datastore interface through Has to make a decision of which block to put through Batch.Put(), the correct abstraction. Any check of existence should probably happen later during Batch.Commit(). Still, I don't think that's the main performance issue of concern here and in any case should be discussed independently from this issue.)

@BigLep BigLep added this to the Best Effort Track milestone Mar 10, 2022
@BigLep BigLep moved this to 🛑 Blocked in IPFS Shipyard Team Mar 10, 2022
@BigLep
Copy link
Contributor

BigLep commented Mar 11, 2022

2022-03-11 conversation: this is a performance issue. We need to do some more triage to understand if we're doubling our writes to disk. It's less bad if we're doing two checks. @schomatis analysis to date is that we're just doing two checks, which lowers the priority.

@BigLep
Copy link
Contributor

BigLep commented Mar 11, 2022

Next step: need more information on which method to fix. Someone will need to dig into the code more and make the plan.

@MichaelMure
Copy link
Contributor

Complementary food for thought: why does an exchange (bitswap) write blocks on disk to begin with? It looks to me that this is the root cause of that problem. Why is that not done only in the blockservice? What if you just want to retrieve blocks without persisting them?

I came to this issue for a different reason, that seems to tie to the problem you all describe. My problem is that I'm trying to write a custom data pipeline where I re-route blocks pinned to a CAR file. In that context I need to re-route the blocks pulled from the exchange, but once bitswap is constructed in go-ipfs I don't have access to the blockstore and I can't override that. It would be much simpler and more composable if the exchange would just exchange and not bother with writing on disk.

@Jorropo
Copy link
Contributor

Jorropo commented Jul 8, 2022

@MichaelMure I've been deep into bitswap's internal lately and AFAIT it's because it want to cache blocks you just downloaded.

@MichaelMure
Copy link
Contributor

MichaelMure commented Jul 8, 2022

Sure, but why would that be a bitswap responsibility?

The write happen in receiveBlocksFrom(). This function is called:

  • in HasBlock(), coming from BlockService, where we already added the blocks. It's wasteful and useless.
  • in ReceiveMessage() where (as I understand) blocks are received from the network and immediately shipped back to the called of GetBlock() or GetBlocks(): the BlockService. Caching can be done there.

Also, this comment is pretty telling: https://github.com/ipfs/go-bitswap/blob/84973686518be4831ee86e7b6b3f1b9834d0ce97/bitswap.go#L475-L478

@Jorropo
Copy link
Contributor

Jorropo commented Jul 8, 2022

Sure, but why would that be a bitswap responsibility?

It's software we can make stuff up. It just look like an architectural choice that stuck.
We are working on rewriting parts of the bitswap codebase. I'll fix this in the process.

@Jorropo Jorropo assigned Jorropo and unassigned schomatis Jul 8, 2022
@MichaelMure
Copy link
Contributor

Actually I would go even further. The only other place the blockstore is used in bitswap is to record stats (is the incoming blocks new or already present locally?). I'd be very tempted to remove that and drop the blockstore entirely from bitswap.

@MichaelMure
Copy link
Contributor

I'll fix this in the process.

Any rough timeframe on this? Please ping me on that PR.

@Jorropo
Copy link
Contributor

Jorropo commented Jul 8, 2022

Any rough timeframe on this? Please ping me on that PR.

A month ? Two maybe ?

@Jorropo Jorropo closed this as completed Jul 8, 2022
Repository owner moved this from 🛑 Blocked to 🎉 Done in IPFS Shipyard Team Jul 8, 2022
@Jorropo Jorropo reopened this Jul 8, 2022
@MichaelMure
Copy link
Contributor

MichaelMure commented Jul 8, 2022

A month ? Two maybe ?

That's a bit long for my need. I might patch bitswap and blockservice to remove the blockstore in bitswap. Would that be of interest for a PR?

Edit: on the client part of the code

@Jorropo
Copy link
Contributor

Jorropo commented Jul 8, 2022

Would that be of interest for a PR?

Yes if you do that before me 100% send a PR. thx

MichaelMure added a commit to MichaelMure/go-bitswap that referenced this issue Jul 8, 2022
This leave the responsibility and choice to do so to the caller, typically go-blockservice.

This has several benefit:
- untangle the code
- allow to use an exchange as pure block retrieval
- avoid double add

Close ipfs/kubo#7956
@MichaelMure
Copy link
Contributor

@Jorropo see ipfs/go-bitswap#571 and ipfs/go-blockservice#92.

I didn't remove the stats that use the blockstore, I'll let you judge if that make sense during your client/server split.

MichaelMure added a commit to MichaelMure/go-bitswap that referenced this issue Jul 9, 2022
This leave the responsibility and choice to do so to the caller, typically go-blockservice.

This has several benefit:
- untangle the code
- allow to use an exchange as pure block retrieval
- avoid double add

Close ipfs/kubo#7956
MichaelMure added a commit to MichaelMure/go-bitswap that referenced this issue Jul 14, 2022
This leave the responsibility and choice to do so to the caller, typically go-blockservice.

This has several benefit:
- untangle the code
- allow to use an exchange as pure block retrieval
- avoid double add

Close ipfs/kubo#7956
MichaelMure added a commit to MichaelMure/go-bitswap that referenced this issue Jul 14, 2022
This leave the responsibility and choice to do so to the caller, typically go-blockservice.

This has several benefit:
- untangle the code
- allow to use an exchange as pure block retrieval
- avoid double add

Close ipfs/kubo#7956
MichaelMure added a commit to MichaelMure/go-bitswap that referenced this issue Jul 14, 2022
This leave the responsibility and choice to do so to the caller, typically go-blockservice.

This has several benefit:
- untangle the code
- allow to use an exchange as pure block retrieval
- avoid double add

Close ipfs/kubo#7956
Jorropo pushed a commit to MichaelMure/go-bitswap that referenced this issue Jul 28, 2022
This leave the responsibility and choice to do so to the caller, typically go-blockservice.

This has several benefit:
- untangle the code
- allow to use an exchange as pure block retrieval
- avoid double add

Close ipfs/kubo#7956
Jorropo pushed a commit to MichaelMure/go-bitswap that referenced this issue Jul 28, 2022
This leave the responsibility and choice to do so to the caller, typically go-blockservice.

This has several benefit:
- untangle the code
- allow to use an exchange as pure block retrieval
- avoid double add

Close ipfs/kubo#7956
Jorropo added a commit to Jorropo/go-ipfs that referenced this issue Jul 28, 2022
Jorropo added a commit to Jorropo/go-ipfs that referenced this issue Jul 28, 2022
Jorropo added a commit that referenced this issue Jul 28, 2022
Jorropo pushed a commit to Jorropo/go-libipfs that referenced this issue Jan 26, 2023
This leave the responsibility and choice to do so to the caller, typically go-blockservice.

This has several benefit:
- untangle the code
- allow to use an exchange as pure block retrieval
- avoid double add

Close ipfs/kubo#7956


This commit was moved from ipfs/go-bitswap@a052ec9
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
effort/days Estimated to take multiple days, but less than a week exp/wizard Extensive knowledge (implications, ramifications) required kind/bug A bug in existing code (including security flaws) P2 Medium: Good to have, but can wait until someone steps up status/ready Ready to be worked
Projects
No open projects
Archived in project
7 participants