
Achieving Connectivity Magic #1459

Closed
daviddias opened this issue Jul 20, 2018 · 36 comments
Labels: status/ready · topic/libp2p


@daviddias
Member

daviddias commented Jul 20, 2018

I ran an impromptu, in-person design session with @alanshaw, @olizilla and @achingbrain to walk through some of the design decisions on libp2p, and we ended up discussing the Connectivity Magic Objective and what exactly I mean by "creativity".

The tl;dr: there are multiple ways to find a file, but all of them have tradeoffs.

Here is the sketch we ended up doing:

[image: paper sketches]

Here is an annotated version:

[image: annotated sketch]

I can make a quick video explaining this, but perhaps it would be more valuable to have @alanshaw, @olizilla and @achingbrain explain this to @hugomrdias (who owns the KR to get files from the gateways) through a recorded Zoom call or over text, so that we can check that the memes have indeed been transferred :)

The focus should be on why the shortcoming exists and on the 3 solutions on the solution spectrum (from hacky to ideal, but not necessarily memory efficient).

@ghost

ghost commented Jul 22, 2018

It took me a while to see this: the three steps proposed are 1) routing via DNS records, 2) delegated routing, 3) proper DHT routing.

@daviddias
Member Author

daviddias commented Jul 23, 2018

Everyone, thank you for the really awesome sync up. It was lovely to see how everyone came together so quickly, communicated their concerns well, proposed solutions, and eventually converged on a good set of next steps 👏🏽👏🏽👏🏽❤️❤️❤️.

We converged on an alternative to solution (1), which we named 1.5. The gist of it is that we will use preload gateways (aka caching) to get that Connectivity Magic feeling. The action items are:

  • @lgierth: deploy the "Pre Load gateways" at preload.gateway.ipfs.io and update the js-ipfs team as soon as it's there.
  • js-ipfs team: add the new gateways to the bootstrap list and hit the preload gateway on every add/get/cat (+ dag.get/put), like Companion does.
  • js-ipfs team: release js-ipfs 0.31.
  • @hugomrdias: integrate it all on js.ipfs.io.

Note on hitting the preload gateway: use the "x-ipfs-companion-no-redirect" hint to avoid the preload gateway hit getting redirected to the local daemon.
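For illustration, a minimal sketch of such a preload hit from the browser, assuming the preload.gateway.ipfs.io host from the action items above and a placeholder CID; the opt-out is shown as a query parameter here, though a hash fragment works too:

```js
// Sketch only: ask the preload gateway to fetch a CID, with the
// x-ipfs-companion-no-redirect hint so IPFS Companion leaves the
// request alone instead of redirecting it to the local daemon.
const cid = 'QmFoo' // placeholder CID
fetch(`https://preload.gateway.ipfs.io/ipfs/${cid}?x-ipfs-companion-no-redirect`, { method: 'HEAD' })
  .catch(err => console.error('preload request failed', err))
```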

@ghost

ghost commented Jul 23, 2018

Here are the pros/cons of 1a) always connecting to the ipfs.io gateways and 1b) separate preload gateways, as mentioned in the call:

  • ipfs.io gateways
    • Pro: very easy
    • Con: no PeerIDs in the js-ipfs code; instead we fetch fully-qualified multiaddrs from dnsaddr records, which means we can't authenticate the dnsaddr responses.
    • Con: have to connect to all ipfs.io gateway nodes (8 at the moment, and we want to add more capacity for DWeb Summit).
    • Con: couples libp2p routing cleverness to the gateways, which was a pain for the infra team in the past.
  • preload gateways
    • Con: need to add HEAD requests to add/get/cat/dag.get/dag.put
    • Pro: can have full addresses of these nodes in js-ipfs code, no dnsaddr requests needed
    • Pro: only one connection to a preload node needed
    • Pro: can have different garbage collection settings than the ipfs.io gateways

@ghost

ghost commented Jul 23, 2018

While I'm setting up these preload gateways, you can use the old OVH gateway nodes for testing: the wss0,wss1 addresses in js-ipfs bootstrap config. These run the gateway on :443/ipfs, right next to the /ws transport on :443/ws.

They're crappy, fault-prone hardware and will be decommissioned soon, but should be good enough for a few days.

@olizilla
Member

+1

Can we call them "storage peers" rather than "prefetch gateways"? It's confusing to me to use the term gateway for anything other than HTTP-to-IPFS translation, and it seems that storing files directly on the gateways has been the cause of some trouble.

As I see it, we are proposing to add some "stable bootstrap peers with a good chunk of storage" to the network, exposing an HTTP interface just so we can poke them out of band and trigger them to fetch content from connected browser nodes.

These "storage peers" will (hopefully, usually) be in the swarm for all the gateway nodes too, so that requests for the content sent to the regular gateways will be 1 hop away (rather than 0, as they used to be, or n, as they are currently). It's not imagined that we'll use the HTTP address of the storage peers for general-purpose HTTP-to-IPFS content access in browsers; that will still be the job of the regular gateway nodes.

@alanshaw
Member

So to clarify, we'd add all of the gateways to the bootstrap list, but HEAD request just one of them after add/get/cat/dag.get/dag.put?

  • How do we know which one was HEAD requested (and does that matter)?
  • After the HEAD request, will we then be able to fetch content over HTTP from ipfs.io, or would we typically request it via the prefetch node? Both, I presume, but the prefetch would be faster?

@lidel
Member

lidel commented Jul 23, 2018

Notes on redirect opt-out in Companion and preload:

  • x-ipfs-companion-no-redirect can be put in the URL as a hash or query parameter; the hash semantic is a bit better as it does not leave the browser, it is just a hint for Companion
  • preload via an asynchronous XHR with a cheap HTTP HEAD for a CID should be enough; that is what Companion did before the gateways were re-architected, and preload worked perfectly back then

@daviddias
Member Author

HEAD request just one of them after add/get/cat/dag.get/dag.put?

@alanshaw we will HEAD request all the preload.ipfs.io gateways, of which, if I understood correctly, @lgierth will deploy 4 for extra capacity.

@ghost

ghost commented Jul 25, 2018

So, unfortunately JFK Terminal 1 has roughly three power outlets so I can't work on this tonight.

I'll have addresses for you tomorrow morning-ish (http for the preload requests and dnsaddr for peering). We don't have proper hosts for this provisioned yet, but in the meantime the old gateway hosts can be used for testing.

Can we call them "storage peers" rather than "prefetch gateways"?

Let's do preload peers -- not calling them gateways is fair, but "storage" would give a false sense of permanence.

@ghost

ghost commented Jul 25, 2018

HEAD requests

By the way, a strong alternative to this is calling /api/v0/refs?r=true&arg=QmFoo on the preload peer, and waiting for the complete response to stream in.
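A sketch of that alternative (host and CID are placeholders; the endpoint path is as given above). The idea is that the request only resolves once the preload peer has streamed back every ref, i.e. fetched every block:

```js
// Trigger a recursive refs walk on a preload peer and wait for the
// streamed response to complete, at which point the peer holds the DAG.
async function preloadViaRefs (cid, host = 'https://node0.preload.ipfs.io') {
  const res = await fetch(`${host}/api/v0/refs?r=true&arg=${cid}`)
  if (!res.ok) throw new Error(`refs request failed: ${res.status}`)
  await res.text() // consume the full stream before resolving
}
```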

@lidel
Member

lidel commented Jul 25, 2018

@lgierth do you know if ipfs refs --recursive preloads data for the entire tree (with leaves), or everything-but-leaves?

alanshaw mentioned this issue Jul 25, 2018
@ghost

ghost commented Jul 25, 2018

do you know if ipfs refs --recursive preloads data for the entire tree (with leaves), or everything-but-leaves?

Yes, it loads every block that's referenced in that dag -- the nice thing is it enables progress reporting on the "client" side, since you already know the set of all CIDs and can compare how much has been fetched already.
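A sketch of that progress idea, assuming the refs endpoint streams newline-delimited JSON (one ref object per block, as go-ipfs does) and that the caller already knows the total block count from its own local refs walk:

```js
// Count streamed refs as they arrive and report progress against a
// known total. totalBlocks would come from a prior local `refs -r` walk.
async function preloadWithProgress (cid, totalBlocks, host = 'https://node0.preload.ipfs.io') {
  const res = await fetch(`${host}/api/v0/refs?r=true&arg=${cid}`)
  const reader = res.body.getReader()
  const decoder = new TextDecoder()
  let fetched = 0
  let buf = ''
  while (true) {
    const { done, value } = await reader.read()
    if (done) break
    buf += decoder.decode(value, { stream: true })
    const lines = buf.split('\n')
    buf = lines.pop() // hold back any partial line
    fetched += lines.filter(Boolean).length
    console.log(`preloaded ${fetched}/${totalBlocks} blocks`)
  }
}
```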

@daviddias
Member Author

Can I get a quick check-in on the status of #1459 (comment)?

@alanshaw
Member

alanshaw commented Jul 26, 2018

@diasdavid

@alanshaw
Member

alanshaw commented Jul 26, 2018

do you know if ipfs refs --recursive preloads data for the entire tree (with leaves), or everything-but-leaves?

Yes, it loads every block that's referenced in that dag -- the nice thing is it enables progress reporting on the "client" side, since you already know the set of all CIDs and can compare how much has been fetched already.

When we're adding content to IPFS, we don't really want to wait around for it to be uploaded to the preload nodes, which is why a HEAD request is nice: it's quick and light, and the content can then be slurped from my node asynchronously.

In Companion we HEAD request every CID added, but is this necessary?

Looking at the code, it looks as though if you send a HEAD request to a js-ipfs gateway it would load the CID as well as its descendants (because it doesn't differentiate between HEAD and GET).

@ghost

ghost commented Jul 26, 2018

it would load the CID as well as its descendants (because it doesn't differentiate between HEAD and GET).

Actually, HEAD is even a bit more diligent than GET -- the former reads the whole node to calculate Content-Length, while the latter reads only what it has to in order to satisfy the request.

Neither reads into directories, i.e. if you get a directory index, none of the children or subdirectories are fetched.

@ghost

ghost commented Jul 26, 2018

@lgierth gave me two addresses but the domain preload.ipfs.io is not resolving yet

Ah yes -- there are now A/AAAA records for preload.ipfs.io, so if you're connected to all the preloader peers in _dnsaddr.preload.ipfs.io, you're sure to hit home.

There is a slightly different option, which is individual A/AAAA records for each preloader peer. That'd let you connect to only one preloader peer and make HTTP requests to exactly that one, instead of connecting to all of them.

@ghost

ghost commented Jul 26, 2018

These preload peers are now reachable:

/dns4/node0.preload.ipfs.io/tcp/443/wss/ipfs/QmZMxNdpMkewiVZLMRxaNxUeZpDUb34pWjZ1kZvsd16Zic
/dns4/node1.preload.ipfs.io/tcp/443/wss/ipfs/Qmbut9Ywz9YEDrz8ySBSgWyJk41Uvm2QJPhwDJzJyGFsD6
https://node0.preload.ipfs.io/ipfs
https://node1.preload.ipfs.io/ipfs

If js-ipfs can resolve /dnsaddr, these addresses can be shortened to:

/dnsaddr/preload.ipfs.io/ipfs/QmZMxNdpMkewiVZLMRxaNxUeZpDUb34pWjZ1kZvsd16Zic
/dnsaddr/preload.ipfs.io/ipfs/Qmbut9Ywz9YEDrz8ySBSgWyJk41Uvm2QJPhwDJzJyGFsD6

Note that the PeerIDs will change sometime later today as we get the new hosts up -- right now these are two crappy old OVH hosts that I actually want to get rid of. I'll notify here once the PeerIDs change.
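For illustration, a sketch of wiring the addresses above into a js-ipfs node; the `preload` option shape follows the PR referenced below, and the `/dns4/.../https` address form is an assumption, so treat the exact keys as provisional:

```js
const IPFS = require('ipfs')

// Sketch: bootstrap via the wss multiaddrs listed above and point
// preloading at the same hosts' HTTP APIs (option names provisional).
const node = new IPFS({
  config: {
    Bootstrap: [
      '/dns4/node0.preload.ipfs.io/tcp/443/wss/ipfs/QmZMxNdpMkewiVZLMRxaNxUeZpDUb34pWjZ1kZvsd16Zic',
      '/dns4/node1.preload.ipfs.io/tcp/443/wss/ipfs/Qmbut9Ywz9YEDrz8ySBSgWyJk41Uvm2QJPhwDJzJyGFsD6'
    ]
  },
  preload: {
    enabled: true,
    addresses: [
      '/dns4/node0.preload.ipfs.io/https',
      '/dns4/node1.preload.ipfs.io/https'
    ]
  }
})
```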

@pgte
Contributor

pgte commented Jul 26, 2018

Sorry I'm a bit late to the party.
@alanshaw Is there a quick and dirty way of making this work on the latest js-ipfs?

@ghost

ghost commented Jul 26, 2018

It's gonna be in 0.31 very very soon: #1459 (comment)

alanshaw added a commit that referenced this issue Jul 27, 2018
refs #1459

This PR adds a new config property `preload`:

```js
new IPFS({
  preload: {
    enabled: false,
    addresses: ['/multiaddr/api/address']
  }
})
```

* `preload.enabled` (default `false`) enables/disables preloading - **should the default be false?**
* `preload.addresses` array of node API addresses to preload content on. These are the addresses we make a `/api/v0/refs?arg=QmHash` request to, to initiate the preload

**This PR upgrades the following APIs to preload content.** After adding content with `ipfs.files.add` (for example), we make a request to the first preload gateway address (provided `preload.enabled` is true), falling back to the second, etc.

* [x] `dag.put`
* [x] `block.put`
* [x] `object.new`
* [x] `object.put`
* [x] `object.patch.*`
* [x] `mfs.*`

MFS preloading is slightly different - we periodically submit your MFS root to the preload nodes when it changes.

NOTE: this PR adds an option to `dag`, `block` and `object` APIs allowing users to opt out of preloading by specifying `preload: false` in their options object.

License: MIT
Signed-off-by: Alan Shaw <[email protected]>
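A hypothetical usage sketch of the opt-out described in the commit message above, assuming the option lands as written:

```js
// Skip the preload round-trip for a node we don't want pushed out yet.
const cid = await ipfs.dag.put({ hello: 'world' }, { preload: false })
```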
@ghost

ghost commented Jul 28, 2018

Update about the PeerID changes mentioned above -- we will not change the PeerIDs after all, and will keep the ones mentioned above.

This means the addresses above in #1459 (comment) are the correct ones for the release.

This also means you can safely remove the wss0.bootstrap.libp2p.io and wss1.bootstrap.libp2p.io nodes from any configs, since we'll be shutting these two hosts down and moving their private keys and PeerIDs over to the new preloader hosts.

@daviddias
Member Author

  • This also means you can safely remove the wss0.bootstrap.libp2p.io and wss1.bootstrap.libp2p.io nodes from any configs -> Done

@ghost

ghost commented Jul 29, 2018

The CORS issue should be fixed

@alanshaw
Member

I'm not sure I understand the question...

  • preload on add effectively pushes your data up to the preload nodes for others to consume
  • preload on get prompts the preload nodes to fetch the data from a different node

They both solve the problem of connecting nodes to content without a DHT.
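A sketch of the two directions in code (API names per js-ipfs of that era; the preload requests themselves fire out of band):

```js
// Preload on add: our node already has the blocks, so the preload node
// pulls them from us, keeping them available after our tab closes.
const [{ hash }] = await ipfs.files.add(Buffer.from('hello world'))

// Preload on get: the preload node is prompted to fetch the same blocks
// from whichever peer has them, warming the path for everyone else.
const data = await ipfs.files.cat(hash)
```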

@parkan
Contributor

parkan commented Sep 19, 2018

@alanshaw I guess it's confusing from a REST verb perspective -- what looks like the same GET call actually represents two different directions of data movement (which also happens out of band from the HTTP request itself): one from your node up to the server (kind of an upload/PUT) and one between some other node and the server (kind of a sideload/???)

at minimum, it's difficult to tell what's happening based on network requests in the console (are these preloads firing because I'm adding things or because I'm attempting to fetch?)

imagine a page that both .gets and .adds some content and preloads are failing (e.g. #1481); some content is also failing to load -- is it because of the failed preloads or something else?

I get that in a sense it doesn't matter and causing a remote .get on a node that you have a p2p connection to is an elegant solution to both problems, but I've found it difficult to reason about in practice

what about adding an (ignored) parameter like ?trigger=add to the URL so it's clear why it's happening?

@parkan
Contributor

parkan commented Sep 19, 2018

at minimum, we should add a description of both scenarios here because it is not at all obvious that both can happen

parkan added a commit to parkan/js-ipfs that referenced this issue Sep 19, 2018
Preload is confusing because it happens on both add and get, and because it isn't documented anywhere in depth. This clarifies what happens a bit. We should also write out the behavior in more detail like with circuit relay above (possibly in the form of a tutorial for setting up your own preload node)

see also ipfs#1459 (comment)
alanshaw pushed a commit that referenced this issue Sep 20, 2018
(same commit message as above; see also #1459 (comment))
@parkan
Contributor

parkan commented Oct 3, 2018

@alanshaw here's another preload-related question -- how long is data retained by the remote nodes? Is it just a normal pin? Do they announce to the DHT?

@alanshaw
Member

Posting this here for discoverability:

What are the "preload" nodes and why are they necessary to run js-ipfs?
see #1874 (comment)

@lidel
Member

lidel commented Jun 3, 2019

cc relevant topics:

@daviddias
Member Author

Not a blocker, but relevant to the thread: libp2p/js-libp2p#385

@achingbrain
Member

Closing this because it is very stale.

We now live in the future, where browser nodes can dial server nodes via WebTransport and WebRTC without any extra setup (unlike WebSockets), so as these transports proliferate, connectivity will too. Magic!
