Forbid non-cryptographic hashes within /ipfs namespace #4371

ghost · 2017-11-05T22:20:58Z

Update: This is DONE, we currently block non-cryptographic hash functions at blockstore level – see #4371 (comment)

It just occured to me that by supporting non-cryptographic hashes (such as murmur3) in multihash, these are also available for CIDs, and can thus be used within the /ipfs namespace (example: ipfs/ipfs-companion#305).

This is not good! /ipfs paths MUST be cryptographically verifiable, it's their primary property. A lack of cryptographic verifiability seriously breaks so many assumptions on so many levels.

This would probably be appropriate to enforce within the respective IPLD modules.

ghost · 2017-11-05T22:21:08Z

cc @Stebalien @Kubuxu @whyrusleeping

ghost · 2017-11-05T22:27:42Z

Giving this a bug label. ipfs.io frontpage says:

Here's how IPFS works

Each file and all of the blocks within it are given a unique fingerprint called a cryptographic hash.

[...]

lidel · 2017-11-05T23:55:44Z

Also, it looks like there is a bug somewhere, probably in go-multihash.
Spec defines 0x22 as murmur3-128 while go-multihash/multihash.go generates only 4 bytes (32bits)

See go-multihash/multihash.go#L63 / #L140 vs multihash/hashtable.csv#L20

Stebalien · 2017-11-06T20:46:12Z

Yeah... we also have to check:

Hash lengths.
For sha1, we'd want to verify that the hash isn't in the set of class of known vulnerable hashes. Note: we can't do this by just looking at the hash; we need the data as well.

We may want to introduce a new type/constructor to enforce this. Personally, I'd rename the current Multihash type to InsecureMultihash or something scary like that and introduce an InsecureSum method etc.

magik6k · 2017-11-06T20:50:33Z

On sha1 - multiformats/go-multihash#57

Kubuxu · 2017-11-06T22:03:44Z

My idea of solving this issue, as well as issue of short hashes is:

For each hash function define its # of security bits ratio. As an example, SHA1 has length of 160 bits, if it was perfect hash function it would have 80 bits of security for collision but SHA1 is partially broken, it currently is deemed to have 61 bits of security so its security bits ratio is 61/80 = 0.76 (which could be further reduced because of SHAttered).

Then every time we see hash for content addressing we evaluate how many bits of security it really have (formula would be mhLenInBits / 2 * secBitRatio) and compared with required number of security bits. This value would by default set to allow SHA1 and could be increased in config.

Full round variant of SHA256 is yet to be successfully cryptoanalysed so its security bit factor would be 1 same as many other cryptographically sound hash functions.

Murmur would be cut off for two reasons, low bit count and low security factor (0 ??), same with other hash functions that are currently obsolete.

jbenet · 2017-12-24T03:58:36Z

I like the ideas for calculating the bit security, specially because can easily calculate what length of a hash would still be secure.
But the problem is that relies on having up-to-date knowledge on all attacks and increases surface for run-time attacks (trick the program into calculating incorrectly).
It would be best if we just had a whitelist of the hashes we think are secure, and minimum lengths. We can keep rolling that up and decommissioning hashes. This could also be altered by people recompiling go-ipfs for their own use cases, but our standard distribution should be fixed (and not allow an editable text config to compromise security)

jbenet · 2017-12-24T03:59:39Z

This is a bit urgent, so ideally a partial solution (like denying all hashes not in a short whitelist or with lengths < 256 bits) could be deployed in the next release.a

Kubuxu · 2018-01-31T11:26:16Z

(and not allow an editable text config to compromise security)

That is why in my writeup I only mentioned increasing number of security bits. Decreasing bellow some set minimum (or default) would not be possible.

jbenet · 2018-02-22T11:33:04Z

Team, we need this. To make it simpler for now, go with:

Have a compiled whitelist of the hashes we think are secure, w/ minimum lengths. We can keep rolling that up and decommissioning hashes. We can always expand the list should we think t ok. We can always change to a smarter function in the future.

Allowed hashes from https://github.com/multiformats/multihash/blob/master/hashtable.csv:

identity (once #4697 lands)
sha2-*
dbl-sha2-256  (lol, i'm sot surprised)
sha3-*
shake-*   ? (confirm someone)
keccak-*
blake2*

As said above, this could also be altered by people recompiling go-ipfs for their own use cases, but our standard distribution should be fixed (and not allow an editable text config to compromise security)
Let's leave the "smart hash picker" @Kubuxu described for the future.
For old content, we could have something like /ipfs-insecure/<hash> that allows insecure hashes. This will make sure old content is still retrievable and viewable locally, especially during flag days. let's leave this (or an alternate, better solution) for the future

mib-kd743naq · 2018-02-22T11:51:44Z

For old content, we could have something like...

Do note that there already is such "old content" from a test I did back in 2016 ( 17 cases, specifically everything in the rows v1pb-sha1-20 and v1raw-sha1-20 and the column v1pb-sha1-20 ).

You can use this matrix both for validating "yep, we no longer resolve this link" and for a possible future "yes, we still can open it under /ipfs-insecure/..."

whyrusleeping · 2018-02-22T21:39:05Z

Started a solution here: ipfs/go-cid#40

My approach will be to restrict the creation of invalid cids. This is a more surefire way restricting content than trying to police all the different places data can get into the system.

cc @Stebalien @magik6k @Kubuxu @kevina

See: ipfs/kubo#4371

mib-kd743naq · 2018-04-18T11:36:33Z

I am not sure whether this has been fully fixed or not, but there is improvement: I now get a 504 after a massive timeout ( 10 minutes or something, should likely be lowered ) if I try to access <redacted>

Kudos to the team!

Kubuxu · 2018-04-18T13:54:14Z

What is under this hash?

Kubuxu · 2018-04-18T13:55:09Z

It was solved by: #4751
but we want to move to a permanent solution of ipfs/go-cid#40

mib-kd743naq · 2018-04-18T14:05:17Z

What is under this hash?

On an unpatched go-ipfs:

~$ ipfs cat QmVpsktzNeJdfWEpyeix93QJdQaBSgRNxebSbYSo9SQPGx
This is some text self-referentially addressable via https://ipfs.io/ipfs/QmVpsktzNeJdfWEpyeix93QJdQaBSgRNxebSbYSo9SQPGx
I can of course just as easily replace it with anything else at any point in the future :)

From my email to security@ back in November:

This works because QmVpsktzNeJdfWEpyeix93QJdQaBSgRNxebSbYSo9SQPGx is a linknode with a sole pointer to https://ipfs.io/ipfs/z9iepse, which is the CID:

01 - version
55 - type raw
12 - sha256
01 - hash subset length
6d - first byte of hash

The attack would be mounted by providing some legitimate content with the intent to replace benign short-addressed nodes within the DAG with malicious ones later.

mib-kd743naq · 2018-04-18T14:12:39Z

Looking over #4751 it seems there is no work related to the DHT. I still strongly recommend the following:

I recommend limiting the length attribute of CidV1 to at least 160 bits, and adding code ensuring the DHT will not relay anything shorter ( to protect clients which are unable/unwilling to upgrade ).

Kubuxu · 2018-04-18T14:15:44Z

@mib-kd743naq my plan for ipfs/go-cid#40 is to forbid any creation of CID with the number of security bits lower than SHA1.

I wonder why it times out.

mib-kd743naq · 2018-04-18T14:24:16Z

my plan ... is to forbid any creation of CID

You can not really do this in an open p2p system: you can not control what shows up on the network. Case in point: I assembled all of the above without any of the go-ipfs libraries.

All you can realistically do is harden various readers/repeaters.

Kubuxu · 2018-04-18T14:46:51Z

Yes, that what I meant. If we are not able to create an insecure CID on our node then we won't be able to propagate info about it.

See: ipfs/kubo#4371

kevina · 2018-06-25T04:45:39Z

Here is my action plan to switching to Strict CIDs. At the same time we will add support for recognizing identity hashes and being able to use them without storing them in the datastore (see #4766 (comment))

(1) Merge Add initial support for identity hashes (Part 1) #5117
(2) Finish and merge Only allow crypto-safe hashes to be used in CIDs go-cid#40
(3) Finish (But don't merge) [WIP] Strict Cids #5150 using merged Only allow crypto-safe hashes to be used in CIDs go-cid#40
(4) Finish migration code WIP: Initial implementation of 7-8 migration. fs-repo-migrations#77 using dist created from [WIP] Strict Cids #5150
(5) Merge [WIP] Strict Cids #5150 and WIP: Initial implementation of 7-8 migration. fs-repo-migrations#77

Note I used gx-go link to test how badly ipfs/go-cid#40 breaks things by returning errors in the NewCid functions. It turns out not too much. There where one or two fixes I had to make outside of this repo. The main pain will be bubbling up a new version of go-cid.

@whyrusleeping @Kubuxu @Stebalien (and anyone else) sound good?

whyrusleeping · 2018-06-25T17:27:18Z

@kevina sounds reasonable to me.

whyrusleeping · 2018-06-25T17:28:57Z

Btw, for any readers of this issue, The discussion now is about a 'more correct' fix to the problem. Currently, we just prevent the use of these hashes via the blockstore in ipfs. This effectively solves the problem, but its not a clean solution.

ghost added the need/community-input Needs input from the wider community label Nov 5, 2017

ghost added the kind/bug A bug in existing code (including security flaws) label Nov 5, 2017

lidel mentioned this issue Nov 6, 2017

Is 32bit murmur3 safe to use in IPFS-based URL Shortener? ipfs/ipfs-companion#305

Closed

Stebalien added the topic/security Topic security label Nov 6, 2017

whyrusleeping added the P0 Critical: Tackled by core team ASAP label Dec 31, 2017

ghost mentioned this issue Feb 13, 2018

identity hash #4697

Closed

Kubuxu added a commit to ipfs/go-cid that referenced this issue Mar 1, 2018

Enforce the security rules of crypto-safe hashes and min length

2166e30

See: ipfs/kubo#4371

kevina pushed a commit to ipfs/go-cid that referenced this issue Jun 24, 2018

Enforce the security rules of crypto-safe hashes and min length

c4f3670

See: ipfs/kubo#4371

whyrusleeping mentioned this issue Jun 24, 2018

Only allow crypto-safe hashes to be used in CIDs ipfs/go-cid#40

Closed

kevina self-assigned this Jun 25, 2018

kevina mentioned this issue Jun 27, 2018

Ipfs v0.4.16-rc1 #5159

Merged

kevina mentioned this issue Jul 16, 2018

DO NOT MERGE update go-ipfs and deps to go-cid 0.7.22 #5243

Closed

Stebalien added P1 High: Likely tackled by core team if no one steps up and removed P0 Critical: Tackled by core team ASAP labels Sep 6, 2018

Stebalien unassigned kevina Apr 22, 2021

lidel added P3 Low: Not priority right now and removed P1 High: Likely tackled by core team if no one steps up labels Sep 15, 2021

lidel mentioned this issue Jan 28, 2022

Discourage use of broken hashes like SHA1 #8703

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Forbid non-cryptographic hashes within /ipfs namespace #4371

Forbid non-cryptographic hashes within /ipfs namespace #4371

ghost commented Nov 5, 2017 •

edited by lidel

Loading

ghost commented Nov 5, 2017

ghost commented Nov 5, 2017

lidel commented Nov 5, 2017

Stebalien commented Nov 6, 2017

magik6k commented Nov 6, 2017

Kubuxu commented Nov 6, 2017 •

edited

Loading

jbenet commented Dec 24, 2017

jbenet commented Dec 24, 2017

Kubuxu commented Jan 31, 2018

jbenet commented Feb 22, 2018

mib-kd743naq commented Feb 22, 2018

whyrusleeping commented Feb 22, 2018

mib-kd743naq commented Apr 18, 2018 •

edited

Loading

Kubuxu commented Apr 18, 2018

Kubuxu commented Apr 18, 2018

mib-kd743naq commented Apr 18, 2018

mib-kd743naq commented Apr 18, 2018

Kubuxu commented Apr 18, 2018 •

edited

Loading

mib-kd743naq commented Apr 18, 2018

Kubuxu commented Apr 18, 2018

kevina commented Jun 25, 2018 •

edited by Stebalien

Loading

whyrusleeping commented Jun 25, 2018

whyrusleeping commented Jun 25, 2018

Forbid non-cryptographic hashes within /ipfs namespace #4371

Forbid non-cryptographic hashes within /ipfs namespace #4371

Comments

ghost commented Nov 5, 2017 • edited by lidel Loading

ghost commented Nov 5, 2017

ghost commented Nov 5, 2017

lidel commented Nov 5, 2017

Stebalien commented Nov 6, 2017

magik6k commented Nov 6, 2017

Kubuxu commented Nov 6, 2017 • edited Loading

jbenet commented Dec 24, 2017

jbenet commented Dec 24, 2017

Kubuxu commented Jan 31, 2018

jbenet commented Feb 22, 2018

mib-kd743naq commented Feb 22, 2018

whyrusleeping commented Feb 22, 2018

mib-kd743naq commented Apr 18, 2018 • edited Loading

Kubuxu commented Apr 18, 2018

Kubuxu commented Apr 18, 2018

mib-kd743naq commented Apr 18, 2018

mib-kd743naq commented Apr 18, 2018

Kubuxu commented Apr 18, 2018 • edited Loading

mib-kd743naq commented Apr 18, 2018

Kubuxu commented Apr 18, 2018

kevina commented Jun 25, 2018 • edited by Stebalien Loading

whyrusleeping commented Jun 25, 2018

whyrusleeping commented Jun 25, 2018

ghost commented Nov 5, 2017 •

edited by lidel

Loading

Kubuxu commented Nov 6, 2017 •

edited

Loading

mib-kd743naq commented Apr 18, 2018 •

edited

Loading

Kubuxu commented Apr 18, 2018 •

edited

Loading

kevina commented Jun 25, 2018 •

edited by Stebalien

Loading