
Fix unbounded vec deserialization #592

Closed

Conversation

HarukaMa
Contributor

@HarukaMa commented Jan 16, 2022

Motivation

The Vec deserialization procedure of CanonicalDeserialize didn't check whether the vector size is reasonable, which causes https://github.com/AleoHQ/snarkOS/issues/1534.

This PR limits the data size to 1GB (not counting Vec overhead), which should be enough (?).

Edit: looks like Transactions, Transition and Event are safe.
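
For reference, a minimal sketch of the kind of bound this PR introduces; the 8-byte little-endian length prefix, the `MAX_VEC_BYTES` constant, and the `u64` element type are illustrative assumptions, not snarkVM's actual CanonicalDeserialize code:

```rust
use std::io::{self, Read};
use std::mem;

// Illustrative cap mirroring the 1GB limit described above.
const MAX_VEC_BYTES: u64 = 1 << 30;

// Sketch: read a length prefix and reject absurd sizes *before* allocating.
fn deserialize_vec_bounded<R: Read>(reader: &mut R) -> io::Result<Vec<u64>> {
    let mut prefix = [0u8; 8];
    reader.read_exact(&mut prefix)?;
    let len = u64::from_le_bytes(prefix);

    // Guard against attacker-controlled lengths; saturating_mul avoids overflow.
    if len.saturating_mul(mem::size_of::<u64>() as u64) > MAX_VEC_BYTES {
        return Err(io::Error::new(io::ErrorKind::InvalidData, "vec too large"));
    }

    let mut vec = Vec::with_capacity(len as usize); // now safe to pre-allocate
    for _ in 0..len {
        let mut elem = [0u8; 8];
        reader.read_exact(&mut elem)?;
        vec.push(u64::from_le_bytes(elem));
    }
    Ok(vec)
}
```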

Test Plan

Currently the tests for CanonicalSerialize and CanonicalDeserialize only check whether the data is intact after a serialize/deserialize cycle, so I didn't add any test there.

I'm using a fake client to send bad proof data to test it. There seems to be no significant memory usage change even when I rapidly trigger fairly large allocations (several hundred MBs).


@ljedrz
Collaborator

ljedrz commented Jan 16, 2022

Thanks for the contribution! Does this fix the issue for any hand-crafted message you tested? I have a branch with panic catches around calls to Vec::with_capacity, but this spot (which was also my first target) wasn't sufficient to keep the operator from crashing - I'll share it once I'm at the PC.

@HarukaMa
Contributor Author

I've tried with different vec sizes, and when the size exceeds 1GB it errors out directly without crashing.

I guess you can't catch OOM errors on allocations, as no backtrace is printed even with the env var set. The best approach is probably not to attempt such a large allocation in the first place.

@ljedrz
Collaborator

ljedrz commented Jan 16, 2022

Agreed, a fixed boundary is the safest approach and this change looks fine; I'll leave the decision regarding the maximum size to @howardwu.

@howardwu
Contributor

howardwu commented Jan 17, 2022

Are we able to expand the maximum capacity beyond 1GB? While 1GB may seem large, it is small for our universal setup ceremony, which would need at least 10GB (if not more) for this data structure (IIRC).

Can you clarify what case is introducing a failure at 1GB in snarkOS?

@HarukaMa
Contributor Author

HarukaMa commented Jan 17, 2022

@howardwu In the serialized proof, the size of each Vec is directly encoded in the message. An attacker could then manually craft a "proof" with some absurd Vec size, like 1 trillion. Calling Vec::with_capacity with that size would attempt to allocate a very large chunk of memory, and Rust simply aborts on a failed allocation (rust-lang/rust#29802).
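
To make the failure mode concrete, here's a sketch of the vulnerable pattern (the length-prefixed wire format is an assumption for illustration, not snarkVM's exact layout):

```rust
use std::io::{self, Read};

// The dangerous pattern: trust the wire length, then pre-allocate on it.
fn deserialize_vec_unchecked<R: Read>(reader: &mut R) -> io::Result<Vec<u64>> {
    let mut prefix = [0u8; 8];
    reader.read_exact(&mut prefix)?;
    let len = u64::from_le_bytes(prefix) as usize; // attacker-controlled

    // With len = 1_000_000_000_000 this asks the allocator for ~8TB;
    // on allocation failure Rust aborts the process instead of unwinding.
    let mut vec = Vec::with_capacity(len);
    for _ in 0..len {
        let mut elem = [0u8; 8];
        reader.read_exact(&mut elem)?;
        vec.push(u64::from_le_bytes(elem));
    }
    Ok(vec)
}
```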

There might still be other ways to handle this if imposing a size limit is not that easy:

  • Use special deserialization on proofs, instead of using the generalized CanonicalDeserialize (like Transactions, which limits the Vec size to u16 and is thus not affected);
  • Don't pre-allocate memory and use with_capacity(0) instead, which might be slower due to the required reallocs (sketched below);
  • Make a custom allocator to handle situations like this.

Still, accepting arbitrary input from the network with an unchecked size is not safe and should be carefully considered.
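
A minimal sketch of the second option above (same illustrative wire format as before): read the declared length, but let the Vec grow only as elements actually arrive, so a lying prefix fails at read_exact rather than at allocation:

```rust
use std::io::{self, Read};

fn deserialize_vec_lazy<R: Read>(reader: &mut R) -> io::Result<Vec<u64>> {
    let mut prefix = [0u8; 8];
    reader.read_exact(&mut prefix)?;
    let len = u64::from_le_bytes(prefix);

    let mut vec = Vec::new(); // no pre-allocation, i.e. with_capacity(0)
    for _ in 0..len {
        // Memory now grows in step with bytes genuinely received; a fake
        // length of 1 trillion just hits EOF on the next read_exact.
        let mut elem = [0u8; 8];
        reader.read_exact(&mut elem)?;
        vec.push(u64::from_le_bytes(elem));
    }
    Ok(vec)
}
```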

@ljedrz
Collaborator

ljedrz commented Jan 17, 2022

> Don't pre-allocate memory and use with_capacity(0) instead, which might be slower due to required reallocs

This will work, as long as we still do expect to read a length (just not pre-allocate based on it). We could also use something like Vec::try_reserve (which would bump the MSRV to 1.57).
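
A sketch of the try_reserve variant (same illustrative format as above; requires Rust 1.57+):

```rust
use std::io::{self, Read};

fn deserialize_vec_try_reserve<R: Read>(reader: &mut R) -> io::Result<Vec<u64>> {
    let mut prefix = [0u8; 8];
    reader.read_exact(&mut prefix)?;
    let len = u64::from_le_bytes(prefix) as usize;

    let mut vec: Vec<u64> = Vec::new();
    // try_reserve reports allocation failure as an Err instead of aborting.
    vec.try_reserve(len)
        .map_err(|e| io::Error::new(io::ErrorKind::OutOfMemory, e.to_string()))?;
    for _ in 0..len {
        let mut elem = [0u8; 8];
        reader.read_exact(&mut elem)?;
        vec.push(u64::from_le_bytes(elem));
    }
    Ok(vec)
}
```

Note that try_reserve alone still lets a peer reserve as much memory as the machine can actually provide, so pairing it with an explicit cap would still be prudent.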

> Use special deserialization on proofs (...)

This would probably work best, even if it requires some tinkering around the overall serialization setup.

@Pratyush
Contributor

Special deserialization for proofs makes sense to me, as the number of elements in the (commitment) vec is known statically: https://github.com/AleoHQ/snarkVM/blob/435f1120b15d0d63944b9935667b607084b83cef/marlin/src/ahp/ahp.rs#L63. We can do a similar analysis for the other collections used in the proof, like those used for evaluations.
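
A hedged sketch of that idea; NUM_COMMITMENTS and the 32-byte commitment encoding are placeholders for illustration, not the actual Marlin constants:

```rust
use std::io::{self, Read};

// Placeholder for the statically known count referenced above.
const NUM_COMMITMENTS: usize = 12;

fn deserialize_commitments<R: Read>(reader: &mut R) -> io::Result<Vec<[u8; 32]>> {
    // The length never comes from the wire, so there is nothing to attack.
    let mut commitments = Vec::with_capacity(NUM_COMMITMENTS);
    for _ in 0..NUM_COMMITMENTS {
        let mut c = [0u8; 32];
        reader.read_exact(&mut c)?;
        commitments.push(c);
    }
    Ok(commitments)
}
```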

@HarukaMa
Contributor Author

Considering the proof is always 771 bytes on the network, I think the whole structure is statically known.

Feel free to supersede this PR, as I'm not sure how to correctly write a special deserializer for the proof. I'd recommend putting it on high priority though, as it's currently still possible to crash every node on the network with a specially crafted payload.

@ljedrz
Collaborator

ljedrz commented Jan 21, 2022

Until proof-specific deserialization is implemented, I proposed a more generic solution in https://github.com/AleoHQ/snarkVM/pull/609.

@ljedrz mentioned this pull request Apr 3, 2022
@howardwu
Contributor

howardwu commented Apr 3, 2022

Closing as the issue has been addressed in #735

@howardwu closed this Apr 3, 2022