Propagate panics harder #14

alexcrichton · 2016-09-30T22:11:03Z

Once a Connection has panicked in I/O it's effectively poisoned and we
shouldn't come back to it (due to a lack of UnwindSafe bound). Set a
flag on Connection and bail early before we access the stream.

sfackler · 2016-09-30T22:12:52Z

I don't think this is necessary. We rethrow the panic, so from the perspective of a user of this crate, it's as if the catch_unwind didn't exist. Is there something I'm not thinking of?

alexcrichton · 2016-09-30T22:13:57Z

Oh, sorry, to clarify, this is fixing the SIGILL on travis

It looks like SecureTransport writes twice before returning back for the panic to get propagated? Somehow the runtime is detecting a double panic.

sfackler · 2016-10-01T11:04:16Z

The SIGILL is a thing that's weird, but it seems to me like some kind of issue with libstd. IIRC it popped up after some of those optimizations made to the panic runtime a couple weeks ago.

Specifically, it shouldn't matter if secure transport tries to write twice, since we catch the panic each time. If the panic runtime isn't decrementing the panic count when a panic is caught, that seems bad.

alexcrichton · 2016-10-03T17:22:39Z

The change in question was likely rust-lang/rust#34866, but I disagree that this change isn't needed. With panic safety we shouldn't come back to call write again, but what's happening here appears to be:

The SslStream::handshake method is called
Transitively, the write_func callback is called
This panics, returning errSecIO
The panic is then propagated on the other side of the handshake
As part of the panic, the SslStream is destroyed
SSLClose is called by the destructor
Transitively write_func is called again

The ud2 instruction happens because this is legitimately a double panic after #34866. Catching a panic while panicking isn't allowed (which is what's happening here). Moreover, I think it's also invalid to go back to the stream and try to write again after it's panicked, the purpose of the catch_unwind + propagation is to avoid that, right?

sfackler · 2016-10-03T17:59:05Z

Ah, that does make sense. I think I'd kind of prefer to only have the Drop impl check the poison state rather than everything, under the rationale that if the user wants to call write after a panic, that's up to them. Does that make sense?

We could alternatively not call SSLClose in Drop at all. It won't work in a nonblocking context and it might make more sense to have people opt-into an orderly session shutdown rather than doing it by default. IIRC rust-openssl doesn't do a shutdown in its destructor, but schannel-rs does :(. We should figure out a consistent story here and make sure everything's doing the same thing.

alexcrichton · 2016-10-03T18:36:03Z

Ah yeah I'd imagine that Drop would opportunistically do I/O if possible (like BufWriter) but ignore all errors (including would block)

alexcrichton · 2016-10-03T18:54:09Z

Ok, updated a bit with an assert!(!self.panicked) in the C callbacks and a guard against calling those functions elsewhere.

Is that what you're thinking?

sfackler · 2016-10-03T18:58:19Z

Let's cut the change to get_ref, get_mut, read, write, and flush, since it seems fine to let the user explicitly poke at things after a panic if they want to. That would also require the asserts in the C callbacks to go away, which is probably a good idea anyway since we don't want to unwind through C.

alexcrichton · 2016-10-03T23:35:19Z

Hm right yeah that's true, I think that with propagation it's ok to re-poke. The C side though was intended to abort quickly b/c it's a bug to reenter. Would you prefer though if they just immediately returned errSecIO?

sfackler · 2016-10-03T23:47:09Z

It's only a bug if we do it without the user's permission (i.e. the Drop use case). If the user wants to start calling read/write again, that seems like a thing that should work.

Once a `Connection` has panicked in I/O it's effectively poisoned and we shouldn't come back to it in the destructor, so skip `SSLClose` in this case.

alexcrichton · 2016-10-04T00:02:13Z

Ok, updated. Slowly getting my head straight around this again...

sfackler · 2016-10-04T00:03:42Z

Thanks!

alexcrichton force-pushed the fix-panics branch from 6d8a3bd to cc8ab4f Compare October 3, 2016 18:53

Don't SSLClose a panicked connection

e2f0dc8

Once a `Connection` has panicked in I/O it's effectively poisoned and we shouldn't come back to it in the destructor, so skip `SSLClose` in this case.

alexcrichton force-pushed the fix-panics branch from cc8ab4f to e2f0dc8 Compare October 3, 2016 23:50

sfackler merged commit 18f0f3c into kornelski:master Oct 4, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Propagate panics harder #14

Propagate panics harder #14

alexcrichton commented Sep 30, 2016

sfackler commented Sep 30, 2016

alexcrichton commented Sep 30, 2016

sfackler commented Oct 1, 2016

alexcrichton commented Oct 3, 2016

sfackler commented Oct 3, 2016

alexcrichton commented Oct 3, 2016

alexcrichton commented Oct 3, 2016

sfackler commented Oct 3, 2016

alexcrichton commented Oct 3, 2016

sfackler commented Oct 3, 2016

alexcrichton commented Oct 4, 2016

sfackler commented Oct 4, 2016

Propagate panics harder #14

Propagate panics harder #14

Conversation

alexcrichton commented Sep 30, 2016

sfackler commented Sep 30, 2016

alexcrichton commented Sep 30, 2016

sfackler commented Oct 1, 2016

alexcrichton commented Oct 3, 2016

sfackler commented Oct 3, 2016

alexcrichton commented Oct 3, 2016

alexcrichton commented Oct 3, 2016

sfackler commented Oct 3, 2016

alexcrichton commented Oct 3, 2016

sfackler commented Oct 3, 2016

alexcrichton commented Oct 4, 2016

sfackler commented Oct 4, 2016