
Decide what to do when on task double-failure #910

Closed
brson opened this issue Sep 12, 2011 · 17 comments · Fixed by #11283
Labels
A-destructors Area: Destructors (`Drop`, …) A-runtime Area: std's runtime and "pre-main" init for handling backtraces, unwinds, stack overflows A-typesystem Area: The type system
Milestone

Comments

@brson
Contributor

brson commented Sep 12, 2011

Currently no landing pads are generated for resource destructors, so they leak when they fail. We don't have the same tricky situation that C++ has with throwing destructors, but I haven't thought through how it should work yet.

Edit: This issue has changed into what to do after a dtor fails during unwinding. The previous issue of dtor failure not working at all has been resolved.

@brson
Contributor Author

brson commented Sep 12, 2011

There's a failing test in run-fail/unwind-resource-fail.rs

@brson
Contributor Author

brson commented Sep 14, 2011

And another in run-fail/unwind-resource-fail2.rs

jruderman added a commit that referenced this issue Sep 21, 2011
@brson
Contributor Author

brson commented Nov 15, 2011

The language reference is actually quite explicit about this:

"A task may transition to the failing state at any time, due to an un-trapped signal or the evaluation of a fail expression. Once failing, a task unwinds its stack and transitions to the dead state. Unwinding the stack of a task is done by the task itself, on its own control stack. If a value with a destructor is freed during unwinding, the code for the destructor is run, also on the task's control stack. Running the destructor code causes a temporary transition to a running state, and allows the destructor code to cause any subsequent state transitions. The original task of unwinding and failing thereby may suspend temporarily, and may involve (recursive) unwinding of the stack of a failed destructor. Nonetheless, the outermost unwinding activity will continue until the stack is unwound and the task transitions to the dead state. There is no way to “recover” from task failure. Once a task has temporarily suspended its unwinding in the failing state, failure occurring from within this destructor results in hard failure. The unwinding procedure of hard failure frees resources but does not execute destructors. The original (soft) failure is still resumed at the point where it was temporarily suspended. "
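The hard/soft protocol quoted above can be modeled as a small state machine. This is purely an illustrative sketch in modern Rust; the `TaskState` enum and `on_failure` function are hypothetical names, not part of any runtime:

```rust
// Hypothetical model of the task states the reference describes:
// Running, soft-failing (normal unwinding), hard-failing (unwinding
// that frees resources but runs no destructors), and Dead.
#[derive(Debug, PartialEq, Clone, Copy)]
enum TaskState {
    Running,
    SoftFailing,
    HardFailing,
    Dead,
}

// A failure while Running begins soft unwinding. A failure raised from
// a destructor that runs during soft unwinding escalates to hard
// failure. Once failing hard (or dead), further failures change nothing.
fn on_failure(state: TaskState) -> TaskState {
    match state {
        TaskState::Running => TaskState::SoftFailing,
        TaskState::SoftFailing => TaskState::HardFailing,
        other => other,
    }
}

fn main() {
    let s = on_failure(TaskState::Running);
    assert_eq!(s, TaskState::SoftFailing);
    // A destructor failing during unwinding escalates:
    assert_eq!(on_failure(s), TaskState::HardFailing);
    println!("hard/soft escalation modeled");
}
```

The key property the spec demands is that the second transition is terminal for destructor-running purposes: there is no `HardFailing -> SoftFailing` edge, so sub-sub-failures cannot recurse.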

@graydon
Contributor

graydon commented Jul 26, 2012

Yeah. The hard/soft thing has never been implemented, as far as I know, but I think it's the best we can do: retain correctness of our memory model and try to retain correctness of user code, but not at the cost of infinite-recursion. If their dtor itself fails, just unwind that sub-failure w/o the opportunity for sub-sub-failures.

@pnkfelix
Member

It would be good to revise the title of this bug to reflect what the task is. (From the comments thus far, it sounds like the task is to implement the hard/soft failure unwinding protocol; but it might also be useful to clean up the docs here.)

@pnkfelix
Member

Not critical for 0.6; de-milestoning

@bblum
Contributor

bblum commented Jun 5, 2013

confirmed again; current code that can crash is as follows:

struct Foo { x: ~int, }

impl Drop for Foo {
    fn finalize(&self) { fail!(); }
}

fn main() {
    let _x = Foo { x: ~5, };
    fail!();
}

If my idea for an effect system goes through, we might make this sound again by simply requiring no failure in destructors, e.g. trait Drop { #[wont(Fail)] fn finalize(...); }.
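For readers following along today: the syntax above (`~int`, `finalize`) predates Rust 1.0, but the hazard survives as a panic in `Drop` during unwinding, which aborts the process. A hedged sketch of the pattern modern code uses to avoid it, checking `std::thread::panicking` before doing fallible cleanup (the `Guard` type and `task_failed_cleanly` helper are hypothetical):

```rust
use std::thread;

struct Guard;

impl Drop for Guard {
    fn drop(&mut self) {
        // A panic here while the thread is already unwinding would
        // abort the whole process, so check before panicking.
        if thread::panicking() {
            eprintln!("already unwinding; skipping fallible cleanup");
        }
    }
}

// Returns true if the spawned task panicked but the process survived,
// i.e. the destructor ran during unwinding without double-panicking.
fn task_failed_cleanly() -> bool {
    thread::spawn(|| {
        let _g = Guard;
        panic!("original failure");
    })
    .join()
    .is_err()
}

fn main() {
    assert!(task_failed_cleanly());
    println!("drop ran during unwinding without double-panicking");
}
```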

@bblum
Contributor

bblum commented Jun 11, 2013

nominating for well-defined milestone

@graydon
Contributor

graydon commented Jun 13, 2013

there is language about this in the manual already in the task section, so I think it's well defined already. just not implemented. feature completeness issue.

@graydon
Contributor

graydon commented Jun 13, 2013

accepted for feature-complete milestone

@huonw
Member

huonw commented Sep 10, 2013

Triage: the updated version of @bblum's example still crashes:

struct Foo { x: ~int, }

impl Drop for Foo {
    fn drop(&self) { fail!(); }
}

fn main() {
    let _x = Foo { x: ~5, };
    fail!();
}
task <unnamed> failed at 'explicit failure', 910.rs:9
task <unnamed> failed at 'explicit failure', 910.rs:4



fatal runtime error: unwinding again

@catamorphism
Contributor

1.0 backcompat

@alexcrichton
Member

This applies not only to destructors failing while failing; it also appears to apply to linked failure. A program may contain only one fail!() statement yet still leak due to linked failure. An example of this would be:

use std::rt::io::timer;

struct A {
  b: B,
}

struct B {
  foo: int,
}

impl Drop for A {
  fn drop(&mut self) {
    timer::sleep(50);
  }
}

impl Drop for B {
  fn drop(&mut self) {
    println!("dropping b\n");
  }
}

fn main() {
  do spawn {
    let _a = A { b: B { foo: 3 } };
  }
  fail!()
}

The destructor for B never runs in this program because the call to timer::sleep invokes scheduler business which will realize that it needs to fail due to linked failure. It seems odd to me that if a task only contains one source of failure then we can still leak.

This specific use case may not entirely fall under this issue, but it seems fairly serious.
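As a historical footnote, linked failure was removed before Rust 1.0; in today's Rust a child thread's destructors run to completion regardless of what its parent does. A rough modern rendering of the example's intent (the `B` type and `child_dtor_ran` helper are hypothetical; the destructor reports via a channel instead of printing):

```rust
use std::sync::mpsc;
use std::thread;
use std::time::Duration;

// Stand-in for the original B: its destructor reports that it ran by
// sending on a channel.
struct B {
    tx: mpsc::Sender<&'static str>,
}

impl Drop for B {
    fn drop(&mut self) {
        let _ = self.tx.send("dropped b");
    }
}

// Spawn a child that owns a B, sleep as in the original example, and
// report whether B's destructor ran once the child finished.
fn child_dtor_ran() -> bool {
    let (tx, rx) = mpsc::channel();
    thread::spawn(move || {
        let _b = B { tx };
        thread::sleep(Duration::from_millis(50));
    })
    .join()
    .unwrap();
    rx.recv() == Ok("dropped b")
}

fn main() {
    assert!(child_dtor_ran());
    println!("child's destructors ran regardless of the parent");
}
```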

@bblum
Contributor

bblum commented Oct 25, 2013

The scheduler does have logic for not failing from linked failure when a task is already failing. So if, for example, the sleeping destructor here were to happen during unwinding, I think it should be fine. But if it happens normally you could still get failure.

@brson
Contributor Author

brson commented Jan 3, 2014

I've updated the title to reflect what we're currently talking about.

My understanding is that it is just impossible to throw from a landing pad because it will potentially corrupt the unwinder and our three options are:

  • abort - this is what C++ does
  • resume unwinding without completing the landing pad - this is what Ada does and we haven't discussed this option
  • abort the task without completing unwinding - this is "hard" failure we've talked about

The "hard" failure is made more difficult by tasks that are not simply green threads. For example, we have use cases where we want to run in a 'task context', on the current thread, and then be told whether the task succeeded or failed, all on the same thread. There's no way in that scenario to just terminate the task cleanly.

My current opinion is that, for 1.0 we should take the most conservative approach and abort. For future versions we should consider trying to do what Ada does and skip a single landing pad, but still proceed to catch the exception.
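The same-thread "task context" use case described above roughly corresponds to what later shipped as `std::panic::catch_unwind`: run a closure on the current thread and observe whether it failed, without tearing the thread down. A minimal sketch (`run_task` is a hypothetical helper; the default panic hook will still print the panic message to stderr):

```rust
use std::panic;

// Run a closure as a "task" on the current thread and report whether
// it completed without panicking.
fn run_task<F: FnOnce() + panic::UnwindSafe>(f: F) -> bool {
    panic::catch_unwind(f).is_ok()
}

fn main() {
    assert!(run_task(|| ()));
    assert!(!run_task(|| panic!("task failed")));
    println!("caller observed both outcomes on the same thread");
}
```

Note that even with `catch_unwind`, a panic that escapes a destructor during unwinding still aborts: the conservative choice discussed here is what stuck.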

brson added a commit to brson/rust that referenced this issue Jan 3, 2014
Previously this was an rtabort!, indicating a runtime bug. Promote
this to a more intentional abort and print a (slightly) more
informative error message.

Can't test this since our test suite can't handle an abort exit.
@bblum
Contributor

bblum commented Jan 3, 2014

I agree with double-abort for 1.0.

Thinking about the other two schemes in terms of how RWARCs (or whatever they're called these days) behave, aborting just a single task strikes me as risky in terms of unexpected behaviour befalling the user. If a double-failure occurs inside an access to a RWARC, it won't be automatically unlocked during unwinding, and other contending tasks will block forever.

I think that resuming the unwinding and skipping the one landing pad still makes sense, though. If double-failure occurs in a user-provided destructor (or in the destructor of some buggy library), skipping the rest of the destructor can only "surprise" other code directly associated with the faulty destructor.

@emberian
Member

emberian commented Jan 4, 2014

I agree too.

@bors bors closed this as completed in 239fb1f Jan 4, 2014
8 participants