Use mmap on MARF connections #2900

jcnelson · 2021-10-26T02:26:26Z

This PR against develop activates the mmap pragma on all sqlite connections. In support of this, this PR also vendors the
ctrlc crate we had been using, and extends it to handle a SIGBUS signal.

Sqlite may trigger SIGBUS signals if the underlying database file is mmap'ed and becomes unavailable at runtime (e.g. suppose it's on a network drive and the network goes down). SIGBUS is only triggered on an attempt to read the unavailable file; an attempt to write to an invalid address that was mapped will (correctly) trigger a SIGSEGV and lead to a crash. This PR makes it so that the node treats SIGBUS like SIGTERM, SIGINT, and SIGHUP -- it triggers a graceful shutdown.

I'm open to making the node simply crash with a panic as well. In fact, I think that it would be preferable if the process synchronously terminated on SIGBUS. But, I'd like confirmation that this is desired before making this happen (since it can lead to chainstate corruption).

I chose to vendor ctrlc because (a) it's a pretty stable crate at this point -- it's only been receiving PRs to update dependency versions -- and (b) it's very simple, especially compared to alternative signal handler crates, and it already does 99% of what we need.

… signal handler report the signal received to the user-supplied callback.

…trlc crate's platform-specific deps

…ng -- this is an inevitable consequence of having multiple runloops

gregorycoppola · 2021-10-26T12:14:28Z

Hey Jude.. Thanks for a fast change.

How are you testing this?

src/util/db.rs

jcnelson · 2021-10-26T15:55:07Z

So, there is one thing that warrants further investigation here -- a thread that triggers SIGBUS needs to be terminated immediately. Not at a rendezvous or cancellation point, but at the CPU instruction which caused SIGBUS. This is because SIGBUS is triggered by an unaligned memory access, or in mmap(2)'s case, an attempt to read from a file-backed page whose underlying file data is absent. This can arise if you truncate the file while it's mmap(2)ed, or if the file is deleted, or rendered unavailable through a network partition. In all cases, it's Bad News Bears -- something is irrevocably corrupt. The only point of trying to exit gracefully is to avoid corrupting the other databases, so we can do a postmortem.

The reason we have to be this strict is because of how SIGBUS works. When the hardware raises the page fault to the kernel, the kernel will suspend the executing task at the offending CPU instruction, set up and run the signal handler right then and there (i.e. SIGBUS is handled synchronously in the program execution), and on signal handler return, attempt to re-run the offending CPU instruction. So if the thread's execution isn't terminated immediately, we'd be setting ourselves up for an infinite loop -- the offending thread will re-attempt to load data from an unbacked page, triggering another SIGBUS, causing the signal handler to run and exit, causing the same offending instruction to be re-run, over and over forever. The offending thread will never reach a cancellation point, nor will an attempt to join with it work. So, while the other threads will gracefully terminate, the offending thread never will.

There are a couple ways we can address this:

We can represent the halting state of the threads in some kind of async-safe global that the signal handler can inspect, so that once the other non-SIGBUS-triggering threads have all died, the signal handler can abort(2).
We can keep track of the file handles to all sqlite databases, and on SIGBUS, we execve(2) the same process image but with an environment variable set that causes main() to gracefully close the sqlite databases before halting.

Let me know if not crashing-and-burning is still something we want to do here.

kantai · 2021-10-26T16:05:14Z

Let me know if not crashing-and-burning is still something we want to do here.

I think crash-and-burn is the preferred behavior here. If possible, though, the node operator should be able to figure out that there was a system I/O error that led to the crash.

In terms of testing, I think this PR needs two things:

Some amount of testing of the signal handlers -- the vendoring of ctrlc is fine, but we need to test it, both the original tests, and the modifications.
It looks like the vendoring of ctrlc broke the libclarity build. That needs to be fixed.

gregorycoppola · 2021-10-26T17:00:51Z

Hey @jcnelson , I see I was re-added for review, but my question last time was about the testing plan.

Are you able to recreate these problem cases by interacting with the server? Or in tests?

gregorycoppola · 2021-10-26T17:02:18Z

Oh whoops.. I thought I was re-added but I guess GitHub just leaves me with the "yellow dot" status if I just add comments at the bottom. :/ Never mind my last comment.

gregorycoppola

Just to try this again (sorry for the spam)..

I'm wondering how we are going to test this.

jcnelson · 2021-10-26T17:06:19Z

Some amount of testing of the signal handlers -- the vendoring of ctrlc is fine, but we need to test it, both the original tests, and the modifications. @kantai

I'm unaware of a safe way to test signal handlers that doesn't also break the test runner, but I'll try. I could make it so that there's test-specific paths in the signal handler that causes it to just set a global somewhere instead of crashing the process, but that doesn't really test the "crash-and-burn" property.

…ibc methods for recording that a signal has been caught.

jcnelson · 2021-10-26T19:03:39Z

Okay, I updated the tests to run the original ctrlc tests. But, we can only set the signal handler once in the test runner's execution lifetime, so I think we should just leave it at that. I've manually tested that the node will print out what kind of signal caused it to die.

…ds in ctrlc

…ly uses a different ABI for write(2)

kantai

LGTM!

Before merging, can you add an entry to the CHANGELOG.md?

jcnelson · 2021-10-27T16:29:48Z

Thanks; added.

gregorycoppola

thanks for the change!

jcnelson added 9 commits October 25, 2021 22:16

feat: vendor ctrlc package and add support for SIGBUS. Also, have the…

2a1b5bf

… signal handler report the signal received to the user-supplied callback.

feat: add nix crate (to support vendored ctrlc package) and add the c…

d202d26

…trlc crate's platform-specific deps

refactor: db_mkdirs() now only returns one argument

c55a6e4

chore: give credit where credit is due for ctrlc

fd962c3

chore: expose ctrlc vendored package

c76a7f5

feat: expose ctrlc crate, with OS-specific packages

4cef87b

feat: use mmap by default in sqlite connections (256MB)

36b2743

chore: no longer need ctrlc

0511532

refactor: use deps/ctrlc now

4ef95f9

jcnelson requested review from kantai, gregorycoppola and pavitthrap October 26, 2021 02:30

jcnelson added 4 commits October 25, 2021 23:32

fix: update docstrings to use deps::ctrlc

ede6334

fix: use move in docstring and use new arg

0cde87c

fix: add unix and windows deps for vendored ctrlc package to libclarity

cef19e6

fix: don't panic if we set a signal handler multiple times when testi…

9943a4b

…ng -- this is an inevitable consequence of having multiple runloops

kantai reviewed Oct 26, 2021

View reviewed changes

src/util/db.rs Outdated Show resolved Hide resolved

kantai linked an issue Oct 26, 2021 that may be closed by this pull request

Use MMAP for SQLite MARF connections #2869

Closed

gregorycoppola reviewed Oct 26, 2021

View reviewed changes

jcnelson added 5 commits October 26, 2021 13:55

refactor: only MARF databases will use mmap

88185e0

refactor: expose test module

c5a276f

refactor: run original ctrlc tests as a unit test

1e1d7c9

fix: don't mmap on sqlite_open()

13eb05e

fix: crash and burn on SIGBUS, and use async-safe (but rust unsafe) l…

0a46823

…ibc methods for recording that a signal has been caught.

jcnelson added 2 commits October 26, 2021 15:02

fix: use proper libc types

34f2cbf

chore: new Cargo lockfile

7062a02

jcnelson added 4 commits October 26, 2021 15:59

chore: remove needless docstring example

684e8c9

fix: have clarity target use nix and winapi for platform-specific nee…

1ff6075

…ds in ctrlc

chore: updated cargo.lock

7403620

fix: windows-specific call to libc::write, because windows inexplicab…

0ae09f8

…ly uses a different ABI for write(2)

jcnelson requested review from gregorycoppola and kantai October 27, 2021 04:46

kantai approved these changes Oct 27, 2021

View reviewed changes

chore: add CHANGELOG entry about mmap'ed connections

8303d22

gregorycoppola approved these changes Oct 27, 2021

View reviewed changes

jcnelson merged commit 75f6328 into develop Oct 27, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use mmap on MARF connections #2900

Use mmap on MARF connections #2900

jcnelson commented Oct 26, 2021 •

edited

Loading

gregorycoppola commented Oct 26, 2021

jcnelson commented Oct 26, 2021 •

edited

Loading

kantai commented Oct 26, 2021

gregorycoppola commented Oct 26, 2021

gregorycoppola commented Oct 26, 2021 •

edited

Loading

gregorycoppola left a comment

jcnelson commented Oct 26, 2021

jcnelson commented Oct 26, 2021

kantai left a comment

jcnelson commented Oct 27, 2021

gregorycoppola left a comment

Use mmap on MARF connections #2900

Use mmap on MARF connections #2900

Conversation

jcnelson commented Oct 26, 2021 • edited Loading

gregorycoppola commented Oct 26, 2021

jcnelson commented Oct 26, 2021 • edited Loading

kantai commented Oct 26, 2021

gregorycoppola commented Oct 26, 2021

gregorycoppola commented Oct 26, 2021 • edited Loading

gregorycoppola left a comment

Choose a reason for hiding this comment

jcnelson commented Oct 26, 2021

jcnelson commented Oct 26, 2021

kantai left a comment

Choose a reason for hiding this comment

jcnelson commented Oct 27, 2021

gregorycoppola left a comment

Choose a reason for hiding this comment

jcnelson commented Oct 26, 2021 •

edited

Loading

jcnelson commented Oct 26, 2021 •

edited

Loading

gregorycoppola commented Oct 26, 2021 •

edited

Loading