Support dynamically setting multiple failpoints #38

ahrtr · 2022-11-22T01:58:53Z

Users can dynamically setting multiple failpoints using endpoint /failpoints, see example below,

$ curl http://127.0.0.1:22381/failpoints -X PUT -d'failpoint1=return("hello");failpoint2=sleep(10)'

Please review this PR commit by commit.

@spzala @serathius @ramil600

ahrtr · 2022-11-22T02:22:28Z

cc @ptabor as well

ahrtr · 2022-11-22T19:05:15Z

Related to #38

It isn't safe to lock each failpoint. The server might panic before sending response back to client. For example, when a user set a terms for failpoint A, but the server might be panicking due to failpoint B. So it isn't safe to lock each single failpoint once a time. Instead, we should lock the global mutex. When each failpoint is triggered, it just needs to acquire the read lock. But when a client set terms for failpoints, they need to acquire the write lock, and release the lock after the server(runtime) sends response back to the client. Signed-off-by: Benjamin Wang <[email protected]>

When the server(runtime) processes a http request, it acquires the global write lock. This prevents all failpoints from being triggered. It ensures the server(runtime) doesn't panic due to any failpoints during processing the HTTP request. It may be inefficient, but correctness is more important than efficiency. Usually users will not enable too many failpoints at a time, so it (the efficiency) isn't a problem. Example: endpoint: /failpoints body: failpoint1=return("hello");failpoint2=sleep(10) Signed-off-by: Benjamin Wang <[email protected]>

Signed-off-by: Benjamin Wang <[email protected]>

runtime/http.go

runtime/runtime.go

runtime/runtime_test.go

Signed-off-by: Benjamin Wang <[email protected]>

ahrtr · 2022-11-25T10:27:25Z

Thanks @serathius . Resolved all your comments. PTAL.

serathius · 2022-11-25T11:13:53Z

runtime/runtime.go

+	if len(fps) == 0 {
+		return fpMap, nil
+	}


Note: Don't think this is needed.

If we remove this, When users pass an empty string, then it will return error "bad failpoint xxx".

If we don't check the fps, then users are supposed to always pass a non-empty string. It seems like not a big deal, so let's keep it as it's for now.

runtime/http.go

runtime/runtime.go

Signed-off-by: Benjamin Wang <[email protected]>

ahrtr added 3 commits November 25, 2022 04:26

update design doc to cover the examples of setting multiple failpoints

1337baf

Signed-off-by: Benjamin Wang <[email protected]>

ahrtr force-pushed the dynamical_multiple_failpoints_20221122 branch from b475efa to 1337baf Compare November 24, 2022 21:24

ahrtr mentioned this pull request Nov 24, 2022

failpoint delete may block on "defer Release" #14

Closed