
Ingester readonly on startup until replay and rediscover is done to prevent broken head blocks #3358

Merged: 2 commits into grafana:main on Feb 2, 2024

Conversation

@mdisibio (Contributor) commented Feb 2, 2024

What this PR does:
This is one of those bugs that makes you wonder "how did this ever work?" This PR changes the ingester to enter a read-only state on startup (similar to shutdown) until WAL replay and local block rediscovery are complete. This prevents pushes from creating head blocks that get tangled up in (and broken by) the replay process. I really like having the startup error be different from the shutting-down error: it makes it easy to check the logs for "Ingester is starting" and verify the fix is working.
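
To illustrate the idea, here is a minimal sketch of the gate: refuse pushes while the ingester is still starting, and only flip to read-write once replay and rediscovery have finished. This is not the actual change; identifiers like ingesterState, errStarting, replayWAL, and rediscoverLocalBlocks are illustrative, not Tempo's.

```go
// Minimal sketch of gating pushes during startup. Not Tempo's actual code.
package main

import (
	"errors"
	"fmt"
	"sync/atomic"
)

type ingesterState int32

const (
	stateStarting ingesterState = iota // WAL replay / block rediscovery still in progress
	stateRunning                       // ready to accept pushes
	stateStopping                      // shutting down, already read-only today
)

var (
	errStarting = errors.New("Ingester is starting")      // new, distinct startup error
	errStopping = errors.New("Ingester is shutting down") // existing shutdown error
)

type ingester struct {
	state atomic.Int32 // holds an ingesterState; zero value is stateStarting
}

// push stands in for the PushBytes entry point: refuse writes until startup has
// finished, so no head block can be created while replay is still scanning the WAL.
func (i *ingester) push(data []byte) error {
	switch ingesterState(i.state.Load()) {
	case stateStarting:
		return errStarting
	case stateStopping:
		return errStopping
	}
	// ... append to the per-tenant head block here ...
	return nil
}

// starting runs WAL replay and local block rediscovery, then flips the
// ingester to read-write.
func (i *ingester) starting() error {
	if err := replayWAL(); err != nil {
		return err
	}
	if err := rediscoverLocalBlocks(); err != nil {
		return err
	}
	i.state.Store(int32(stateRunning))
	return nil
}

func replayWAL() error             { return nil } // placeholder
func rediscoverLocalBlocks() error { return nil } // placeholder

func main() {
	ing := &ingester{}
	fmt.Println(ing.push(nil)) // "Ingester is starting" until starting() completes
	_ = ing.starting()
	fmt.Println(ing.push(nil)) // <nil>: writes accepted after replay + rediscovery
}
```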

Description of the Bug
Here is a full sequence of the bug:

  1. An ingester unexpectedly terminates (OOM, panic, readiness probe failure, etc.)
  2. The ingester restarts and begins WAL replay
  3. Because it didn't leave the ring gracefully and propagating the new state takes some time, it still appears HEALTHY to some or all of the distributors
  4. The distributors push traffic to it while it is replaying
  5. PushBytes works (unexpectedly) and creates a head block <--- This is where we fix it
  6. The head block gets picked up by the replay process and deleted
  7. All pushes to that head block fail and the ingester never recovers

Steps to Reproduce

  1. Requires a distributed setup with separate distributors, ingesters, and replication factor >= 2
  2. While continuously pushing traffic, quickly kill and restart a single ingester (docker kill or similar)
  3. Only kill 1 ingester, so that the distributor still sees the minimum number of replicas and keeps pushing traffic
  4. Eventually this will trigger the bug

Which issue(s) this PR fixes:
Fixes #3346

Checklist

  • Tests updated
  • Documentation added
  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

@joe-elliott (Member) left a comment

so during the period of startup/shutdown there will be one ingester always failing, meaning reduced availability. we're doing it now as well, but only during shutdown.

i am concerned that during a k8s rollout of a large cluster, if one ingester is starting up and one is shutting down, we'll start failing writes.

is there any way to just not raise the healthy flag in the ring until we've done WAL replay?

@mdisibio (Contributor, Author) commented Feb 2, 2024

is there any way to just not raise the healthy flag in the ring until we've done WAL replay?

It already doesn't raise the healthy flag, but the issue is the non-zero propagation time during which the distributors continue sending traffic. We should expect to see a short burst of "Ingester is starting" errors until the distributors catch up. But I can dig more into this area and double-check things.

Edit: Wanted to add a bit more: when an ingester dies and restarts quickly, it immediately receives writes on the same IP/port, because its ring state hasn't propagated to the distributors yet. I don't think any amount of ring manipulation would fix that. We'd have to do something else, like not starting gRPC.
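
For reference, a minimal sketch of that alternative (not opening the gRPC listener until replay is done), using a generic grpc-go server rather than Tempo's actual module wiring; the port and helper names here are assumptions:

```go
// Sketch of the "don't start gRPC until replay is done" alternative. This is not
// how Tempo wires its modules; the port and the replay helpers are placeholders.
package main

import (
	"log"
	"net"

	"google.golang.org/grpc"
)

func replayWAL() error             { return nil } // placeholder
func rediscoverLocalBlocks() error { return nil } // placeholder

func main() {
	// Finish WAL replay and local block rediscovery before the port is even open,
	// so distributors that still think this ingester is HEALTHY get connection
	// errors instead of successful (and dangerous) pushes.
	if err := replayWAL(); err != nil {
		log.Fatal(err)
	}
	if err := rediscoverLocalBlocks(); err != nil {
		log.Fatal(err)
	}

	lis, err := net.Listen("tcp", ":9095") // assumed gRPC port
	if err != nil {
		log.Fatal(err)
	}
	srv := grpc.NewServer()
	// ... register the ingester's gRPC service on srv here ...
	log.Fatal(srv.Serve(lis))
}
```

The PR itself takes the simpler route of gating pushes in the handler and returning the "Ingester is starting" error instead.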

@joe-elliott (Member)

It already doesn't raise the healthy flag, but the issue is the non-zero propagation time during which the distributors continue sending traffic.

Oh, I see. It's explained in your original steps. In a normal rollout this shouldn't even occur.

@mdisibio merged commit c1f9fd9 into grafana:main on Feb 2, 2024
14 checks passed
Successfully merging this pull request may close: Ingesters can start in broken state and cannot write traces to disk for one or more tenants