Automated solution to back up etcd on a schedule from within the cluster #609
Replies: 4 comments 3 replies
-
Another idea worth investigating: a tekton pipeline, which can be triggered ad-hoc (or via a CronJob) |
Beta Was this translation helpful? Give feedback.
-
Also need to consider the keys for an encrypted etcd snapshot |
Beta Was this translation helpful? Give feedback.
-
Very much a work in progress and the codebase is a mess, but I'm working on two methods -- a systemd approach and a "k8s-native" approach. Both drive /usr/local/bin/cluster-backup.sh on a schedule and then managed the lifecycle of the snapshots. Works for my use cases, YMMV. |
Beta Was this translation helpful? Give feedback.
-
We manage the continuous etcd backup process with this cron one-liner on an external server. Of course this external server need ssh and optional also access via oc. |
Beta Was this translation helpful? Give feedback.
-
The current documentation on backing up the etcd cluster notes that there is a script on control plane nodes that can be used to take a snapshot of the etcd data and write it to to a local directory on the node. It does not provide any guidance or details on how to manage this backup process effectively. Since backups are basically the only recourse available in disaster recovery scenarios, they are very important to do correctly. Additionally, it is imperative that the backups be stored outside the cluster itself.
At the April 27th OKD Working Group meeting, there was some discussion of how to automate and manage this procedure. Several ideas were floated:
If there's community interest around one of these options (or perhaps another option entirely!) we can collaborate on putting something simple together to point people at when they ask about backups, or recommend all OKD users deploy into their clusters for peace of mind.
Beta Was this translation helpful? Give feedback.
All reactions