Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SOLR EFS Backups #3862

Closed
1 task done
jbrown-xentity opened this issue Jun 15, 2022 · 7 comments
Closed
1 task done

SOLR EFS Backups #3862

jbrown-xentity opened this issue Jun 15, 2022 · 7 comments
Assignees
Labels
component/solr-service Related to Solr-as-a-Service, a brokered Solr offering component/ssb Feature

Comments

@jbrown-xentity
Copy link
Contributor

jbrown-xentity commented Jun 15, 2022

User Story

In order to have easy SOLR disaster recovery, data.gov admins wants EFS to have regular backups and a recovery process.

Acceptance Criteria

[ACs should be clearly demoable/verifiable whenever possible. Try specifying them using BDD.]

  • GIVEN backups are occurring of EFS data for SOLR
    WHEN SOLR EFS/data is lost happens
    THEN a backup of EFS can be restored \

Background

https://docs.aws.amazon.com/efs/latest/ug/awsbackup.html

https://registry.terraform.io/providers/hashicorp/aws/latest/docs/resources/efs_backup_policy

Security Considerations (required)

None, all solr data is public

Sketch

Should be adding the necessary terraform to solr brokerpack for regular backups.

Should also involve a test of restoring a backup (does not have to be a full DB/SOLR)

@nickumia-reisys
Copy link
Contributor

Proof that EFS Backups are enabled,

In AWS Backup

image

In EFS (each volume)

image

@FuhuXia
Copy link
Member

FuhuXia commented Jun 17, 2022

The default setting is daily backups with a 35-day retention period.
35-day is more than what we need. With same amount the resources, a 6-hour backup with 7-day retention make more sense, if more frequent backup does not impact solr performance.

@nickumia-reisys
Copy link
Contributor

Default settings are as follows, Backups:

  • Start within 8 hours of 0500 UTC
  • Complete within 7 days
  • Retained for 35-days
  • Run daily

Terraform does not provide native support for configuring this plan as part of EFS. However, we may be able to create a custom Backup plan that does not use the default Automatic Backups behavior. Additional testing would be necessary to configure it properly.

@FuhuXia
Copy link
Member

FuhuXia commented Jun 17, 2022

Can we tell how long did it take for one backup?

@nickumia-reisys
Copy link
Contributor

We can check the status of current backups and see how long backups took previously.

@nickumia-reisys
Copy link
Contributor

It would seem like the backups are relatively quick, the 23GB full-production backup took ~2 hours.

image

@albionzeglin-gsa
Copy link

The EFS backups are sensitive to changes that occur to files during backup. We need to confirm that either SOLR is paused during backup execution, or that SOLR is not sensitive to file inconsistencies in backup restores.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
component/solr-service Related to Solr-as-a-Service, a brokered Solr offering component/ssb Feature
Projects
Archived in project
Development

No branches or pull requests

5 participants