Add ScheduleOnly initPolicy #59

spilchen · 2021-09-21T13:21:49Z

This introduces a new initPolicy called ScheduleOnly. The bootstrap of the database, either create_db or revive_db, is not handled. Use this policy when you have a vertica cluster running outside of Kubernetes and you want to provision new nodes to run inside Kubernetes.

Most of the automation is disabled when running in this mode. The only automation that is done is attempting to restart any down pods with 'admintools -t restart_node'. The user is resonsible for adding the pods as nodes to a vertica cluster (update_vertica), adding them to a database (admintools -t db_add_node), and handling any restart of the cluster (admintools -t re_ip/start_db).

Here is a sample CR:

apiVersion: vertica.com/v1beta1
kind: VerticaDB
metadata:
  name: sample
spec:
  initPolicy: ScheduleOnly
  subclusters:
    - name: sc1
      size: 3
    - name: sc2
      size: 3

Notice that the entire .spec.communal section is omitted.

The number of pods that are created is dictated by the size of each subcluster. However, subclusters aren't added by the operator. We group by subcluster to control the name of each of the pod. The actual subcluster the pod is part of does not have to match the name in the CR.

ningdeng · 2021-09-22T13:59:52Z

pkg/controllers/podfacts.go

-				pf.hasStaleAdmintoolsConf = true
+		// We can't reliably set compat21NodeName because the operator didn't
+		// originate the install.  We will intentionally leave that blank.
+		pf.compat21NodeName = ""


No change required, just trying to confirm my understanding of the functionalities: does this mean re_ip will not be automated on the schedule only pods (Edit: I was asking about if there could be a case where some pods are managed by the operator starting from installation while some are not, but since there can be only one init policy so I think the behavior is consistent in terms of that either the operator manages the entire cluster from the very beginning or the operator only cares about schedule only pods, so there's no case of some pods being managed by the operators starting from installation but some are not). I recall compat21NodeName is used for re_ip when db is down but would like to double check if my memory is correct.

nvm, I saw the changes in restart_reconcile and I think my question is answered.

spilchen · 2021-09-22T18:25:04Z

Thanks for taking a look @ningdeng

Matt Spilchen added 7 commits September 20, 2021 09:20

Code drop

5413221

e2e test

7debffd

Additional fixes

4318271

Moving image pull out of kuttl-test.yaml

e5a818e

Merge branch 'main' into hybrid-k8s

a570a93

Add changie

f5479f7

Apply reviisions

25802ba

spilchen requested a review from ningdeng September 21, 2021 13:21

spilchen self-assigned this Sep 21, 2021

ningdeng reviewed Sep 22, 2021

View reviewed changes

ningdeng approved these changes Sep 22, 2021

View reviewed changes

spilchen merged commit 2fb2c89 into vertica:main Sep 22, 2021

spilchen deleted the hybrid-k8s branch September 22, 2021 18:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ScheduleOnly initPolicy #59

Add ScheduleOnly initPolicy #59

spilchen commented Sep 21, 2021

ningdeng Sep 22, 2021 •

edited

Loading

ningdeng Sep 22, 2021

spilchen commented Sep 22, 2021

Add ScheduleOnly initPolicy #59

Add ScheduleOnly initPolicy #59

Conversation

spilchen commented Sep 21, 2021

ningdeng Sep 22, 2021 • edited Loading

Choose a reason for hiding this comment

ningdeng Sep 22, 2021

Choose a reason for hiding this comment

spilchen commented Sep 22, 2021

ningdeng Sep 22, 2021 •

edited

Loading