-
Notifications
You must be signed in to change notification settings - Fork 62
upgrade to kubecf-v2.7.1, scheduler pods disappeared. #1675
Comments
There are 2 issues found from above upgrade process.
with 'cc_deployment_updater: false', no "cc-deployment-updater-cc-deployment-updater" should be created in scheduler pod. https://github.com/cloudfoundry-incubator/kubecf/blob/master/chart/hooks/pre-upgrade/remove-deployment-updater-readiness.sh |
I cannot reproduce this issue; are you sure you updated cf-operator before updating kubecf, as 2.7.1 requires cf-operator 7.xx and 2.6.1 uses cf-operator 6.xx. Anyways, your patch to check that the cf-deployment-update job exists before patching looks correct, so I've approved it, but it would be good to understand the root cause of your initial problem. |
Thanks for your comments @jandubois. For the issue
I will do more test and investigation. |
when I tried reproduce the issue
|
I think we're hitting the same issue with a fresh install of 2.7.1. Post-deploy the scheduler has 0/1 pods ready. Running
And indeed when running
|
As a followup on the above, we removed the bad
so I would expect it to be removed but it appears that it is removing it enough that |
2.7.1 has an annoying bug that breaks the scheduler: cloudfoundry-incubator/kubecf#1675
for the cc_deployment_updater issue, the fix is merged #1676. |
Ah right. It would be good if a new release with that fix could be cut because as far as we can see, 2.7.1 doesn't actually work fully on a fresh install. Thanks for getting back to us @ShuangMen 🙂 |
@andy-paine There should be a new release within the next few days. |
Describe the bug
step 1. kubecf v2.6.1 installed in Kubernetes (version 1.19.5_1529)
step 2. run helm upgrade kubecf, try to upgrade kubecf to v2.7.1, with 'cc_deployment_updater: false' in file 'config/jobs.yaml'
after upgrade, check the statefulsets:
but from the cf-operator quarks-statefulset logs, there are many errors for statefulsets update, one of them is like below.
seems the statefulsets not upgrade successfully.
step 3. run helm upgrade kubecf again and check the pod status
no "scheduler" pods there.
$ k describe statefulset scheduler-z0 -n kubecf
and get below error:
check the log of cf-operator-quarks-statefulset,
To Reproduce
run helm upgrade from kubecf v2.6.1 (on Kubernetes 1.19) to kubecf-v2.7.1, then run helm upgrade again.
Expected behavior
helm upgrade successfully with all pods running well.
Environment
The text was updated successfully, but these errors were encountered: