Fix scale down timing bug #90

Merged 3 commits on Nov 1, 2021

@spilchen (Collaborator) commented Nov 1, 2021

This fixes a timing issue during scale down that can corrupt admintools.conf.

It can occur if, while ‘admintools -t db_remove_node’ is still running, another change is made to the VerticaDB CR to scale down again. For instance, suppose we patch the CR to scale a subcluster from 3 to 2. The operator invokes ‘admintools -t db_remove_node’. We hit the issue if, before admintools returns, the VerticaDB is patched again to scale the subcluster from 2 to 1.

If you hit this problem, you will see a message like this in the operator log when it tries admintools commands:

Error in /opt/vertica/config/admintools.conf?: No option 'v_verticadb_node0005' in section: 'Nodes'

The fix is to requeue the uninstall if we detect that another scale down has occurred. This forces us to call ‘admintools -t db_remove_node’ for the second scale down before we drive the uninstall logic.
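
As a rough illustration of the requeue idea, here is a minimal, self-contained Go sketch. It is not the operator's actual code: the `Subcluster` and `Result` types, their fields, and `reconcileUninstall` are hypothetical names invented for this example. It assumes the uninstall step can compare the size currently requested in the CR against the number of nodes still in the database.

```go
package main

import "fmt"

// Result is a hypothetical stand-in for a reconcile result.
// Requeue asks the caller to run the reconcile again later.
type Result struct {
	Requeue bool
}

// Subcluster is a hypothetical, trimmed-down view of one subcluster.
type Subcluster struct {
	DesiredSize    int // size currently requested in the CR
	DBNodeCount    int // nodes still part of the database
	InstalledCount int // pods that still have an install
}

// reconcileUninstall returns the pod indexes that are safe to uninstall.
// If the database still has more nodes than the CR asks for, a scale down
// has happened whose db_remove_node has not run yet, so the uninstall is
// requeued instead of being driven now.
func reconcileUninstall(sc Subcluster) (Result, []int) {
	if sc.DBNodeCount > sc.DesiredSize {
		return Result{Requeue: true}, nil
	}
	var pods []int
	for i := sc.DesiredSize; i < sc.InstalledCount; i++ {
		pods = append(pods, i)
	}
	return Result{}, pods
}

func main() {
	// Second patch (2 -> 1) arrived while the first db_remove_node (3 -> 2) ran.
	res, pods := reconcileUninstall(Subcluster{DesiredSize: 1, DBNodeCount: 2, InstalledCount: 3})
	fmt.Printf("requeue=%v uninstall=%v\n", res.Requeue, pods)

	// After db_remove_node has run for the second scale down.
	res, pods = reconcileUninstall(Subcluster{DesiredSize: 1, DBNodeCount: 1, InstalledCount: 3})
	fmt.Printf("requeue=%v uninstall=%v\n", res.Requeue, pods)
}
```

In the first call the sketch requeues rather than uninstalling, which mirrors the ordering described above: node removal for the second scale down has to complete before uninstall proceeds.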

@spilchen spilchen merged commit 0336847 into vertica:main Nov 1, 2021
@spilchen spilchen deleted the scale-down-fix branch November 1, 2021 17:07