-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unable to fence nodes with fence_azure_arm
agent
#90
Comments
Hey @jcanocan,
thank you for pointing this out, really appreciated!
I think
|
Thanks for answering back! I'm glad to help 😊 Regarding Azure/azure-sdk-for-python#30983 (comment). Looks like they are not motivated to make the change. Moreover, It will take some time to land. Therefore, what do you think about including the following command right after RUN sed -i 's/\"instanceView\"/expand=\"instanceView\"/' /usr/sbin/fence_azure_arm I would agree that it's not a very clean solution, just a workaround. Nevertheless, it will allow the fence agent work. |
It seems they need to propagate the request to the right people :)
We actually want to decouple the operator's image from the one containing the agents so that one could use an image with a specific fencing agent and the related quirks to make it work. |
First of all thanks Javier for noticing/raising the notion of using Azure fence agent!
Yes, how about creating a PR with the above fix to https://github.com/ClusterLabs/fence-agents/tree/main repo? They are available in their mailing list if you want to discuss about if beforehand. |
Thanks for letting me know. Sounds nice :)
Thanks for the suggestion. I misinterpreted the words in Azure/azure-sdk-for-python#30983 (comment), but I just realized that the azure fence agent is independent to the https://github.com/Azure/azure-sdk-for-python. Apologizes for the confusion. So I will try to post a PR fixing this issue in the fence agent. Meanwhile, I will learn how to build the operator locally and deploy it in an OCP cluster. |
Posted ClusterLabs/fence-agents#562. Just in case you are curious :) |
Hi!
I'm currently playing around with FAR with Azure VMs. I've been able to install NHC, FAR in an OCP 4.13 cluster, to create the FAR Template and start the remediation process. This is the FAR Template I'm currently using:
I've tried with fence_azure_arm tool standalone locally to restart a faulty VM where an OCP node is running. For that purpose, I stopped the kubelet process to bring a node to an unhealthy state, and it worked but requires a tiny modification, see: Azure/azure-sdk-for-python#30983 (comment)
Nevertheless, it is not working along with FAR operator. It throws the following errors:
It looks like FAR it's not able to find the
fence_azure_arm
tool in PATH for its purpose.Environment:
Thanks in advance!
The text was updated successfully, but these errors were encountered: