Newbie OKD install issues #1974
Replies: 11 comments
-
I would try curling ports 22623 (machine config server) and 6443 (API server) on the bootstrap node. If you can't curl those, then I would check the bootstrap node journal for indications of why it's not progressing. The control plane nodes require the machine config server and API server to be up on the bootstrap node before an install will progress.
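If curl is not handy on the machine you are testing from, a rough equivalent of the port check can be sketched in Python. This is only a minimal sketch: it tests whether a TCP connection is accepted, which is weaker than a real curl request (no TLS or HTTP is exchanged). The function names and the port labels are made up for illustration.

```python
import socket
from typing import Dict

# Ports named in the reply above: 6443 (API server) and
# 22623 (machine config server). The labels are illustrative only.
BOOTSTRAP_PORTS = {"api-server": 6443, "machine-config-server": 22623}


def port_is_open(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port is accepted within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and DNS lookup failures.
        return False


def check_bootstrap(host: str, timeout: float = 3.0) -> Dict[str, bool]:
    """Probe both bootstrap ports and report which ones accept connections."""
    return {
        name: port_is_open(host, port, timeout)
        for name, port in BOOTSTRAP_PORTS.items()
    }
```

Calling `check_bootstrap("<bootstrap hostname or IP>")` from a control plane node and from the HAProxy host should show whether the refusal is specific to one network path. A `False` result only says the connection was not accepted; the node journal is still the place to find out why.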
-
So looking at the logs and curl commands, I see the following:
I am able to curl from the bootstrap node but not the control-plane0 node. I'm not sure what to check or where to check for that. On the bootstrap node I see this in the
HAProxy is showing that it can connect to the bootstrap node but not control-plane0.
-
So the bootstrap is up and serving the MCS and the API, that's good. Can you paste a screenshot from the console on one of the control plane nodes? Also, if you happen to have a log bundle from the failed install, that would be helpful as well.
-
Not sure what logs would be helpful, but it seems I keep going backwards. Now my bootstrap node is not bringing up the API server, and I'm seeing the following in the kubelet.service journal:
-
These are the logs that are needed, if you can get them:
-
Here are the logs from the bootstrap node and the one current master.
-
So the main issue I am seeing in the logs and journal currently is
I have gathered logs for analysis:
-
You very possibly have old certificates based on the expiration warning above. I would recommend you delete the current directory where your ignition files are being generated and try again. Stale content can remain in that directory between installation attempts.
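One quick way to spot stale content before retrying is to look at how old the generated `.ign` files are. The sketch below assumes the certificates embedded in the ignition configs are only good for roughly 24 hours after generation, so treat that threshold as an assumption to adjust. The function name is hypothetical, and file modification time is used as a stand-in for generation time.

```python
import time
from pathlib import Path
from typing import List, Optional

# Assumption: certificates embedded in ignition configs are valid for
# roughly 24 hours after the files are generated.
MAX_AGE_HOURS = 24.0


def stale_ignition_files(
    install_dir: str,
    max_age_hours: float = MAX_AGE_HOURS,
    now: Optional[float] = None,
) -> List[Path]:
    """Return the *.ign files in install_dir whose mtime is older than max_age_hours."""
    now = time.time() if now is None else now
    cutoff = now - max_age_hours * 3600
    return sorted(
        path
        for path in Path(install_dir).glob("*.ign")
        if path.stat().st_mtime < cutoff
    )
```

If this returns anything, or if the directory was reused from an earlier attempt, starting again from an empty directory as suggested above is the safer route. Hidden state files left in the directory can otherwise carry over into the next run.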
-
OK, I got the cert issue resolved and now the cluster is doing something. I see errors on control-plane0 as follows:
However, this node is able to communicate and is presenting an API that HAProxy is able to reach. Oddly, it's also reporting that this node is not ready while the other two are. I'm not sure what's going on with that.
I have collected logs that may help:
-
I suspect there may be an overcommitment or storage issue. etcd is taking over 10s to apply a read request (it should be under 200ms). I think your cluster will eventually finish installing if you just wait, but it probably won't be as responsive as you'd like.
{"level":"warn","ts":"2024-07-18T19:39:06.458793Z","caller":"etcdserver/util.go:170","msg":"apply request took too long","took":"10.669075005s","expected-duration":"200ms","prefix":"read-only range ","request":"key:"/kubernetes.io/console.openshift.io/consolesamples/" range_end:"/kubernetes.io/console.openshift.io/consolesamples0" limit:10000 ","response":"","error":"etcdserver: leader changed"}
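To get a feel for how often this happens, the `took` and `expected-duration` fields of each etcd log line can be compared. Here is a minimal Python sketch, assuming log lines shaped like the one quoted above. It uses regular expressions instead of `json.loads` because the line as pasted contains unescaped quotes inside `request`. The helper names are made up for illustration.

```python
import re
from typing import Optional, Tuple

# Seconds per unit for Go-style duration strings such as "200ms".
_UNITS = {"ns": 1e-9, "us": 1e-6, "µs": 1e-6, "ms": 1e-3,
          "s": 1.0, "m": 60.0, "h": 3600.0}
_PART = re.compile(r"(\d+(?:\.\d+)?)(ns|us|µs|ms|s|m|h)")
_TOOK = re.compile(r'"took":"([^"]+)"')
_EXPECTED = re.compile(r'"expected-duration":"([^"]+)"')


def parse_duration(text: str) -> float:
    """Convert a duration like '10.669075005s', '200ms' or '1m30s' to seconds."""
    parts = _PART.findall(text)
    # Reject input that is not made up entirely of number+unit pairs.
    if not parts or "".join(num + unit for num, unit in parts) != text:
        raise ValueError(f"unrecognised duration: {text!r}")
    return sum(float(num) * _UNITS[unit] for num, unit in parts)


def slow_request(line: str) -> Optional[Tuple[float, float]]:
    """Return (took, expected) in seconds if the line reports a slow request."""
    took = _TOOK.search(line)
    expected = _EXPECTED.search(line)
    if not took or not expected:
        return None
    took_s = parse_duration(took.group(1))
    expected_s = parse_duration(expected.group(1))
    return (took_s, expected_s) if took_s > expected_s else None
```

Running `slow_request` over every line of the etcd container log and counting the non-`None` results gives a rough picture of whether the slowness is constant or only appears around leader changes.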
-
So it looks like it's been here for the past hour. I'm not sure whether there is something I can do to help clean up these errors and help the installer move on:
-
I am trying to install OKD on top of libvirt in my home lab. However, I keep running into errors where, once the networks are created on the control-plane nodes, HAProxy reports connection refused from the node. I am not sure what to troubleshoot from the control-plane side to see why connections are being refused. DNS is resolving, and I can ping the host from the bootstrap node and other nodes.
haproxy.txt