Proxy issue during 4.12 upgrade #1481
Comments
Did the other nodes get these configuration settings after a successful reboot? I'd expect MCO to apply the proxy settings, but it looks like there's a race in applying them. |
No, on the "without issue" workers, I don't have this file after a successful reboot. |
Experienced this issue on 5 nodes in a 10-node test cluster (same MCP). Running the rebase command manually while setting both upper- and lowercase http/https proxy environment variables would sometimes show a message about pulling the manifest and then time out with the above error; most attempts would simply time out with the same message. Upgrading another cluster in the same environment didn't have these issues. |
Sounds indeed like an MCO race. Please report this to https://issues.redhat.com/browse/OCPBUGS, component "Machine Config Operator", with a must-gather. |
We also run OKD behind a proxy, and this approach helped us update to version 4.12. @llomgui thank you. |
I ran into the same error when upgrading from 4.11.0-0.okd-2023-01-14-152430 to 4.12.0-0.okd-2023-04-16-041331. You will also see crashing coredns- and keepalived- pods on the affected node. I executed this one-liner via SSH on all nodes: Please note that you can only create http-proxy.conf during an upgrade; otherwise the directory /etc/systemd/system/rpm-ostreed.service.d/ does not exist on the node. I'm not sure whether you can also place this file before starting the upgrade. |
Caused by ostreedev/ostree-rs-ext#582, fixed in rpm-ostree 2024.2. openshift/okd-machine-os#751 should include it, but in order to update to it in a disconnected environment you'd need a workaround (see previous comment) |
The previous comment workaround won't work with the latest 4.15 version.
Credit to ostreedev/ostree-rs-ext#582 (comment) |
@llomgui @danielchristianschroeter could you check if https://github.com/okd-project/okd/releases/tag/4.15.0-0.okd-2024-02-23-163410 sets correct proxy vars for rpm-ostreed? |
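One way to run such a check on a node is to look at the environment systemd passes to the unit. This is only a minimal sketch, assuming SSH access to the node; the helper name `has_proxy_env` is hypothetical and does not come from this thread.

```shell
# Hypothetical helper (not from this thread): succeeds if the given
# "Environment=..." text contains an HTTPS proxy variable, in either
# upper or lower case.
has_proxy_env() {
  printf '%s\n' "$1" | grep -Eq '(^|[= ])(HTTPS_PROXY|https_proxy)='
}

# On a node (needs systemd, so shown here as comments only):
#   env_line="$(systemctl show rpm-ostreed.service --property=Environment)"
#   has_proxy_env "$env_line" && echo "proxy set" || echo "proxy missing"
```

An empty `Environment=` line would suggest the proxy variables never reached rpm-ostreed on that node.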
@vrutkovs It doesn't work, I still have to use the workaround above. |
Doesn't work on clean install or upgrade? |
Hello, thanks for sharing |
Hello,
During a 4.12 upgrade I had an issue with the worker upgrade (Fedora 36 to Fedora 37).
The first worker was stuck, so I checked journalctl -f and saw the following log:
Txn Rebase on /org/projectatomic/rpmostree1/fedora_coreos failed: Failed to invoke skopeo proxy method OpenImage: remote error: pinging container registry quay.io: Get "https://quay.io/v2/": dial tcp 54.163.152.191:443: i/o timeout
The cluster is behind a company proxy. So it should not try to get the package directly.
On some workers, the solution was to create this file:
sudo vi /etc/systemd/system/rpm-ostreed.service.d/http-proxy.conf
sudo systemctl daemon-reload
sudo systemctl restart rpm-ostreed.service
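The contents of the file are not shown in this thread. As an illustration only, a systemd drop-in that passes proxy variables to a service usually consists of a `[Service]` section with `Environment=` lines. The sketch below prints such a file; the function name `render_proxy_dropin`, the proxy URL, and the no-proxy list are all hypothetical placeholders, not the values used in this cluster.

```shell
# Hypothetical helper (an assumption, not the file from this thread):
# prints a systemd drop-in that sets proxy variables for a service.
#   $1 = proxy URL (placeholder), $2 = comma-separated NO_PROXY list
render_proxy_dropin() {
  proxy_url="$1"
  no_proxy_list="$2"
  cat <<EOF
[Service]
Environment="HTTP_PROXY=${proxy_url}"
Environment="HTTPS_PROXY=${proxy_url}"
Environment="NO_PROXY=${no_proxy_list}"
EOF
}

# Possible use on a node (placeholder values, shown as comments only):
#   render_proxy_dropin "http://proxy.example.com:3128" "localhost,127.0.0.1" \
#     | sudo tee /etc/systemd/system/rpm-ostreed.service.d/http-proxy.conf
```

After writing the file, the daemon-reload and restart commands above would still be needed for rpm-ostreed to pick up the new environment.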
But on some others, I didn't have to do anything. It worked without any issue.
Before updating this cluster, I created a PoC cluster with the same version, 4.11.0-0.okd-2023-01-14-152430, to make sure the upgrade to 4.12 works on GCP. I didn't hit any issues there.