installer not completing : incorrect ocp default registry format registry@sha@sha #933
Comments
cc: @abhinavdahiya @wking |
Probably related to this code:
Hmm. Why is this a warning instead of fatal? (And shouldn't that be |
If it's an So basically there are two issues: (1) the double sha256, and (2) it takes forever to finally fail permanently on the bad format. |
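One way to address the second point would be to check the pull spec before entering a retry loop. The following is only a sketch under assumptions: `is_valid_digest_ref` is a hypothetical helper, not code from the installer or bootkube.sh, and the pattern covers only the `<name>@sha256:<64 hex digits>` form seen in this thread.

```shell
#!/bin/sh
# Hypothetical check (an assumption, not the installer's code): accept only
# references of the form <name>@sha256:<64 lowercase hex digits>.
is_valid_digest_ref() {
    printf '%s\n' "$1" | grep -Eq '^[^@[:space:]]+@sha256:[0-9a-f]{64}$'
}

# The malformed reference from the logs in this issue.
ref='quay.io/openshift-release-dev/ocp-release@sha256@sha256:4f02d5c7183360a519a7c7dbe601f58123c9867cd5721ae503072ae62920575b'

if ! is_valid_digest_ref "$ref"; then
    # A real script would probably abort here instead of retrying the pull.
    echo "invalid release image reference: $ref" >&2
fi
```

With a guard like this, a doubled `@sha256@sha256:` would be reported immediately rather than after a long wait.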
Excerpted from the linked logs:
Huh, update payload Here's the hung wait in those logs:
And on the next
Both v0.6.0 and v0.7.0 have the same code @cgwalters excerpted above. And once the wrapping templating is expanded, it seems to be working fine for me with my random Podman build from last week:
$ podman version
Version: 0.12.2-dev
Go Version: go1.10.3
Git Commit: "ec4cada3d1eabc77d9691a71fe2c99e3bf9343d6-dirty"
Built: Wed Dec 12 23:58:05 2018
OS/Arch: linux/amd64
$ podman pull quay.io/openshift-release-dev/ocp-release:4.0.0-4
$ podman inspect quay.io/openshift-release-dev/ocp-release:4.0.0-4 -f '{{ index .RepoDigests 0 }}'
quay.io/openshift-release-dev/ocp-release@sha256:4f02d5c7183360a519a7c7dbe601f58123c9867cd5721ae503072ae62920575b
The next time you see this, can you grab
Re: "why the warning?", @smarterclayton motivated it with a reference to broken registries, although I'm still not clear on the details there. |
@wking Haven't run into this since that day, so closing the issue, but will reopen if I see the behaviour again. |
Jan 09 16:27:24 test1-bootstrap bootkube.sh[24200]: unable to pull quay.io/openshift-release-dev/ocp-release@sha256@sha256:e237499d3b118e25890550daad8b17274af93baf855914a9c6f8f07ebc095dea: error getting default registries to try: invalid reference format |
podman version
Version: 0.11.1.1 |
podman pull quay.io/openshift-release-dev/ocp-release:4.0.0-9
podman inspect quay.io/openshift-release-dev/ocp-release:4.0.0-9 -f '{{ index .RepoDigests 0 }}' |
@wking Thanks for the update! |
Due to an issue in Podman, it seems to incorrectly return digests with a doubled @sha256 in the name. This patch fixes the issue in bootkube.sh by replacing the double occurrence of @sha256 with a single occurrence, as normally expected.
I.e. quay.io/openshift-release-dev/ocp-release@sha256@sha256:e237499d3b118e25890550daad8b17274af93baf855914a9c6f8f07ebc095dea
Reference to the issue: openshift#933
Signed-off-by: Lev Veyde <[email protected]>
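The kind of guard described in this commit message could look roughly like the sketch below. It is not the actual patch: `normalize_digest` is a hypothetical name, and the `sed` expression is just one assumed way to collapse the doubled prefix.

```shell
#!/bin/sh
# Hypothetical helper (not taken from the actual patch): collapse a doubled
# "@sha256@sha256:" into a single "@sha256:" and pass other references through.
normalize_digest() {
    printf '%s\n' "$1" | sed 's/@sha256@sha256:/@sha256:/'
}

# Example with the malformed digest quoted in this thread.
normalize_digest 'quay.io/openshift-release-dev/ocp-release@sha256@sha256:e237499d3b118e25890550daad8b17274af93baf855914a9c6f8f07ebc095dea'
# prints quay.io/openshift-release-dev/ocp-release@sha256:e237499d3b118e25890550daad8b17274af93baf855914a9c6f8f07ebc095dea
```

Because a well-formed reference contains no doubled prefix, the substitution leaves it unchanged, so the guard should stay harmless once Podman itself returns correct digests.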
Re-opening until we get Podman 1.0. |
I have faced the same issue with installer 0.9.1. I have also seen the 0.10 releases. Do they fix this issue? |
0.9.1 and 0.10.0 both pin RHCOS 47.249 with |
RHCOS 47.268 picked up |
0.11.0 bumped to RHCOS 47.280 with Podman 1.0, so I think this is fixed. /close |
@wking: Closing this issue. In response to this:
Jan 31 00:53:06 ip-10-0-4-46 systemd[1]: Started Bootstrap a Kubernetes cluster. |
@wking yes, it's still up. |
Fix in flight with containers/podman#2251, although we'll need to wait for that to land in libpod and percolate through into RHCOS. |
Or... you could merge my PR, which fixes the issue right now and doesn't
break anything once Podman is finally fixed.
Thanks in advance,
Lev Veyde.
|
True. But the merge queue is long, and this is a 1% issue. So I'm fine letting it slide, but I'm also fine if other maintainers want to land your guard. |
I'm going to go ahead and close this issue, as I believe this has been fixed in Podman. It's not an installer issue, and I don't think we should paper over a broken runtime. |
Version
Platform (aws|libvirt|openstack):
aws
What happened?
Ran the installer and got errors related to pulling an image. The location seems to be incorrect and has 2 shas:
Dec 17 19:44:54 ip-10-0-9-101 bootkube.sh[3794]: unable to pull quay.io/openshift-release-dev/ocp-release@sha256@sha256:4f02d5c7183360a519a7c7dbe601f58123c9867cd5721ae503072ae62920575b: error getting default registries to try: invalid reference format
What you expected to happen?
I expect the installer to run
How to reproduce it (as minimally and precisely as possible)?
Run the installer and note that it hangs at the DEBUG lines below for 20 minutes:
DEBUG added kube-scheduler.157140aa428fc565: ip-10-0-3-94_a271882c-024e-11e9-ae96-024b8209fb6a became leader
DEBUG added kube-controller-manager.157140ab00eeb507: ip-10-0-3-94_a26c1b70-024e-11e9-98fb-024b8209fb6a became leader
until proceeding to the error:
ERROR: logging before flag.Parse: E1217 15:14:13.773270 1521 streamwatcher.go:109] Unable to decode an event from the watch stream: http2: server sent GOAWAY and closed the connection; LastStreamID=3, ErrCode=NO_ERROR, debug=""
WARNING RetryWatcher - getting event failed! Re-creating the watcher. Last RV: 148
WARNING Failed to connect events watcher: Get https://k-api.devcluster.openshift.com:6443/api/v1/namespaces/kube-system/events?resourceVersion=148&watch=true: dial tcp 52.37.184.199:6443: connect: connection refused
The pastebin logs below occur at the time that the WARNINGs appear in the main terminal.
Anything else we need to know?
Output from running
journalctl -b -u bootkube --no-pager
on bootstrap: http://pastebin.test.redhat.com/685235