`kola testiso --scenarios pxe-install` is flaky #1597
Comments
jlebon added a commit to jlebon/fedora-coreos-pipeline that referenced this issue on Jul 15, 2020
This test is flaky right now. See coreos/coreos-assembler#1597.
dustymabe pushed a commit to coreos/fedora-coreos-pipeline that referenced this issue on Jul 15, 2020
This test is flaky right now. See coreos/coreos-assembler#1597.
jlebon added a commit to jlebon/ignition that referenced this issue on Jul 16, 2020
Regression from coreos#958. We switched the list of providers from an array to a map. But iteration order through a map is undefined, so we lost the precedence of providers.

I think this is the cause behind a lot of the FCOS installer test timeouts, such as: coreos/coreos-assembler#1597

There, we pass the Ignition config for the PXE boot via `ignition.config.url`, but if the metal (no-op) fetcher appears earlier than the `cmdline` fetcher, we get no config. And similarly for the installed system when the no-op fetcher appears before the `system` fetcher (which coreos-installer's `--ignition-file` leverages).

The likelihood of this happening increased in the v2.4.0 release due to coreos#1002, which only gave us one try to iterate over the correct provider first (at the `fetch` stage), rather than every stage having a go at it.

Closes: coreos/coreos-assembler#1597
Fix in coreos/ignition#1032.
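For anyone following along, the ordering problem described in the commit message can be illustrated with a small standalone Go sketch. This is not Ignition's code: the `provider` type, the `firstConfig*` helpers, and the example config string are all hypothetical, and the only assumption carried over from the thread is that a no-op fetcher "succeeds" with an empty config and so shadows the real one if it is visited first.

```go
package main

import "fmt"

// provider is a hypothetical stand-in for a config fetcher; it is not
// Ignition's actual type.
type provider struct {
	name  string
	fetch func() (cfg string, ok bool)
}

// firstConfig walks the providers in slice order and returns the first
// one that reports success, so precedence is fixed by position.
func firstConfig(ps []provider) (name, cfg string) {
	for _, p := range ps {
		if c, ok := p.fetch(); ok {
			return p.name, c
		}
	}
	return "", ""
}

// firstConfigFromMap applies the same rule to a map. Go does not specify
// map iteration order, so the winner can differ from call to call.
func firstConfigFromMap(ps map[string]provider) (name, cfg string) {
	for _, p := range ps {
		if c, ok := p.fetch(); ok {
			return p.name, c
		}
	}
	return "", ""
}

// exampleProviders returns illustrative providers in precedence order.
func exampleProviders() []provider {
	return []provider{
		{name: "cmdline", fetch: func() (string, bool) { return "config-from-url", true }},
		{name: "system", fetch: func() (string, bool) { return "", false }},
		// The no-op fetcher always "succeeds", but with an empty config.
		{name: "metal", fetch: func() (string, bool) { return "", true }},
	}
}

func main() {
	ordered := exampleProviders()
	name, cfg := firstConfig(ordered)
	fmt.Printf("slice: %s -> %q\n", name, cfg)

	byName := make(map[string]provider)
	for _, p := range ordered {
		byName[p.name] = p
	}

	// Count which provider wins across repeated map iterations.
	winners := make(map[string]int)
	for i := 0; i < 1000; i++ {
		n, _ := firstConfigFromMap(byName)
		winners[n]++
	}
	fmt.Println("map winners over 1000 runs:", winners)
}
```

With the slice, `cmdline` always wins. With the map, either `cmdline` or `metal` can win on any given run, which would match the intermittent "empty config" symptom reported in this issue.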
The pipeline is sometimes hitting:
I can reproduce this locally sometimes. Offhand it looks like sometimes during the first (live PXE) boot, Ignition isn't getting the complete config but either an empty config or missing the `ignition.config.url` karg? Hard to tell. But the end result is that the `files` stage doesn't write down `live-signal-ok.service`, which the test harness relies on.