Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CRIU: Restore failure on x86-64 #19114

Open
tajila opened this issue Mar 11, 2024 · 5 comments
Open

CRIU: Restore failure on x86-64 #19114

tajila opened this issue Mar 11, 2024 · 5 comments
Labels
arch:x86 criu Used to track CRIU snapshot related work

Comments

@tajila
Copy link
Contributor

tajila commented Mar 11, 2024

17:10:16   [OUT] initiate restore
17:10:16   [OUT] pie: 31483: Error (criu/pie/restorer.c:1839): prctl failed @1839 with -1
17:10:16   [OUT] pie: 31483: Error (criu/pie/restorer.c:1840): prctl failed @1840 with -1
17:10:16   [OUT] pie: 31483: Error (criu/pie/restorer.c:1841): prctl failed @1841 with -1
17:10:16   [OUT] pie: 31483: Error (criu/pie/restorer.c:1842): prctl failed @1842 with -1
17:10:16   [OUT] pie: 31483: Error (criu/pie/restorer.c:1843): prctl failed @1843 with -1
17:10:16   [OUT] pie: 31483: Error (criu/pie/restorer.c:1844): prctl failed @1844 with -1
17:10:16   [OUT] pie: 31483: Error (criu/pie/restorer.c:1845): prctl failed @1845 with -1
17:10:16   [OUT] pie: 31483: Error (criu/pie/restorer.c:1846): prctl failed @1846 with -1
17:10:16   [OUT] pie: 31483: Error (criu/pie/restorer.c:1847): prctl failed @1847 with -1
17:10:16   [OUT] pie: 31483: Error (criu/pie/restorer.c:1848): prctl failed @1848 with -1
17:10:16   [OUT] pie: 31483: Error (criu/pie/restorer.c:1849): prctl failed @1849 with -1
17:10:16   [OUT] pie: 31483: Error (criu/pie/restorer.c:1850): prctl failed @1850 with -1
17:10:16   [OUT] pie: 31483: Error (criu/pie/restorer.c:777): prctl failed @777 with -1
17:10:16   [OUT] pie: 31483: Error (criu/pie/restorer.c:779): Can't restore EXE link (-1)
17:10:16   [OUT] pie: 31483: Error (criu/pie/restorer.c:2102): Restorer fail 31483
17:10:16   [OUT] Error (criu/cr-restore.c:2547): Restoring FAILED.
17:10:16   [OUT] Removed test output files
17:10:16   [OUT] finished script

https://openj9-jenkins.osuosl.org/job/Test_openjdk17_j9_sanity.functional_x86-64_linux_Personal/473/
https://openj9-jenkins.osuosl.org/job/Test_openjdk17_j9_sanity.functional_x86-64_linux_Personal_testList_0/460/ - ub16x64j96

@tajila tajila added the criu Used to track CRIU snapshot related work label Mar 11, 2024
@tajila
Copy link
Contributor Author

tajila commented Mar 11, 2024

@ymanton Have you seen this before

@cjjdespres
Copy link
Contributor

cjjdespres commented Mar 22, 2024

@ymanton
Copy link
Member

ymanton commented Mar 22, 2024

Sorry, missed this mention.

I've seen this error before, but it can be caused by many different kinds of problems. We need a more verbose CRIU log to understand what happened before the errors, although even that isn't always enough info.

I'll open a PR to make the logging more verbose and we can go from there.

This one is odd in that lots of CRIU tests succeeded, but a handful failed, however the syscall causing the issues is always executed and I don't see any documented reason why it would fail intermittently.

@pshipton
Copy link
Member

Is this a machine problem, unsupported OS? The test was running on Ubuntu 16. They always fails on Ubuntu 16, see #18994 (comment) for a short list and then I stopped reporting them.

Also Jason opened #19211 for the same issue on SLES 12.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
arch:x86 criu Used to track CRIU snapshot related work
Projects
None yet
Development

No branches or pull requests

4 participants