-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[OKD 4.10] CephFS seems to be broken with FCOS 35 upgrade. image registry writes fail. #1167
Comments
That would be extremely useful. If we could reproduce it easily (i.e. single node without OKD) then we can bisect the history and find where it stopped working. |
Here is the error: From pod with CephFS mounted at /var/www/html
From worker node:
What is odd is that the Unfortunately, FCOS doesn't seem to have the selinux managements tools like sealert |
Here are the file permissions on the container:
|
@fortinj66 any luck on getting a smaller reproducer? |
I have not yet, unfortunately but @SriRamanujam tried something and was unable to make it break... okd-project/okd#1160 (comment) More research is needed... I've stumbled across a new issue which may have higher priority :( |
Hi, |
I'm wondering if this is somehow SELinux policy drift. See #701 (comment) for a description and workaround. |
Another question to ask (similar to the above) is "does this issue happen on a brand new cluster or just one that upgraded?". |
@dustymabe Over on the OKD issue, @schuemann has reported that this was reproducible on brand new and upgraded clusters: okd-project/okd#1160 (comment) Let me try the workaround in the issue you linked and see how that goes. |
Running |
We remove all custom policies on all nodes (finding more rules with |
Thanks for the info @SriRamanujam @depouill. |
Figured it out - details in okd-project/okd#1160 (comment). tl;dr the kernel changed a default from synchronous to asynchronous directory ops and that seems to be breaking selinux contexts in cephfs mounts across the board - probably xattrs in general if I had to guess, though I didn't explicitly test that. So nothing to do with FCOS or selinux specifically. |
Looks like we can call this closed when the 5.17.9 kernel lands in FCOS. See https://bugzilla.redhat.com/show_bug.cgi?id=2063929#c15 |
So not likely in a 5.16.x kernel that we would see in FCOS 35? So probably not in OKD 4.10... |
@fortinj66 - Fedora CoreOS is in the process of moving to Fedora 36 (next week's As an aside there is a kernel build of 5.17.9 for F36 and F35: So you could pick up and use that if you're motivated. |
That’s not a bad idea…. And I can test if the kernel panic is fixed too… |
Fixes an issue in ceph. Closes coreos/fedora-coreos-tracker#1167
The fix for this went into |
The fix for this went into |
Fixes an issue in ceph. Closes coreos/fedora-coreos-tracker#1167
Fixes an issue in ceph. Closes coreos/fedora-coreos-tracker#1167
Describe the bug
Writing to image registry fails after upgrade from OKD 4.9 -> 4.10 using ceph-fs filesystems. Ceph block FS is fine.
See
okd-project/okd#1160
okd-project/okd#1160 (comment)
okd-project/okd#1153
Reproduction steps
Steps to reproduce the behavior:
Expected behavior
CephFS accepts writes properly
Actual behavior
See logs above
System details
OKD with FCOS 35
I'm hoping to be able to reproduce this in an easier manner soon. My initial thought is that this is some kind of selinux permissions issue.
The text was updated successfully, but these errors were encountered: