Context Deadline Exceeded (CreateContainerError) warnings on pods with attached PVCs #1766
-
I have a number of CronJobs running at 1-minute intervals with attached PVC storage (EBS gp3 volumes). Occasionally we see jobs being delayed at startup for varying lengths of time, reporting "Context Deadline Exceeded". The pods do eventually come online and complete successfully. Events from one such pod look like the following:
I'm trying to gain some further insight into the processes that run while the context is being set up, so that I can find a way to reduce the time it takes for the cron job to start. To me it looks similar to cri-o/cri-o#6185; however, that issue refers to object storage rather than a block device. Is it possible that SELinux relabelling could be the issue, and is there a way in OKD to test with the "TrySkipVolumeSELinuxLabel" annotation? Regards,
Replies: 3 comments
-
It looks very similar, so I think it's the same issue.
Yes. You should create a MachineConfig with a CRI-O config change; see openshift/enhancements#362 (comment) for an example.
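For anyone landing here later, a rough sketch of what such a MachineConfig might look like, generated with a small Python script so the CRI-O drop-in gets base64-encoded correctly. This is not copied from the linked comment and may differ from it. The handler name `selinux`, the drop-in path, the object names, the `runc` paths and the Ignition version are all assumptions to adjust for your cluster. The parts I'm reasonably sure of are the CRI-O `allowed_annotations` runtime option and the full annotation name `io.kubernetes.cri-o.TrySkipVolumeSELinuxLabel`.

```python
import base64
import json

# Assumed, illustrative names; change them to suit your cluster.
HANDLER = "selinux"  # hypothetical CRI-O runtime handler / RuntimeClass name
DROPIN_PATH = "/etc/crio/crio.conf.d/99-skip-selinux-relabel.conf"  # hypothetical path
ANNOTATION = "io.kubernetes.cri-o.TrySkipVolumeSELinuxLabel"

# CRI-O drop-in defining an extra runtime handler that is allowed to honour
# the annotation. The runc paths are assumptions; check the node's own config.
CRIO_DROPIN = f"""[crio.runtime.runtimes.{HANDLER}]
runtime_path = "/usr/bin/runc"
runtime_root = "/run/runc"
runtime_type = "oci"
allowed_annotations = ["{ANNOTATION}"]
"""


def build_machine_config(role="worker"):
    """Return a MachineConfig dict that writes the drop-in onto nodes of `role`."""
    encoded = base64.b64encode(CRIO_DROPIN.encode("utf-8")).decode("ascii")
    return {
        "apiVersion": "machineconfiguration.openshift.io/v1",
        "kind": "MachineConfig",
        "metadata": {
            "name": f"99-{role}-crio-skip-selinux-relabel",
            "labels": {"machineconfiguration.openshift.io/role": role},
        },
        "spec": {
            "config": {
                # Ignition spec version is an assumption; match your OKD release.
                "ignition": {"version": "3.2.0"},
                "storage": {
                    "files": [
                        {
                            "path": DROPIN_PATH,
                            "mode": 420,  # decimal for octal 0644
                            "overwrite": True,
                            "contents": {
                                "source": "data:text/plain;charset=utf-8;base64,"
                                + encoded
                            },
                        }
                    ]
                },
            }
        },
    }


def build_runtime_class():
    """Return a RuntimeClass dict that maps pods onto the extra handler."""
    return {
        "apiVersion": "node.k8s.io/v1",
        "kind": "RuntimeClass",
        "metadata": {"name": HANDLER},
        "handler": HANDLER,
    }


if __name__ == "__main__":
    # Emit a single JSON List so both objects can be applied in one go.
    manifest = {
        "apiVersion": "v1",
        "kind": "List",
        "items": [build_machine_config(), build_runtime_class()],
    }
    print(json.dumps(manifest, indent=2))
```

The output can be piped to `oc apply -f -`. Applying a MachineConfig normally triggers a rolling reboot of the affected pool, so trying it on a single node first is sensible. On the workload side, the pod template would then need `runtimeClassName` set to the handler name and the annotation above set to `"true"`. As far as I understand it, CRI-O only skips the recursive relabel when the top-level directory of the volume already carries the expected SELinux label.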
-
Thanks for the quick reply and the example. I'll trial it out and see if there is any noticeable improvement. Regards,
-
I ended up testing this on an isolated single node, and the pod startup time was vastly improved when attaching a well-populated PVC volume. I've since rolled this out to all nodes, and we noticed the load average on worker nodes drop noticeably, with no ill effects.