Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] Regression in kube-scheduler impacting Kubernetes versions v1.27.14, v1.28.10, v1.29.5 #4434

Closed
qpetraroia opened this issue Jul 24, 2024 · 3 comments
Labels
action-required bug Needs Attention 👋 Issues needs attention/assignee/owner

Comments

@qpetraroia
Copy link
Contributor

qpetraroia commented Jul 24, 2024

Describe the bug

The AKS team has found a bug that is causing regressions in kube-scheduler impacting Kubernetes versions v1.27.14, v1.28.10, v1.29.5. This regression causes the kube-scheduler to panic when a cluster has a pod with a bad node affinity. If this is the case, no pod will be scheduled.

Below is an example condition where the bug will be triggered

apiVersion: v1
kind: Pod
metadata:
  name: break-kube-scheduler
spec:
  affinity:
    nodeAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
        nodeSelectorTerms:
        - matchFields:
          - key: metadata.name
            operator: In
            values:
            - invalid-node # a node that doesn't exist
 

Interim fix
Be aware that Azure Kubernetes Service engineers are actively fixing all impacted clusters as well as deploying a fix to kube-scheduler which will be rolled out to all regions. These fixes will be applied automatically to your cluster.

Thank you for understanding,
The AKS team

@qpetraroia qpetraroia added the bug label Jul 24, 2024
@qpetraroia qpetraroia pinned this issue Jul 24, 2024
@robbiezhang
Copy link
Contributor

k8s issue: kubernetes/kubernetes#124930

Copy link
Contributor

Action required from @aritraghosh, @julia-yin, @AllenWen-at-Azure

@microsoft-github-policy-service microsoft-github-policy-service bot added the Needs Attention 👋 Issues needs attention/assignee/owner label Aug 26, 2024
@AllenWen-at-Azure
Copy link
Contributor

This upstream issue has been fixed in AKS releases in July. closing this issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
action-required bug Needs Attention 👋 Issues needs attention/assignee/owner
Projects
None yet
Development

No branches or pull requests

3 participants