Skip to content

Latest commit

 

History

History
860 lines (621 loc) · 21 KB

File metadata and controls

860 lines (621 loc) · 21 KB

CKA (Certified Kubernetes Administrator)

Setup

  • Set up Kubernetes cluster. Use on of the following

    1. Minikube for local free & simple cluster
    2. Managed Cluster (EKS, GKE, AKS)
  • Set aliases

alias k=kubectl
alias kd=kubectl delete
alias kds=kubectl describe
alias ke=kubectl edit
alias kr=kubectl run
alias kg=kubectl get

Pods

Run a command to view all the pods in the current namespace

kubectl get pods

Note: create an alias (alias k=kubectl) and get used to k get po

Run a pod called "nginx-test" using the "nginx" image

k run nginx-test --image=nginx

Assuming you have a Pod called "nginx-test", how to remove it?

k delete nginx-test

In what namespace the etcd pod is running? list the pods in that namespace

k get po -n kube-system

Let's say you didn't know in what namespace it is. You could then run k get po -A | grep etc to find the Pod and see in what namespace it resides.

List pods from all namespaces

k get po -A

The long version would be kubectl get pods --all-namespaces.

Write a YAML of a Pod with two containers and use the YAML file to create the Pod (use whatever images you prefer)
cat > pod.yaml <<EOL
apiVersion: v1
kind: Pod
metadata:
  name: test
spec:
  containers:
  - image: alpine
    name: alpine
  - image: nginx-unprivileged
    name: nginx-unprivileged
EOL

k create -f pod.yaml

If you ask yourself how would I remember writing all of that? no worries, you can simply run kubectl run some_pod --image=redis -o yaml --dry-run=client > pod.yaml. If you ask yourself "how am I supposed to remember this long command" time to change attitude ;)

Create a YAML of a Pod without actually running the Pod with the kubectl command (use whatever image you prefer)

k run some-pod -o yaml --image nginx-unprivileged --dry-run=client > pod.yaml

How to test a manifest is valid?

with --dry-run flag which will not actually create it, but it will test it and you can find this way any syntax issues.

k create -f YAML_FILE --dry-run

How to check which image a certain Pod is using?

k describe po <POD_NAME> | grep -i image

How to check how many containers run in signle Pod?

k get po POD_NAME and see the number under "READY" column.

You can also run k describe po POD_NAME

Run a Pod called "remo" with the the latest redis image and the label 'year=2017'

k run remo --image=redis:latest -l year=2017

List pods and their labels

k get po --show-labels

Delete a Pod called "nm"

k delete po nm

List all the pods with the label "env=prod"

k get po -l env=prod

To count them: k get po -l env=prod --no-headers | wc -l

Create a static pod with the image python that runs the command sleep 2017

First change to the directory tracked by kubelet for creating static pod: cd /etc/kubernetes/manifests (you can verify path by reading kubelet conf file)

Now create the definition/manifest in that directory k run some-pod --image=python --command sleep 2017 --restart=Never --dry-run=client -o yaml > statuc-pod.yaml

Describe how would you delete a static Pod

Locate the static Pods directory (look at staticPodPath in kubelet configuration file).

Go to that directory and remove the manifest/definition of the staic Pod (rm <STATIC_POD_PATH>/<POD_DEFINITION_FILE>)

Troubleshooting Pods

You try to run a Pod but see the status "CrashLoopBackOff". What does it means? How to identify the issue?

The container failed to run (due to different reasons) and Kubernetes tries to run the Pod again after some delay (= BackOff time).

Some reasons for it to fail:

  • Misconfiguration - mispelling, non supported value, etc.
  • Resource not available - nodes are down, PV not mounted, etc.

Some ways to debug:

  1. kubectl describe pod POD_NAME
    1. Focus on State (which should be Waiting, CrashLoopBackOff) and Last State which should tell what happened before (as in why it failed)
  2. Run kubectl logs mypod
    1. This should provide an accurate output of
    2. For specific container, you can add -c CONTAINER_NAME
  3. If you still have no idea why it failed, try kubectl get events

What the error ImagePullBackOff means?

Most likely you didn't write correctly the name of the image you try to pull and run. Or perhaps it doesn't exists in the registry.

You can confirm with kubectl describe po POD_NAME

How to check on which node a certain Pod is running?

k get po POD_NAME -o wide

Run the following command: kubectl run ohno --image=sheris. Did it work? why not? fix it without removing the Pod and using any image you would like

Because there is no such image sheris. At least for now :)

To fix it, run kubectl edit ohno and modify the following line - image: sheris to - image: redis or any other image you prefer.

You try to run a Pod but it's in "Pending" state. What might be the reason?

One possible reason is that the scheduler which supposed to schedule Pods on nodes, is not running. To verify it, you can run kubectl get po -A | grep scheduler or check directly in kube-system namespace.

How to view the logs of a container running in a Pod?

k logs POD_NAME

There are two containers inside a Pod called "some-pod". What will happen if you run kubectl logs some-pod

It won't work because there are two containers inside the Pod and you need to specify one of them with kubectl logs POD_NAME -c CONTAINER_NAME

Namespaces

List all the namespaces

k get ns

Create a namespace called 'alle'

k create ns alle

Check how many namespaces are there

k get ns --no-headers | wc -l

Check how many pods exist in the "dev" namespace

k get po -n dev

Create a pod called "kartos" in the namespace dev. The pod should be using the "redis" image.

If the namespace doesn't exist already: k create ns dev

k run kratos --image=redis -n dev

You are looking for a Pod called "atreus". How to check in which namespace it runs?

k get po -A | grep atreus

Nodes

Run a command to view all nodes of the cluster

kubectl get nodes

Note: create an alias (alias k=kubectl) and get used to k get no

Create a list of all nodes in JSON format and store it in a file called "some_nodes.json"

k get nodes -o json > some_nodes.json

Check what labels one of your nodes in the cluster has

k get no minikube --show-labels

Services

Check how many services are running in the current namespace

k get svc

Create an internal service called "sevi" to expose the app 'web' on port 1991
How to reference by name a service called "app-service" within the same namespace?

app-service

How to check the TargetPort of a service?

k describe svc <SERVICE_NAME>

How to check what endpoints the svc has?

k describe svc <SERVICE_NAME>

How to reference by name a service called "app-service" within a different namespace, called "dev"?

app-service.dev.svc.cluster.local

Assume you have a deployment running and you need to create a Service for exposing the pods. This is what is required/known:
  • Deployment name: jabulik
  • Target port: 8080
  • Service type: NodePort
  • Selector: jabulik-app
  • Port: 8080

kubectl expose deployment jabulik --name=jabulik-service --target-port=8080 --type=NodePort --port=8080 --dry-run=client -o yaml -> svc.yaml

vi svc.yaml (make sure selector is set to jabulik-app)

k apply -f svc.yaml

ReplicaSets

How to check how many replicasets defined in the current namespace?

k get rs

You have a replica set defined to run 3 Pods. You removed one of these 3 pods. What will happen next? how many Pods will there be?

There will still be 3 Pods running theoretically because the goal of the replica set is to ensure that. so if you delete one or more Pods, it will run additional Pods so there are always 3 Pods.

How to check which container image was used as part of replica set called "repli"?

k describe rs repli | grep -i image

How to check how many Pods are ready as part of a replica set called "repli"?

k describe rs repli | grep -i "Pods Status"

How to delete a replica set called "rori"?

k delete rs rori

How to modify a replica set called "rori" to use a different image?

k edis rs rori

Scale up a replica set called "rori" to run 5 Pods instead of 2

k scale rs rori --replicas=5

Scale down a replica set called "rori" to run 1 Pod instead of 5

k scale rs rori --replicas=1

Troubleshooting ReplicaSets

Fix the following ReplicaSet definition
apiVersion: apps/v1
kind: ReplicaCet
metadata:
  name: redis
  labels:
    app: redis
    tier: cache
spec:
  selector:
    matchLabels:
      tier: cache
  template:
    metadata:
      labels:
        tier: cachy
    spec:
      containers:
      - name: redis
        image: redis

kind should be ReplicaSet and not ReplicaCet :)

Fix the following ReplicaSet definition
apiVersion: apps/v1
kind: ReplicaSet
metadata:
  name: redis
  labels:
    app: redis
    tier: cache
spec:
  selector:
    matchLabels:
      tier: cache
  template:
    metadata:
      labels:
        tier: cachy
    spec:
      containers:
      - name: redis
        image: redis

The selector doesn't match the label (cache vs cachy). To solve it, fix cachy so it's cache instead.

Deployments

How to list all the deployments in the current namespace?

k get deploy

How to check which image a certain Deployment is using?

k describe deploy <DEPLOYMENT_NAME> | grep image

Create a file definition/manifest of a deployment called "dep", with 3 replicas that uses the image 'redis'

k create deploy dep -o yaml --image=redis --dry-run=client --replicas 3 > deployment.yaml

Remove the deployment `depdep`

k delete deploy depdep

Create a deployment called "pluck" using the image "redis" and make sure it runs 5 replicas

kubectl create deployment pluck --image=redis

kubectl scale deployment pluck --replicas=5

Create a deployment with the following properties:
  • called "blufer"
  • using the image "python"
  • runs 3 replicas
  • all pods will be placed on a node that has the label "blufer"

kubectl create deployment blufer --image=python --replicas=3 -o yaml --dry-run=client > deployment.yaml

Add the following section (vi deployment.yaml):

spec:
  affinity:
    nodeAffinity:
      requiredDuringSchedlingIgnoredDuringExecution:
        nodeSelectorTerms:
        - matchExpressions:
          - key: blufer
            operator: Exists

kubectl apply -f deployment.yaml

Troubleshooting Deployments

Fix the following deployment manifest
apiVersion: apps/v1
kind: Deploy
metadata:
  creationTimestamp: null
  labels:
    app: dep
  name: dep
spec:
  replicas: 3
  selector:
    matchLabels:
      app: dep
  strategy: {}
  template:
    metadata:
      creationTimestamp: null
      labels:
        app: dep
    spec:
      containers:
      - image: redis
        name: redis
        resources: {}
status: {}

Change kind: Deploy to kind: Deployment

Fix the following deployment manifest
apiVersion: apps/v1
kind: Deployment
metadata:
  creationTimestamp: null
  labels:
    app: dep
  name: dep
spec:
  replicas: 3
  selector:
    matchLabels:
      app: depdep
  strategy: {}
  template:
    metadata:
      creationTimestamp: null
      labels:
        app: dep
    spec:
      containers:
      - image: redis
        name: redis
        resources: {}
status: {}

The selector doesn't match the label (dep vs depdep). To solve it, fix depdep so it's dep instead.

Scheduler

How to schedule a pod on a node called "node1"?

k run some-pod --image=redix -o yaml --dry-run=client > pod.yaml

vi pod.yaml and add:

spec:
  nodeName: node1

k apply -f pod.yaml

Note: if you don't have a node1 in your cluster the Pod will be stuck on "Pending" state.

Node Affinity

Using node affinity, set a Pod to schedule on a node where the key is "region" and value is either "asia" or "emea"

vi pod.yaml

affinity:
  nodeAffinity:
    requiredDuringSchedlingIgnoredDuringExecution:
      nodeSelectorTerms:
      - matchExpressions:
        - key: region
          operator: In
          values:
          - asia
          - emea

Using node affinity, set a Pod to never schedule on a node where the key is "region" and value is "neverland"

vi pod.yaml

affinity:
  nodeAffinity:
    requiredDuringSchedlingIgnoredDuringExecution:
      nodeSelectorTerms:
      - matchExpressions:
        - key: region
          operator: NotIn
          values:
          - neverland

Labels and Selectors

How to list all the Pods with the label "app=web"?

k get po -l app=web

How to list all objects labeled as "env=staging"?

k get all -l env=staging

How to list all deployments from "env=prod" and "type=web"?

k get deploy -l env=prod,type=web

Node Selector

Apply the label "hw=max" on one of the nodes in your cluster

kubectl label nodes some-node hw=max

reate and run a Pod called `some-pod` with the image `redis` and configure it to use the selector `hw=max`
kubectl run some-pod --image=redis --dry-run=client -o yaml > pod.yaml

vi pod.yaml

spec:
  nodeSelector:
    hw: max

kubectl apply -f pod.yaml

Explain why node selectors might be limited

Assume you would like to run your Pod on all the nodes with with either hw set to max or to min, instead of just max. This is not possible with nodeSelectors which are quite simplified and this is where you might want to consider node affinity.

Taints

Check if there are taints on node "master"

k describe no master | grep -i taints

Create a taint on one of the nodes in your cluster with key of "app" and value of "web" and effect of "NoSchedule". Verify it was applied

k taint node minikube app=web:NoSchedule

k describe no minikube | grep -i taints

You applied a taint with k taint node minikube app=web:NoSchedule on the only node in your cluster and then executed kubectl run some-pod --image=redis. What will happen?

The Pod will remain in "Pending" status due to the only node in the cluster having a taint of "app=web".

You applied a taint with k taint node minikube app=web:NoSchedule on the only node in your cluster and then executed kubectl run some-pod --image=redis but the Pod is in pending state. How to fix it?

kubectl edit po some-pod and add the following

  - effect: NoSchedule
    key: app
    operator: Equal
    value: web

Exit and save. The pod should be in Running state now.

Remove an existing taint from one of the nodes in your cluster

k taint node minikube app=web:NoSchedule-

Resources Limits

Check if there are any limits on one of the pods in your cluster

kubectl describe po <POD_NAME> | grep -i limits

Run a pod called "yay" with the image "python" and resources request of 64Mi memory and 250m CPU

kubectl run yay --image=python --dry-run=client -o yaml > pod.yaml

vi pod.yaml

spec:
  containers:
  - image: python
    imagePullPolicy: Always
    name: yay
    resources:
      requests:
        cpu: 250m
        memory: 64Mi

kubectl apply -f pod.yaml

Run a pod called "yay2" with the image "python". Make sure it has resources request of 64Mi memory and 250m CPU and the limits are 128Mi memory and 500m CPU

kubectl run yay2 --image=python --dry-run=client -o yaml > pod.yaml

vi pod.yaml

spec:
  containers:
  - image: python
    imagePullPolicy: Always
    name: yay2
    resources:
      limits:
        cpu: 500m
        memory: 128Mi
      requests:
        cpu: 250m
        memory: 64Mi

kubectl apply -f pod.yaml

Monitoring

Deploy metrics-server

kubectl apply -f https://github.com/kubernetes-sigs/metrics-server/releases/latest/download/components.yaml

Using metrics-server, view the following:
  • top performing nodes in the cluster
  • top performing Pods

  • top nodes: kubectl top nodes
  • top pods: kubectl top pods

Scheduler

Can you deploy multiple schedulers?

Yes, it is possible. You can run another pod with a command similar to:

spec:
  containers:
  - command:
    - kube-scheduler
    - --address=127.0.0.1
    - --leader-elect=true
    - --scheduler-name=some-custom-scheduler
...

Assuming you have multiple schedulers, how to know which scheduler was used for a given Pod?

Running kubectl get events you can see which scheduler was used.

You want to run a new Pod and you would like it to be scheduled by a custom schduler. How to achieve it?

Add the following to the spec of the Pod:

spec:
  schedulerName: some-custom-scheduler