节点调度
节点调度
Taint
$ kubectl get nodes
NAME STATUS ROLES AGE VERSION
elliot-01 Ready master 7d14h v1.18.2
elliot-02 Ready <none> 7d14h v1.18.2
elliot-03 Ready <none> 7d14h v1.18.2
$ kubectl describe node elliot-01 | grep -i taint
Taints: node-role.kubernetes.io/master:NoSchedule
我们将测试一些东西,并允许主节点运行其他
$ kubectl create deployment nginx --image=nginx
deployment.apps/nginx created
$ kubectl get deployments.apps
NAME READY UP-TO-DATE AVAILABLE AGE
nginx 1/1 1 1 5s
$ kubectl scale deployment nginx --replicas=3
deployment.apps/nginx scaled
$ kubectl get deployments.apps
NAME READY UP-TO-DATE AVAILABLE AGE
nginx 3/3 3 3 1m5s
$ kubectl get pods -o wide
NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE READINESS GATES
limit-pod 1/1 Running 0 3m44s 10.32.0.4 elliot-02 <none> <none>
nginx 1/1 Running 0 25m 10.46.0.1 elliot-03 <none> <none>
nginx-85f7fb6b45-9bzwc 1/1 Running 0 6m7s 10.32.0.3 elliot-02 <none> <none>
nginx-85f7fb6b45-cbmtr 1/1 Running 0 6m7s 10.46.0.2 elliot-03 <none> <none>
nginx-85f7fb6b45-rprz5 1/1 Running 0 6m7s 10.32.0.2 elliot-02 <none> <none>
我们将把
$ kubectl taint node elliot-02 key1=value1:NoSchedule
node/elliot-02 tainted
$ kubectl describe node elliot-02 | grep -i taint
Taints: key1=value1:NoSchedule
$ kubectl taint node elliot-03 key1=value1:NoSchedule
node/elliot-03 tainted
$ kubectl describe node elliot-03 | grep -i taint
Taints: key1=value1:NoSchedule
现在让我们增加副本的数量:
$ kubectl scale deployment nginx --replicas=5
deployment.apps/nginx scaled
$ kubectl get pods -o wide
NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE READINESS GATES
limit-pod 1/1 Running 0 5m23s 10.32.0.4 elliot-02 <none> <none>
nginx 1/1 Running 0 27m 10.46.0.1 elliot-03 <none> <none>
nginx-85f7fb6b45-9bzwc 1/1 Running 0 7m46s 10.32.0.3 elliot-02 <none> <none>
nginx-85f7fb6b45-cbmtr 1/1 Running 0 7m46s 10.46.0.2 elliot-03 <none> <none>
nginx-85f7fb6b45-qnhtl 0/1 Pending 0 18s <none> <none> <none> <none>
nginx-85f7fb6b45-qsvpp 0/1 Pending 0 18s <none> <none> <none> <none>
nginx-85f7fb6b45-rprz5 1/1 Running 0 7m46s 10.32.0.2 elliot-02 <none> <none>
正如我们所看到的,新的复制是孤儿,希望有一个节点出现在
$ kubectl taint node elliot-02 key1:NoSchedule-
node/elliot-02 untainted
$ kubectl taint node elliot-03 key1:NoSchedule-
node/elliot-03 untainted
$ kubectl get pods -o wide
NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE READINESS GATES
limit-pod 1/1 Running 0 6m17s 10.32.0.4 elliot-02 <none> <none>
nginx 1/1 Running 0 27m 10.46.0.1 elliot-03 <none> <none>
nginx-85f7fb6b45-9bzwc 1/1 Running 0 8m40s 10.32.0.3 elliot-02 <none> <none>
nginx-85f7fb6b45-cbmtr 1/1 Running 0 8m40s 10.46.0.2 elliot-03 <none> <none>
nginx-85f7fb6b45-qnhtl 1/1 Running 0 72s 10.46.0.5 elliot-03 <none> <none>
nginx-85f7fb6b45-qsvpp 1/1 Running 0 72s 10.46.0.4 elliot-03 <none> <none>
nginx-85f7fb6b45-rprz5 1/1 Running 0 8m40s 10.32.0.2 elliot-02 <none> <none>
有几种类型的标签,我们可以用来对节点进行分类,我们来测试另一个调用
$ kubectl taint node elliot-02 key1=value1:NoExecute
node/elliot-02 tainted
$ kubectl taint node elliot-03 key1=value1:NoExecute
node/elliot-03 tainted
$ kubectl get pods
NAME READY STATUS RESTARTS AGE
nginx-85f7fb6b45-87sq5 0/1 Pending 0 20s
nginx-85f7fb6b45-8q99g 0/1 Pending 0 20s
nginx-85f7fb6b45-drmzz 0/1 Pending 0 20s
nginx-85f7fb6b45-hb4dp 0/1 Pending 0 20s
nginx-85f7fb6b45-l6zln 0/1 Pending 0 20s
我们可以看到所有的
$ kubectl scale deployment nginx --replicas=1
deployment.apps/nginx scaled
$ kubectl get pods
nginx-85f7fb6b45-drmzz 0/1 Pending 0 43s
$ kubectl taint node elliot-02 key1:NoExecute-
node/elliot-02 untainted
$ kubectl taint node elliot-03 key1:NoExecute-
node/elliot-03 untainted
$ kubectl get pods
NAME READY STATUS RESTARTS AGE
nginx-85f7fb6b45-drmzz 1/1 Running 0 76s
我们现在有一个节点在正常运行。但是如果我们的
$ kubectl taint nodes --all node-role.kubernetes.io/master-
node/elliot-01 untainted
$ kubectl describe node elliot-01 | grep -i taint
Taints: <none>
$ kubectl scale deployment nginx --replicas=4
deployment.apps/nginx scaled
$ kubectl get pods -o wide
NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE READINESS GATES
nginx-85f7fb6b45-2c6dm 1/1 Running 0 9s 10.32.0.2 elliot-02 <none> <none>
nginx-85f7fb6b45-4jzcn 1/1 Running 0 9s 10.32.0.3 elliot-02 <none> <none>
nginx-85f7fb6b45-drmzz 1/1 Running 0 114s 10.46.0.1 elliot-03 <none> <none>
nginx-85f7fb6b45-rstvq 1/1 Running 0 9s 10.46.0.2 elliot-03 <none> <none>
让我们在
$ kubectl taint node elliot-02 key1=value1:NoExecute
node/elliot-02 tainted
$ kubectl taint node elliot-03 key1=value1:NoExecute
node/elliot-03 tainted
$ kubectl get pods -o wide
NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE READINESS GATES
nginx-85f7fb6b45-49knz 1/1 Running 0 14s 10.40.0.5 elliot-01 <none> <none>
nginx-85f7fb6b45-4cm9x 1/1 Running 0 14s 10.40.0.4 elliot-01 <none> <none>
nginx-85f7fb6b45-kppnd 1/1 Running 0 14s 10.40.0.6 elliot-01 <none> <none>
nginx-85f7fb6b45-rjlmj 1/1 Running 0 14s 10.40.0.3 elliot-01 <none> <none>
$ kubectl taint node --all key1:NoSchedule-
node/elliot-01 untainted
node/elliot-02 untainted
node/elliot-03 untainted
$ kubectl taint node --all key1:NoExecute-
node/kube-worker1 untainted
node/kube-worker2 untainted
error: taint "key1:NoExecute" not found
将节点设置为维护状态
为了将节点置于维护状态,我们将使用
$ kubectl cordon elliot-02
node/elliot-02 cordoned
$ kubectl get nodes
NAME STATUS ROLES AGE VERSION
elliot-01 Ready master 7d14h v1.18.2
elliot-02 Ready,SchedulingDisabled <none> 7d14h v1.18.2
elliot-03 Ready <none> 7d14h v1.18.2
注意到节点
$ kubectl uncordon elliot-02
node/elliot-02 uncordoned
$ kubectl get nodes
NAME STATUS ROLES AGE VERSION
elliot-01 Ready master 7d14h v1.18.2
elliot-02 Ready <none> 7d14h v1.18.2
elliot-03 Ready <none> 7d14h v1.18.2
根据标签选择节点
节点选择器是对我们的节点进行分类的方法,比如我们的节点
$ kubectl label node elliot-02 disk=SSD
$ kubectl label node elliot-02 dc=UK
$ kubectl label node elliot-03 dc=Netherlands
$ kubectl label nodes elliot-03 disk=hdd
$ kubectl label nodes elliot-03 disk=HDD --overwrite
要知道每个节点上配置的标签,只需执行以下命令。
$ kubectl label nodes elliot-02 --list
dc=UK
disk=SSD
kubernetes.io/hostname=elliot-02
beta.kubernetes.io/arch=amd64
beta.kubernetes.io/os=linux
$ kubectl label nodes elliot-03 --list
beta.kubernetes.io/os=linux
dc=Netherlands
disk=HDD
kubernetes.io/hostname=elliot-03
beta.kubernetes.io/arch=amd64
现在,只需再次执行部署,但首先我们将在
apiVersion : apps / v1
kind : Deployment
metadata :
labels :
run : nginx
name : third-deployment
namespace : default
spec :
replicas : 1
selector :
matchLabels :
run : nginx
template :
metadata :
creationTimestamp : null
labels :
run : nginx
dc :Netherlands
spec :
containers :
- image : nginx
imagePullPolicy : Always
name : nginx2
ports :
- containerPort : 80
protocol : TCP
resources : {}
terminationMessagePath : / dev / termination-log
terminationMessagePolicy : File
dnsPolicy : ClusterFirst
restartPolicy : Always
schedulerName : default-scheduler
securityContext : {}
terminationGracePeriodSeconds : 30
nodeSelector :
disk : SSD
从清单中创建部署:
$ kubectl create -f terceiro-deployment.yaml
deployment.extensions/terceiro-deployment created
$ kubectl get pods -o wide
NAME READY STATUS RESTARTS AGE IP NODE
primeiro-deployment-56d9... 1/1 Running 0 14m 172.17.0.4 elliot-03
segundo-deployment-869f... 1/1 Running 0 14m 172.17.0.5 elliot-03
terceiro-deployment-59cd... 1/1 Running 0 22s 172.17.0.6 elliot-02
我们可以通过如下方式移除标签:
$ kubectl label nodes elliot-02 dc-
$ kubectl label nodes --all dc-
现在想象一下,这可以为你提供无限的可能性,比如它是否是生产型的,是否消耗大量的