问题
ka get pod -o wide
ka describe pod oracle-265abd451-0
Pod无法调度
Warning FailedScheduling 8s default-scheduler 0/3 nodes are available: 1 Insufficient memory, 1 node(s) had taint {node.kubernetes.io/not-ready: }, that the pod didn’t tolerate, 1 node(s) had volume node affinity conflict.
默认调度程序0/3个节点可用:1个内存不足,1个节点具有pod无法容忍的污点{node.kubernetes.io/not-ready:},1个节点具有卷节点亲和力冲突。
排查思路
查看节点状态:发现有一个节点NotReady
k get node
查看标签
k get nodes --show-labels
查看污点
k get node -o yaml |grep taint -A 5
f:taints: {}
manager: kube-controller-manager
operation: Update
time: "2022-07-19T12:49:26Z"
name: x-x-x-x
resourceVersion: "233927"
--
taints:
- effect: NoSchedule
key: node.kubernetes.io/not-ready
timeAdded: "2022-07-19T12:49:26Z"
- effect: NoExecute
key: node.kubernetes.io/not-ready
--
f:taints: {}
f:status:
f:conditions:
k:{"type":"DiskPressure"}:
f:lastTransitionTime: {}
f:message: {}
--
taints:
- effect: NoSchedule
key: node.kubernetes.io/unreachable
timeAdded: "2022-07-19T12:45:33Z"
- effect: NoExecute
key: node.kubernetes.io/unreachable
kubectl taint nodes node{1,2,3} node-role.kubernetes.io/master:NoSchedule
清除污点
k taint nodes --all node-role.kubernetes.io/master-
taint "node-role.kubernetes.io/master" not found
taint "node-role.kubernetes.io/master" not found
taint "node-role.kubernetes.io/master" not found
k get node -o yaml |grep taint -A 5
没了。。。
重启Pod后正常。。。