将原有集群删除后,重新创建集群,rancher集群报错:
Cluster health check failed: Failed to communicate with API server: Get "https://192.168.200.10:6443/api/v1/namespaces/kube-system?timeout=45s": context deadline exceeded
检查6443对应api-server的docker容器的日志:
docker logs -f kube-apiserver
报错信息为:
W0315 03:45:32.536470 1 clientconn.go:1120] grpc: addrConn.createTransport failed to connect to {https://192.168.200.10:2379 0 <nil>}. Err :connection error: desc = "transport: authentication handshake failed: read tcp 192.168.200.10:26328->192.168.200.10:2379: read: connection timed out". Reconnecting...
检查etcd容器的日志
docker logs -f etcd
2021-03-15 03:30:47.502782 E | rafthttp: request cluster ID mismatch (got a660753a6f52f139 want 5d9ecef6b01417cb)
2021-03-15 03:30:47.589028 E | rafthttp: request sent was ignored (cluster ID mismatch: peer[ac45d5f1b6bffa04]=a660753a6f52f139, local=5d9ecef6b01417cb)
原因是/var/lib/etcd/member/目录下的内容没有清空,etcd会查找旧的集群。所以重新安装集群之前,要把/var/lib/etcd/目录下的内容删掉