ZKFC error: ZK Failover Controller failed: Received stat error from Zookeeper. code:CONNECTIONLOSS

In NameNode HA mode, one NameNode went down and the other could not be switched to Active. The zkfc log showed that the connection to ZooKeeper had timed out:

2020-07-04 10:32:06,356 INFO org.apache.zookeeper.ClientCnxn: Unable to read additional data from server sessionid 0x1019acb4933001f, likely server has closed socket, closing socket connection and attempting reconnect
2020-07-04 10:32:06,666 INFO org.apache.zookeeper.ClientCnxn: Opening socket connection to server datalaker-nn01/10.130.14.201:2181. Will not attempt to authenticate using SASL (unknown error)
2020-07-04 10:32:06,667 INFO org.apache.zookeeper.ClientCnxn: Socket connection established to datalaker-nn01/10.130.14.201:2181, initiating session
2020-07-04 10:32:06,667 INFO org.apache.zookeeper.ClientCnxn: Unable to read additional data from server sessionid 0x1019acb4933001f, likely server has closed socket, closing socket connection and attempting reconnect
2020-07-04 10:32:08,731 INFO org.apache.zookeeper.ClientCnxn: Opening socket connection to server datalaker-nn02/10.130.14.36:2181. Will not attempt to authenticate using SASL (unknown error)
2020-07-04 10:32:08,731 INFO org.apache.zookeeper.ClientCnxn: Client session timed out, have not heard from server in 1964ms for sessionid 0x1019acb4933001f, closing socket connection and attempting reconnect
2020-07-04 10:32:08,832 FATAL org.apache.hadoop.ha.ActiveStandbyElector: Received stat error from Zookeeper. code:CONNECTIONLOSS. Not retrying further znode monitoring connection errors.
2020-07-04 10:32:09,918 INFO org.apache.zookeeper.ZooKeeper: Session: 0x1019acb4933001f closed
2020-07-04 10:32:09,918 FATAL org.apache.hadoop.ha.ZKFailoverController: Fatal error occurred:Received stat error from Zookeeper. code:CONNECTIONLOSS. Not retrying further znode monitoring connection errors.
2020-07-04 10:32:09,918 INFO org.apache.zookeeper.ClientCnxn: EventThread shut down
2020-07-04 10:32:09,918 INFO org.apache.hadoop.ipc.Server: Stopping server on 8019
2020-07-04 10:32:09,920 INFO org.apache.hadoop.ha.ActiveStandbyElector: Yielding from election
2020-07-04 10:32:09,920 INFO org.apache.hadoop.ipc.Server: Stopping IPC Server listener on 8019
2020-07-04 10:32:09,920 INFO org.apache.hadoop.ipc.Server: Stopping IPC Server Responder
2020-07-04 10:32:09,920 INFO org.apache.hadoop.ha.HealthMonitor: Stopping HealthMonitor thread
2020-07-04 10:32:09,921 FATAL org.apache.hadoop.hdfs.tools.DFSZKFailoverController: Got a fatal error, exiting now
java.lang.RuntimeException: ZK Failover Controller failed: Received stat error from Zookeeper. code:CONNECTIONLOSS. Not retrying further znode monitoring connection errors.
        at org.apache.hadoop.ha.ZKFailoverController.mainLoop(ZKFailoverController.java:369)
        at org.apache.hadoop.ha.ZKFailoverController.doRun(ZKFailoverController.java:238)
        at org.apache.hadoop.ha.ZKFailoverController.access$000(ZKFailoverController.java:61)
        at org.apache.hadoop.ha.ZKFailoverController$1.run(ZKFailoverController.java:172)
        at org.apache.hadoop.ha.ZKFailoverController$1.run(ZKFailoverController.java:168)
        at org.apache.hadoop.security.SecurityUtil.doAsLoginUserOrFatal(SecurityUtil.java:415)
        at org.apache.hadoop.ha.ZKFailoverController.run(ZKFailoverController.java:168)
        at org.apache.hadoop.hdfs.tools.DFSZKFailoverController.main(DFSZKFailoverController.java:181)

Solution:
Edit the ZooKeeper configuration file zoo.cfg and change tickTime to 4000 ms (the default is 2000 ms).
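
One possible reason the change helps: per the ZooKeeper administrator's guide, the server clamps each client's requested session timeout into the range [minSessionTimeout, maxSessionTimeout], which default to 2 x tickTime and 20 x tickTime. Raising tickTime therefore raises the smallest session timeout a client can be granted. The sketch below only illustrates that arithmetic. The function name is hypothetical, and the requested timeout of 5000 ms is an assumption (the original post does not state it; on a real cluster, check ha.zookeeper.session-timeout.ms).

```python
def negotiated_session_timeout(requested_ms, tick_time_ms,
                               min_ms=None, max_ms=None):
    """Clamp a client's requested session timeout the way a ZooKeeper
    server does when minSessionTimeout / maxSessionTimeout are left unset
    (documented defaults: 2 * tickTime and 20 * tickTime)."""
    lower = 2 * tick_time_ms if min_ms is None else min_ms
    upper = 20 * tick_time_ms if max_ms is None else max_ms
    return max(lower, min(requested_ms, upper))


# Assumed ZKFC request of 5000 ms (hypothetical; not given in the post).
REQUESTED_MS = 5000

print(negotiated_session_timeout(REQUESTED_MS, 2000))  # 5000
print(negotiated_session_timeout(REQUESTED_MS, 4000))  # 8000
```

Under these assumptions, tickTime=4000 lifts the granted session timeout from 5000 ms to 8000 ms, which gives the ZKFC more slack before a slow or briefly unreachable ZooKeeper server leads to CONNECTIONLOSS.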

Reposted from: https://www.cnblogs.com/yjt1993/p/12185866.html
