一次主机重启后节点无法启动集群及数据库

远程登陆系统,检查发现,二节点集群以及实例确实没有启动,而一节点的是启动的,系统仍然能够对外提供服务。
node2:

node1:


检查集群日志:发现在启动CSSD进程时有报错。
检查cssd日志发现:有报has a disk HB, but no network HB,初步判断是由于网络的问题导致节点集群无法启动。
Ocssd.log

点击(此处)折叠或打开


  1. 2016-11-01 11:47:39.244: [ CSSD][88397568]clssscSelect: cookie accept request 0x7f22fc084650
  2. 2016-11-01 11:47:39.244: [ CSSD][88397568]clssscevtypSHRCON: getting client with cmproc 0x7f22fc084650
  3. 2016-11-01 11:47:39.244: [ CSSD][88397568]clssgmRegisterClient: proc(4/0x7f22fc084650), client(4/0x7f22fc077720)
  4. 2016-11-01 11:47:39.244: [ CSSD][88397568]clssgmExecuteClientRequest(): type(6) size(684) only connect and exit messages are allowed before lease acquisition proc(0x7f22fc084650) client(0x7f22fc077720)
  5. 2016-11-01 11:47:39.245: [ CSSD][88397568]clssgmDiscEndpcl: gipcDestroy 0x4c9
  6. 2016-11-01 11:47:39.579: [ CSSD][88397568]clssscSelect: cookie accept request 0x7f22fc06f050
  7. 2016-11-01 11:47:39.580: [ CSSD][88397568]clssscevtypSHRCON: getting client with cmproc 0x7f22fc06f050
  8. 2016-11-01 11:47:39.580: [ CSSD][88397568]clssgmRegisterClient: proc(3/0x7f22fc06f050), client(1/0x7f22fc07d2e0)
  9. 2016-11-01 11:47:39.580: [ CSSD][88397568]clssgmExecuteClientRequest(): type(6) size(684) only connect and exit messages are allowed before lease acquisition proc(0x7f22fc06f050) client(0x7f22fc07d2e0)
  10. 2016-11-01 11:47:39.580: [ CSSD][88397568]clssgmDiscEndpcl: gipcDestroy 0x4e1
  11. 2016-11-01 11:47:39.587: [ CSSD][104158976]clssnmlfmtlease: uniqueness 1477972049, gipc addr gipcha://oproject2:nm2_oprojec-cluster
  12. 2016-11-01 11:47:39.590: [ CSSD][104158976]clssnmvStatusBlkInit: myinfo nodename oproject2, uniqueness 1477972049
  13. 2016-11-01 11:47:39.628: [ CSSD][104158976]clssnmlgetslot:lease acquisition for node oproject2/slot 2 completed in 5030 msecs
  14. 2016-11-01 11:47:39.636: [ CSSD][104158976]clssnmvDHBValidateNcopy: node 1, oproject1, has a disk HB, but no network HB, DHB has rcfg 366471331, wrtcnt, 22240309, LATS 4294065090, lastSeqNo 0, uniqueness 1470976437, timestamp 1477972068/800668394
  15. 2016-11-01 11:47:39.636: [ CSSD][104158976]clssnmvDHBValidateNcopy: node 2, oproject2, has a disk HB, but no network HB, DHB has rcfg 366471330, wrtcnt, 4686552, LATS 4294065090, lastSeqNo 0, uniqueness 1477972049, timestamp 1473386969/1790883314
  16. 2016-11-01 11:47:39.659: [ SKGFD][104158976]Lib :UFS:: closing handle 0x2483ab0 for disk :/dev/CRS1:
  17. 2016-11-01 11:47:39.659: [ CSSD][104158976]clssnmInitNodeDB: Initializing with OCR id 0
  18. 2016-11-01 11:47:39.666: [ CSSD][88397568]clssscSelect: cookie accept request 0x24b0040
  19. 2016-11-01 11:47:39.666: [ CSSD][88397568]clssgmAllocProc: (0x7f22fc09a380) allocated
  20. 2016-11-01 11:47:39.666: [ CSSD][88397568]clssgmClientConnectMsg: properties of cmProc 0x7f22fc09a380 - 1,2,3,4,5
  21. 2016-11-01 11:47:39.666: [ CSSD][88397568]clssgmClientConnectMsg: Connect from con(0x52d) proc(0x7f22fc09a380) pid(2684) version 11:2:1:4, properties: 1,2,3,4,5
  22. 2016-11-01 11:47:39.666: [ CSSD][88397568]clssgmClientConnectMsg: msg flags 0x0000
  23. 2016-11-01 11:47:39.667: [ CSSD][88397568]clssgmExecuteClientRequest(): type(37) size(80) only connect and exit messages are allowed before lease acquisition proc(0x7f22fc09a380) client((nil))
  24. 2016-11-01 11:47:39.667: [ CSSD][88397568]clssgmDeadProc: proc 0x7f22fc09a380
  25. 2016-11-01 11:47:39.667: [ CSSD][88397568]clssgmDestroyProc: cleaning up proc(0x7f22fc09a380) con(0x52d) skgpid ospid 2684 with 0 clients, refcount 0
检查系统日志:发现在重启主机的时候,启动 eth1 网卡,即心跳网,启动失败报错改 IP 已被其他主机占用。

使用心跳IP进行SSH,发现无法登陆。后续联系系统工程师检查网络占用情况,了解到38网段是与存储通信,同时作为数据库的心跳,其他服务器与存储连接端口占用。

将占用心跳 IP 的主机修改后,数据库成功启动。


来自 “ ITPUB博客 ” ,链接:http://blog.itpub.net/29123031/viewspace-2132267/,如需转载,请注明出处,否则将追究法律责任。

转载于:http://blog.itpub.net/29123031/viewspace-2132267/

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值