环境:RHEL 5.5 + 11.2.0.3 GI, 双节点。
问题描述: OBA 发现节点2 被重新启动, 之后无法加入到集群。
分析过程:
实际上这是两个不同的问题, 首先节点2被重新启动, 之后节点2无法加入集群。这需要从集群的告警日志开始分析。
1 . 节点1的集群alert.log
2014-08-26 17:23:09.846
[cssd(l3527)]CRS-1612:Network communication with node **** (2) missing for 50% of
timeout interval. Removal of this node from cluster in 14.190 seconds
2014-08-26 17:23:16.860
[cssd(l3527) ]CRS-1611:Network communication with node **** (2) missing for 75% of
timeout interval. Removal of this node from cluster in 7.180 seconds
2014-08-26 17:23:21.870
[cssd(l3527) ]CRS-1610:Network communication with node **** (2) missing for 90% of
timeout interval. Removal of this node from cluster in 2.170 seconds
2014-08-26 17:23:24.038
[cssd(l3527)JCRS-1607:Node **** 工s being evicted in cluster incarnation 76357678;
details at (:CSSNM00007:) in /***/***/log/*****/cssd/ocssd.log.
2014-08-26 17:23:25.020
2014-08-26 17:23:55.048
[cssd(l3527) ]CRS-1601:CSSD Reconfiguration complete. Active nodes are ******
2 . 节点2的集群alert.log
2014-08-26 17:23:10.052
[cssd(26141) ]CRS-1612:Network communication with node ****** (1) missing for 50%
of timeout interval. Removal of this node from cluster in 14.180 seconds
2014-08-26 17:23:17.066
[cssd(26141) ]CRS-1611:Network communication with node ****** (1) missing for 75%
of timeout interval. Removal of this node from cluster in 7.170 seconds
2014-08-26 17:23:22.076
[cssd(26141) ]CRS-1610:Network communication with node ****** (1) missing for 90%
of timeout interval. Removal of this node from cluster in 2.160 seconds
2014-08-26 17:23:24.243
[cssd(26141) JCRS-1608:T h i s node w a s evicted by node 1, ******; details at
(:CSSNMOOOOS:) in /***/***/log/****/cssd/ocssd.log.
2014-08-26 17:23:24.243
[cssd(26141) ]CRS-1608:T h i s n o d e w a s evicted by node 1, *****; details at
(:CSSNMOOOOS:) in /***/***/log/****/cssd/ocssd.log.
2014-08-26 17:23:24.243
[cssd(2614l)]CRS-1656:The CSS daemon is terminating due to a fatal error; Details
at (:CSSSC00012:) in /***/***/log/****/cssd/ocssd.log
2014-08-26 17:23:24.243
[cssd(2614l)]CRS-1652:Starting clean up of CRSD resources.
2014-08-26 17:23: