节点2上报错 /var/log/messages,VIP网段有在早上断开过,已经和网络工程师确认 。
Feb 16 06:54:01 ecsrac02 kernel: bnx2: eth0 NIC Link is Down
Feb 16 06:56:07 ecsrac02 kernel: bnx2: eth0 NIC Link is Up, 1000 Mbps full duplex
节点2上的 alert log :
Mon Feb 16 05:11:30 2009
Thread 2 advanced to log sequence 3583 (LGWR switch)
Current log# 10 seq# 3583 mem# 0: /ocfs_data1/ecsdb/redo10a.log
Current log# 10 seq# 3583 mem# 1: /ocfs_index1/ecsdb/redo10b.log
Mon Feb 16 05:15:30 2009
Thread 2 advanced to log sequence 3584 (LGWR switch)
Current log# 11 seq# 3584 mem# 0: /ocfs_data1/ecsdb/redo11a.log
Current log# 11 seq# 3584 mem# 1: /ocfs_index1/ecsdb/redo11b.log
Mon Feb 16 06:54:42 2009
ALTER SYSTEM SET service_names='' SCOPE=MEMORY SID='ecsdb2';
Mon Feb 16 06:54:42 2009
Immediate Kill Session#: 1036, Serial#: 55845
Immediate Kill Session: sess: 0x25cb1d5b8 OS pid: 30068
Immediate Kill Session#: 1038, Serial#: 13595
Immediate Kill Session: sess: 0x25cb1eb20 OS pid: 11821
Immediate Kill Session#: 1042, Serial#: 50814
Immediate Kill Session: sess: 0x25cb215f0 OS pid: 16328
Immediate Kill Session#: 1047, Serial#: 10969
Immediate Kill Session: sess: 0x25fb36f48 OS pid: 14038
Immediate Kill Session#: 1113, Serial#: 46642
Immediate Kill Session: sess: 0x25fb631b0 OS pid: 31750
Immediate Kill Session#: 1118, Serial#: 26560
Immediate Kill Session: sess: 0x25cb54360 OS pid: 7687
Mon Feb 16 06:54:43 2009
Process OS id : 7687 alive after kill
Errors in file
Immediate Kill Session#: 1131, Serial#: 57076
Immediate Kill Session: sess: 0x25fb6f258 OS pid: 7157
Immediate Kill Session#: 1179, Serial#: 25165
Immediate Kill Session: sess: 0x25fb8f418 OS pid: 28197
Immediate Kill Session#: 1180, Serial#: 24265
Immediate Kill Session: sess: 0x25cb7daf8 OS pid: 11359
Immediate Kill Session#: 1205, Serial#: 54612
Immediate Kill Session: sess: 0x25fba0a60 OS pid: 19502
Immediate Kill Session#: 1210, Serial#: 38467
Immediate Kill Session: sess: 0x25cb91c10 OS pid: 26109
Immediate Kill Session#: 1211, Serial#: 60805
Immediate Kill Session: sess: 0x25fba4a98 OS pid: 6343
Immediate Kill Session#: 1246, Serial#: 6259
Immediate Kill Session: sess: 0x25cba9d60 OS pid: 21726
发现节点2的监听关闭,其他正常
ecsrac02$
ecsrac02$crs_stat -t
Name Type Target State Host
------------------------------------------------------------
ora.ecsdb.db application ONLINE ONLINE ecsrac02
ora....b1.inst application ONLINE ONLINE ecsrac01
ora....b2.inst application ONLINE ONLINE ecsrac02
ora....01.lsnr application ONLINE ONLINE ecsrac01
ora....c01.gsd application ONLINE ONLINE ecsrac01
ora....c01.ons application ONLINE ONLINE ecsrac01
ora....c01.vip application ONLINE ONLINE ecsrac01
ora....02.lsnr application ONLINE OFFLINE
ora....c02.gsd application ONLINE ONLINE ecsrac02
ora....c02.ons application ONLINE ONLINE ecsrac02
ora....c02.vip application ONLINE ONLINE ecsrac01
--------------------
ora....c02.vip application ONLINE ONLINE ecsrac01 可以看到 2节点上的vip漂移到节点1上
节点1上 ifconfig 也可以看到两个虚拟IP在节点1上 。
问题:
1. 刚开始crs_stat -t 看到节点2的实例和监听都是offline . 后来看到只有监听是offline , 不知道ip漂移后oracle是如何做的 。
2. 节点2上 alert log 可以看到下面这些命令
ALTER SYSTEM SET service_names='' SCOPE=MEMORY SID='ecsdb2';
Mon Feb 16 06:54:42 2009
Immediate Kill Session#: 1036, Serial#: 55845
Immediate Kill Session: sess: 0x25cb1d5b8 OS pid: 30068
Immediate Kill Session#: 1038, Serial#: 13595
Immediate Kill Session: sess: 0x25cb1eb20 OS pid: 11821
将service_names 置为空,会将哪些节点2上的服务关闭,节点1会做些什么 ?
3. 重新启动节点2上的监听,还是不能让它online
ecsrac02$crs_stop ora.ecsrac02.LISTENER_ECSRAC02.lsnr
Target set to OFFLINE for `ora.ecsrac02.LISTENER_ECSRAC02.lsnr`
ecsrac02$
ecsrac02$crs_start ora.ecsrac02.LISTENER_ECSRAC02.lsnr
ecsrac01 : CRS-1019: Resource ora.ecsrac02.LISTENER_ECSRAC02.lsnr (application) cannot run on ecsrac01
ecsrac02 : CRS-1018: Resource ora.ecsrac02.vip (application) is already running on ecsrac01
CRS-0223: Resource 'ora.ecsrac02.LISTENER_ECSRAC02.lsnr' has placement error.
ecsrac02$
需要重新启动节点2的所有服务吗 ? [root@ecsrac02 ~]# /etc/init.d/init.crs start
Feb 16 06:54:01 ecsrac02 kernel: bnx2: eth0 NIC Link is Down
Feb 16 06:56:07 ecsrac02 kernel: bnx2: eth0 NIC Link is Up, 1000 Mbps full duplex
节点2上的 alert log :
Mon Feb 16 05:11:30 2009
Thread 2 advanced to log sequence 3583 (LGWR switch)
Current log# 10 seq# 3583 mem# 0: /ocfs_data1/ecsdb/redo10a.log
Current log# 10 seq# 3583 mem# 1: /ocfs_index1/ecsdb/redo10b.log
Mon Feb 16 05:15:30 2009
Thread 2 advanced to log sequence 3584 (LGWR switch)
Current log# 11 seq# 3584 mem# 0: /ocfs_data1/ecsdb/redo11a.log
Current log# 11 seq# 3584 mem# 1: /ocfs_index1/ecsdb/redo11b.log
Mon Feb 16 06:54:42 2009
ALTER SYSTEM SET service_names='' SCOPE=MEMORY SID='ecsdb2';
Mon Feb 16 06:54:42 2009
Immediate Kill Session#: 1036, Serial#: 55845
Immediate Kill Session: sess: 0x25cb1d5b8 OS pid: 30068
Immediate Kill Session#: 1038, Serial#: 13595
Immediate Kill Session: sess: 0x25cb1eb20 OS pid: 11821
Immediate Kill Session#: 1042, Serial#: 50814
Immediate Kill Session: sess: 0x25cb215f0 OS pid: 16328
Immediate Kill Session#: 1047, Serial#: 10969
Immediate Kill Session: sess: 0x25fb36f48 OS pid: 14038
Immediate Kill Session#: 1113, Serial#: 46642
Immediate Kill Session: sess: 0x25fb631b0 OS pid: 31750
Immediate Kill Session#: 1118, Serial#: 26560
Immediate Kill Session: sess: 0x25cb54360 OS pid: 7687
Mon Feb 16 06:54:43 2009
Process OS id : 7687 alive after kill
Errors in file
Immediate Kill Session#: 1131, Serial#: 57076
Immediate Kill Session: sess: 0x25fb6f258 OS pid: 7157
Immediate Kill Session#: 1179, Serial#: 25165
Immediate Kill Session: sess: 0x25fb8f418 OS pid: 28197
Immediate Kill Session#: 1180, Serial#: 24265
Immediate Kill Session: sess: 0x25cb7daf8 OS pid: 11359
Immediate Kill Session#: 1205, Serial#: 54612
Immediate Kill Session: sess: 0x25fba0a60 OS pid: 19502
Immediate Kill Session#: 1210, Serial#: 38467
Immediate Kill Session: sess: 0x25cb91c10 OS pid: 26109
Immediate Kill Session#: 1211, Serial#: 60805
Immediate Kill Session: sess: 0x25fba4a98 OS pid: 6343
Immediate Kill Session#: 1246, Serial#: 6259
Immediate Kill Session: sess: 0x25cba9d60 OS pid: 21726
发现节点2的监听关闭,其他正常
ecsrac02$
ecsrac02$crs_stat -t
Name Type Target State Host
------------------------------------------------------------
ora.ecsdb.db application ONLINE ONLINE ecsrac02
ora....b1.inst application ONLINE ONLINE ecsrac01
ora....b2.inst application ONLINE ONLINE ecsrac02
ora....01.lsnr application ONLINE ONLINE ecsrac01
ora....c01.gsd application ONLINE ONLINE ecsrac01
ora....c01.ons application ONLINE ONLINE ecsrac01
ora....c01.vip application ONLINE ONLINE ecsrac01
ora....02.lsnr application ONLINE OFFLINE
ora....c02.gsd application ONLINE ONLINE ecsrac02
ora....c02.ons application ONLINE ONLINE ecsrac02
ora....c02.vip application ONLINE ONLINE ecsrac01
--------------------
ora....c02.vip application ONLINE ONLINE ecsrac01 可以看到 2节点上的vip漂移到节点1上
节点1上 ifconfig 也可以看到两个虚拟IP在节点1上 。
问题:
1. 刚开始crs_stat -t 看到节点2的实例和监听都是offline . 后来看到只有监听是offline , 不知道ip漂移后oracle是如何做的 。
2. 节点2上 alert log 可以看到下面这些命令
ALTER SYSTEM SET service_names='' SCOPE=MEMORY SID='ecsdb2';
Mon Feb 16 06:54:42 2009
Immediate Kill Session#: 1036, Serial#: 55845
Immediate Kill Session: sess: 0x25cb1d5b8 OS pid: 30068
Immediate Kill Session#: 1038, Serial#: 13595
Immediate Kill Session: sess: 0x25cb1eb20 OS pid: 11821
将service_names 置为空,会将哪些节点2上的服务关闭,节点1会做些什么 ?
3. 重新启动节点2上的监听,还是不能让它online
ecsrac02$crs_stop ora.ecsrac02.LISTENER_ECSRAC02.lsnr
Target set to OFFLINE for `ora.ecsrac02.LISTENER_ECSRAC02.lsnr`
ecsrac02$
ecsrac02$crs_start ora.ecsrac02.LISTENER_ECSRAC02.lsnr
ecsrac01 : CRS-1019: Resource ora.ecsrac02.LISTENER_ECSRAC02.lsnr (application) cannot run on ecsrac01
ecsrac02 : CRS-1018: Resource ora.ecsrac02.vip (application) is already running on ecsrac01
CRS-0223: Resource 'ora.ecsrac02.LISTENER_ECSRAC02.lsnr' has placement error.
ecsrac02$
需要重新启动节点2的所有服务吗 ? [root@ecsrac02 ~]# /etc/init.d/init.crs start
来自 “ ITPUB博客 ” ,链接:http://blog.itpub.net/35489/viewspace-553396/,如需转载,请注明出处,否则将追究法律责任。
转载于:http://blog.itpub.net/35489/viewspace-553396/