原系统删除11g Rac后安装10g Rac中的问题处理

    最近在自己虚拟机中将原来的11g rac删除掉,打算在原来的系统中安装10g rac,结果遇到了一些问题,特此记录下:


首先说下安装执行root.sh报错后的处理(重新执行root.sh)

#安装时出错,图形界面不要关闭
1.两节点分别执行$CRS_HOME/install/rootdelete.sh
2.某一节点执行$CRS_HOME/install/rootdeinstall.sh

3.

dd if=/dev/zero of=/dev/raw/raw1 bs=1M count=400
dd if=/dev/zero of=/dev/raw/raw2 bs=1M count=400
dd if=/dev/zero of=/dev/raw/raw3 bs=1M count=400
dd if=/dev/zero of=/dev/raw/raw4 bs=1M count=400
dd if=/dev/zero of=/dev/raw/raw5 bs=1M count=400


4.重新执行root.sh


其次说下执行节点二root.sh脚本时的情况


节点二执行root脚本时,出现如下界面,crsd与evmd两个服务一直没起来,等待超时。

[root@rac2 crs]# ./root.sh 
WARNING: directory '/u01/app/oracle/product/10.2.0' is not owned by root
WARNING: directory '/u01/app/oracle/product' is not owned by root
WARNING: directory '/u01/app/oracle' is not owned by root
WARNING: directory '/u01/app' is not owned by root
WARNING: directory '/u01' is not owned by root
Checking to see if Oracle CRS stack is already configured
/etc/oracle does not exist. Creating it now.

Setting the permissions on OCR backup directory
Setting up NS directories
Oracle Cluster Registry configuration upgraded successfully
WARNING: directory '/u01/app/oracle/product/10.2.0' is not owned by root
WARNING: directory '/u01/app/oracle/product' is not owned by root
WARNING: directory '/u01/app/oracle' is not owned by root
WARNING: directory '/u01/app' is not owned by root
WARNING: directory '/u01' is not owned by root
clscfg: EXISTING configuration version 3 detected.
clscfg: version 3 is 10G Release 2.
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node <nodenumber>: <nodename> <private interconnect name> <hostname>
node 1: rac1 rac1priv rac1
node 2: rac2 rac2priv rac2
clscfg: Arguments check out successfully.

NO KEYS WERE WRITTEN. Supply -force parameter to override.
-force is destructive and will destroy any previous cluster
configuration.
Oracle Cluster Registry for cluster has already been initialized
Startup will be queued to init within 90 seconds.
Adding daemons to inittab
Expecting the CRS daemons to be up within 600 seconds.
CSS is active on these nodes.
	rac1
	rac2
CSS is active on all nodes.
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Waiting for the Oracle CRSD and EVMD to start
Timed out waiting for the CRS stack to start. 


正常情况下,节点二执行root.sh应该会报错,然后我们再手动添加网络信息,执行vipca,结果没有报错

#正常情况下
[root@rac2 bin]# ./oifcfg getif
eth0  192.168.50.0  global  public
eth1  10.10.10.0  global  cluster_interconnect 
[root@rac2 bin]# vi vipca
[root@rac2 bin]# ./vipca

但是此刻我们执行vipca后,发现什么图形界面都没出来,此时我们看下集群状态(crsd与evmd一直没有起来)

[root@rac2 bin]# ./crsctl check crs
CSS appears healthy
Cannot communicate with CRS
Cannot communicate with EVM  


看下vipca的日志以及crsd日志:

#/u01/app/oracle/product/10.2.0/crs/cfgtoollogs/vipca/vipca.log

[Thread-10] [11:56:28:967] [RuntimeExec.runCommand:74]  Calling Runtime.exec() with the command
   2204 [Thread-10] [11:56:28:967] [RuntimeExec.runCommand:76]  /u01/app/oracle/product/10.2.0/crs/bin/crs_stat
   2205 [Thread-10] [11:56:28:968] [RuntimeExec.runCommand:76]  -u
   2206 [Thread-10] [11:56:28:968] [RuntimeExec.runCommand:76]  ora.rac1.gsd
   2207 [Thread-30] [11:56:29:1] [StreamReader.run:61]  In StreamReader.run
   2208 [Thread-10] [11:56:29:3] [RuntimeExec.runCommand:131]  runCommand: Waiting for the process
   2209 [Thread-29] [11:56:29:4] [StreamReader.run:61]  In StreamReader.run
   2210 [Thread-29] [11:56:29:113] [StreamReader.run:65]  OUTPUT>CRS-0210: Could not find resource ora.rac1.gsd.
   2211 [Thread-29] [11:56:29:113] [StreamReader.run:65]  OUTPUT>
   2212 [Thread-10] [11:56:29:114] [RuntimeExec.runCommand:133]  runCommand: process returns -46
   2213 [Thread-10] [11:56:29:114] [RuntimeExec.runCommand:147]  RunTimeExec: output>
   2214 [Thread-10] [11:56:29:116] [RuntimeExec.runCommand:150]  CRS-0210: Could not find resource ora.rac1.gsd.
   2215 [Thread-10] [11:56:29:117] [RuntimeExec.runCommand:150]
   2216 [Thread-10] [11:56:29:117] [RuntimeExec.runCommand:155]  RunTimeExec: error>
   2217 [Thread-10] [11:56:29:118] [RuntimeExec.runCommand:175]  Returning from RunTimeExec.runCommand
   2218 [Thread-10] [11:56:29:118] [HAOperationImpl.runCommand:1272]  Unsigned return code = 210
   2219 [Thread-10] [11:56:29:119] [HAOperationImpl.runCommand:1286]  set status HA_RES_NOT_EXIST_ERR
   2220 [Thread-10] [11:56:29:119] [HAStatusOperation.run:127]  Returned from executing the HA Operation
   2221 [Thread-10] [11:56:29:119] [HAStatusOperation.run:132]  OUTPUT> CRS-0210: Could not find resource ora.rac1.gsd.
   2222 [Thread-10] [11:56:29:120] [HAStatusOperation.run:132]  OUTPUT>
   2223 [Thread-10] [11:56:29:120] [HAStatusOperation.run:138]  ERROR>
   2224 [Thread-10] [11:56:29:120] [LocalCommand.execute:56]  LocalCommand.execute: Returned from run method
   2225 [Thread-10] [11:56:29:120] [Operation.<init>:63]  Setting to be a synchronized operation
   2226 [Thread-10] [11:56:29:120] [HASContext.getInstance:190]  Module init : 6
   2227 [Thread-10] [11:56:29:121] [HASContext.getInstance:214]  Local Module init : 0
   2228 [Thread-10] [11:56:29:121] [HASContext.getInstance:249]  HAS Context Allocated: 39 to oracle.ops.mgmt.has.Util@18efaea
   2229 [Thread-10] [11:56:29:122] [Util.<init>:86]  Util Instance created.
   2230 [Thread-10] [11:56:29:123] [HASContext.getCRSHome:450]  /u01/app/oracle/product/10.2.0/crs
   2231 [Thread-10] [11:56:29:123] [Util.getCRSHome:428]  getCRSHome: ret=/u01/app/oracle/product/10.2.0/crs
   2232 [Thread-10] [11:56:29:123] [HAOperation.setCRSHome:148]  CRS_HOME: /u01/app/oracle/product/10.2.0/crs
   2233 [Thread-10] [11:56:29:123] [HAOperation.checkCommandVerb:211]  Checking an existence of CRS command
   2234 [Thread-10] [11:56:29:123] [LocalCommand.execute:54]  LocalCommand.execute: Calling run method
   2235 [Thread-10] [11:56:29:123] [HAProfileOperation.run:175]  Executing the HA operation /u01/app/oracle/product/10.2.0/crs/bin/crs_profil        e ora.rac1.gsd
#再看crsd日志
2017-07-04 12:02:16.755: [ COMMCRS][3926374720]clsc_connect: (0x541ab40) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))

2017-07-04 12:02:17.453: [ COMMCRS][3926374720]clsc_connect: (0x53ce880) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))

2017-07-04 12:02:18.646: [ COMMCRS][3926374720]clsc_connect: (0x53cc0f0) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))

2017-07-04 12:02:19.586: [ COMMCRS][3926374720]clsc_connect: (0x53d46b0) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))

2017-07-04 12:02:21.028: [ COMMCRS][3926374720]clsc_connect: (0x51b54d0) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))

2017-07-04 12:02:21.711: [ COMMCRS][3926374720]clsc_connect: (0x5422b50) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))

2017-07-04 12:02:23.145: [ COMMCRS][3926374720]clsc_connect: (0x536d050) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))

2017-07-04 12:02:24.581: [ COMMCRS][3926374720]clsc_connect: (0x541ab40) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))

2017-07-04 12:02:25.020: [ COMMCRS][3926374720]clsc_connect: (0x53ce880) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))

2017-07-04 12:02:26.205: [ COMMCRS][3926374720]clsc_connect: (0x53cc0f0) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))

2017-07-04 12:02:26.890: [ COMMCRS][3926374720]clsc_connect: (0x53d46b0) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))

2017-07-04 12:02:27.325: [ COMMCRS][3926374720]clsc_connect: (0x51b54d0) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))

2017-07-04 12:02:27.759: [ COMMCRS][3926374720]clsc_connect: (0x5422b50) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))

2017-07-04 12:02:28.691: [ COMMCRS][3926374720]clsc_connect: (0x536d050) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))

2017-07-04 12:02:30.123: [ COMMCRS][3926374720]clsc_connect: (0x541ab40) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))

2017-07-04 12:02:31.057: [ COMMCRS][3926374720]clsc_connect: (0x5434960) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))


后来看了下/var/tmp/.oracle 以及 /tmp/.oracle两个目录的权限,发现权限有问题,然后删除掉之后重新启动crs,正常启动,然后重新执行vipca成功启动。


最后在升级到10.2.0.5时,发现节点一无法启动,cssd日志一直报错

clssnmvReadDskHeartbeat: read ALL for Joining

后来经过排查发现还是以上两个文件权限问题(.oracle)


-=----------------------------------------------------------------

后来在网上查询下资料,发现对于出现以下问题的,一般问题有如下解决方法

问题:

crsd.log

2017-07-04 12:02:16.755: [ COMMCRS][3926374720]clsc_connect: (0x541ab40) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))

2017-07-04 12:02:17.453: [ COMMCRS][3926374720]clsc_connect: (0x53ce880) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))

2017-07-04 12:02:18.646: [ COMMCRS][3926374720]clsc_connect: (0x53cc0f0) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))

2017-07-04 12:02:19.586: [ COMMCRS][3926374720]clsc_connect: (0x53d46b0) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))
[root@rac2 bin]# ./crsctl check crs
CSS appears healthy
Cannot communicate with CRS
Cannot communicate with EVM 


问题原因解决:

1.防火墙原因
Oracle Metalink 上的一种类似的情况, 是因为防火墙的原因。 但是我的防火墙在安装系统的时候就关闭了。
问题表现, ping 私有IP 正常, 但是用tracert 私有IP。 就会有如下错误:
# traceroute 192.168.0.2
traceroute to  192.168.0.2 (192.168.0.2), 30 hops max, 46 byte packets
1  rac2prv (192.168.0.2)   0.201 ms !<10>   0.198 ms !<10>   0.109 ms !<10>

如果是这种情况, 关闭防火墙就可以了
# service iptables stop
# chkconfig iptables off.
 
 
2.raw 设备的权限问题
因为raw的配置是按照Oracle 官方文档配置的。

3.相关目录的权限问题
CRS 需要往相关的文件写一些信息,如果这些文件夹有权限问题,导致文件不能写。 也可能会出现这种情况。 
几个相关的目录:/var/tmp/.oracle, /tmp/.oracle和$CRS_HOME/log/sid/
Oracle 会往这几个文件里写一些socket和log的信息。 如果不能写,就会导致CRS不能启动。
如何判断是不是这个问题导致CRS不能启动的方法很简单。 就是先将这2个文件夹清空。 在启动CRS。 如果有文件生成就说明权限没有问题。
注意的事,要先关闭CRS。 如果CRS 在运行, 强制删除这2个文件夹,可能会导致CRS 挂掉。
      
 
RAC root.sh Oracle CRS stack is already configured and will be running under init(1M) 的解决方法
http://blog.csdn.net/tianlesoftware/archive/2010/02/21/5314804.aspx





























评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值