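It is the second day of the Qingming Festival holiday. I got a call from an unknown number, and it turned out to be the current project manager, "crying" to me that nobody could log in to the system while the customer was in the middle of moving data.
I have been working on big data lately, so I rarely touch Oracle these days, and these systems have been fairly stable anyway.
1. Here we go. As soon as I received the login details I opened my laptop, connected over ssh, and ran df -h. Oops, this is HP-UX, so I switched to bdf to check whether a file system holding the system logs had hit 100% and could no longer be written to. It had not.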
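The "is anything at 100%?" check can be scripted instead of eyeballed. Below is a minimal sketch, assuming the usual bdf column layout (the second-to-last field is %used, the last is the mount point) and that long device names wrap onto a line of their own. The function name full_filesystems is my own, not a system tool.

```shell
# full_filesystems [THRESHOLD]
# Reads bdf-style output on stdin and prints "mountpoint pct%" for every
# file system whose usage is at or above THRESHOLD (default 100).
# Assumption: second-to-last column is %used, last column is the mount point.
full_filesystems() {
    threshold="${1:-100}"
    awk -v limit="$threshold" '
        NR == 1 { next }            # skip the header line
        NF == 1 { next }            # long device name wrapped onto its own line
        {
            pct = $(NF - 1)         # the %used column, e.g. "97%"
            sub(/%/, "", pct)
            if (pct + 0 >= limit + 0) print $NF, pct "%"
        }
    '
}
```

On the server this would be used as `bdf | full_filesystems 95`. No output means nothing is that full.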
2. Check the cluster resource status:
$ crs_stat -t
CRS-0184: Cannot communicate with the CRS daemon
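3. When this fails, the usual causes are lost permissions on the raw devices, bad blocks on a disk, or a faulty network interface.
4. Check whether all four components are down: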
$ crsctl check crs
CRS-4638: Oracle High Availability Services is online
CRS-4535: Cannot communicate with Cluster Ready Services
CRS-4529: Cluster Synchronization Services is online
CRS-4533: Event Manager is online
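To pick the broken component out of that output without reading every line, a small filter is enough. This is only a sketch. It assumes healthy components are always reported with the phrase "is online", as in the output above, and crs_unhealthy is a name I made up.

```shell
# crs_unhealthy
# Reads `crsctl check crs` output on stdin and prints every CRS-nnnn line
# that does not say "is online". The exit status is 0 when at least one
# such line was found and non-zero when everything is online.
crs_unhealthy() {
    grep '^CRS-' | grep -v 'is online'
}
```

Usage would be `crsctl check crs | crs_unhealthy`. In this incident it would print only the CRS-4535 line, which says the problem is limited to crsd.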
5. Look at the CRS log:
2015-04-05 08:05:12.419: [ OCRASM][1]ASM Error Stack :
2015-04-05 08:05:12.517: [ OCRASM][1]proprasmo: kgfoCheckMount returned [6]
2015-04-05 08:05:12.517: [ OCRASM][1]proprasmo: The ASM disk group DG01R10 is not found or not mounted
2015-04-05 08:05:12.519: [ OCRRAW][1]proprioo: Failed to open [+DG01R10]. Returned proprasmo() with [26]. Marking location as UNAVAILABLE.
2015-04-05 08:05:12.519: [ OCRRAW][1]proprioo: No OCR/OLR devices are usable
2015-04-05 08:05:12.519: [ OCRASM][1]proprasmcl: asmhandle is NULL
2015-04-05 08:05:12.519: [ GIPC][1] gipcCheckInitialization: possible incompatible non-threaded init from [prom.c : 690], original from [clsss.c : 5326]
2015-04-05 08:05:12.523: [ default][1]clsvactversion:4: Retrieving Active Version from local storage.
2015-04-05 08:05:12.526: [ OCRRAW][1]proprrepauto: The local OCR configuration matches with the configuration published by OCR Cache Writer. No repair required.
2015-04-05 08:05:12.530: [ OCRRAW][1]proprinit: Could not open raw device
2015-04-05 08:05:12.530: [ OCRASM][1]proprasmcl: asmhandle is NULL
2015-04-05 08:05:12.532: [ OCRAPI][1]a_init:16!: Backend init unsuccessful : [26]
2015-04-05 08:05:12.532: [ CRSOCR][1] OCR context init failure. Error: PROC-26: Error while accessing the physical storage
2015-04-05 08:05:12.533: [ CRSMAIN][1] Created alert : (:CRSD00111:) : Could not init OCR, error: PROC-26: Error while accessing the physical storage
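The line that matters in that wall of log text is the one naming the disk group. A rough sketch for pulling it out follows. It assumes the message wording is exactly as in the log above, and failed_diskgroups is my own helper name.

```shell
# failed_diskgroups
# Reads crsd log text on stdin and prints, once each, every ASM disk group
# that the log reports as "not found or not mounted".
failed_diskgroups() {
    sed -n 's/.*ASM disk group \([^ ]*\) is not found or not mounted.*/\1/p' | sort -u
}
```

It would be fed the crsd log, for example `failed_diskgroups < crsd.log`. The log location depends on the installation, so I am not hard-coding a path here. For the excerpt above it prints DG01R10.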
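6. Check the ASM disk groups. DG01R10, which the log reported as not found or not mounted, is in fact mounted now:
SQL> select name,state,total_mb,free_mb from v$asm_diskgroup;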
NAME                           STATE         TOTAL_MB    FREE_MB
------------------------------ ----------- ---------- ----------
DG01R10                        MOUNTED       19436544    5783632
DG02R10REDO01                  MOUNTED         307200     183808
DG02R5                         MOUNTED      139124736   36946816
DG03R10REDO01                  MOUNTED         307200     183876
DG_NETAPP                      MOUNTED       31713372    2910471
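If the query output is saved or piped, the state of the one disk group that holds the OCR can be read off mechanically. This is a small sketch, assuming the NAME and STATE columns come first, as in the query above. diskgroup_state is a hypothetical helper name.

```shell
# diskgroup_state NAME
# Reads v$asm_diskgroup query output (name, state, ...) on stdin and prints
# the STATE column for the disk group called NAME. Prints nothing if the
# disk group is not listed.
diskgroup_state() {
    awk -v name="$1" '$1 == name { print $2 }'
}
```

With the output above, `diskgroup_state DG01R10` prints MOUNTED. So the storage is fine at this point, and crsd only needs to be started again.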
7. Start the crsd resource:
$ pwd
/u01/app/crs_home/bin
$ /u01/app/crs_home/bin/crsctl start res ora.crsd -init
CRS-2672: Attempting to start 'ora.crsd' on 'psmsdb01'
CRS-2676: Start of 'ora.crsd' on 'psmsdb01' succeeded
8. Then check the status again:
$ crsctl check crs    # still not healthy; crsd is probably still starting up, so be patient
CRS-4638: Oracle High Availability Services is online
CRS-4535: Cannot communicate with Cluster Ready Services
CRS-4529: Cluster Synchronization Services is online
CRS-4533: Event Manager is online
$ crsctl check crs    # all good now
CRS-4638: Oracle High Availability Services is online
CRS-4537: Cluster Ready Services is online
CRS-4529: Cluster Synchronization Services is online
CRS-4533: Event Manager is online
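Rather than re-running the check by hand until it turns green, the waiting can be put in a loop. This is a minimal sketch under a few assumptions: CRS-4537 is the "Cluster Ready Services is online" message, as in the output above, wait_for_crsd is my own name, and the retry count and delay are arbitrary.

```shell
# wait_for_crsd TRIES DELAY CMD...
# Runs CMD up to TRIES times, DELAY seconds apart, and returns 0 as soon as
# its output contains CRS-4537 (Cluster Ready Services is online).
# Returns 1 if that never happens.
wait_for_crsd() {
    tries="$1"
    delay="$2"
    shift 2
    i=1
    while [ "$i" -le "$tries" ]; do
        if "$@" 2>/dev/null | grep -q 'CRS-4537'; then
            return 0
        fi
        if [ "$i" -lt "$tries" ]; then
            sleep "$delay"
        fi
        i=$((i + 1))
    done
    return 1
}
```

For example, `wait_for_crsd 30 10 /u01/app/crs_home/bin/crsctl check crs` polls every 10 seconds for up to about five minutes. Passing the command as arguments keeps the loop easy to try out with a stand-in command first.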
9. Done.