对于前段时间在某公司营销管理系统上部署实施的一套RAC的总结。
1、系统环境
root@jfp595902c:/>oslevel -s
6100-07-05-1228
2、软件环境
11.2.0.3.0
对于出现的问题如下描述:
在第一个节点执行root.sh现象
root@jfp595901c:/>/u01/app/oraInventory/orainstRoot.sh
Changing permissions of /u01/app/oraInventory.
Adding read,write permissions for group.
Removing read,write,execute permissions for world.
Changing groupname of /u01/app/oraInventory to oinstall.
The execution of the script is complete.
root@jfp595901c:/>/u01/app/11.2.0/grid/root.sh
Performing root user operation for Oracle 11g
The following environment variables are set as:
ORACLE_OWNER= grid
ORACLE_HOME= /u01/app/11.2.0/grid
Enter the full pathname of the local bin directory: [/usr/local/bin]:
The contents of "dbhome" have not changed. No need to overwrite.
The contents of "oraenv" have not changed. No need to overwrite.
The contents of "coraenv" have not changed. No need to overwrite.
Creating /etc/oratab file...
Entries will be added to the /etc/oratab file as needed by
Database Configuration Assistant when a database is created
Finished running generic part of root script.
Now product-specific root actions will be performed.
Using configuration parameter file: /u01/app/11.2.0/grid/crs/install/crsconfig_params
Creating trace directory
User ignored Prerequisites during installation
User grid has the required capabilities to run CSSD in realtime mode
OLR initialization - successful
root wallet
root wallet cert
root cert export
peer wallet
profile reader wallet
pa wallet
peer wallet keys
pa wallet keys
peer cert request
pa cert request
peer cert
pa cert
peer root cert TP
profile reader root cert TP
pa root cert TP
peer pa cert TP
pa peer cert TP
profile reader pa cert TP
profile reader peer cert TP
peer user cert
pa user cert
Adding Clusterware entries to inittab
CRS-2672: Attempting to start 'ora.mdnsd' on 'jfp595901c'
CRS-2676: Start of 'ora.mdnsd' on 'jfp595901c' succeeded
CRS-2672: Attempting to start 'ora.gpnpd' on 'jfp595901c'
CRS-2676: Start of 'ora.gpnpd' on 'jfp595901c' succeeded
CRS-2672: Attempting to start 'ora.cssdmonitor' on 'jfp595901c'
CRS-2672: Attempting to start 'ora.gipcd' on 'jfp595901c'
CRS-2676: Start of 'ora.cssdmonitor' on 'jfp595901c' succeeded
CRS-2676: Start of 'ora.gipcd' on 'jfp595901c' succeeded
CRS-2672: Attempting to start 'ora.cssd' on 'jfp595901c'
CRS-2672: Attempting to start 'ora.diskmon' on 'jfp595901c'
CRS-2676: Start of 'ora.diskmon' on 'jfp595901c' succeeded
CRS-2676: Start of 'ora.cssd' on 'jfp595901c' succeeded
ASM created and started successfully.
Disk Group DATA created successfully.
The ora.asm resource is not ONLINE
Did not succssfully configure and start ASM at /u01/app/11.2.0/grid/crs/install/crsconfig_lib.pm line 6763.
/u01/app/11.2.0/grid/perl/bin/perl -I/u01/app/11.2.0/grid/perl/lib -I/u01/app/11.2.0/grid/crs/install /u01/app/11.2.0/grid/crs/install/rootcrs.pl execution failed
root@jfp595901c:/>
在出现这个现象的时候就去查看了状态:
root@jfp595901c:/u01/app/11.2.0/grid/bin>./crsctl status resource ora.asm -init
NAME=ora.asm
TYPE=ora.asm.type
TARGET=ONLINE
STATE=ONLINE on jfp595901c
root@jfp595901c:/u01/app/11.2.0/grid/bin>./crsctl status resource -t -init
--------------------------------------------------------------------------------
NAME TARGET STATE SERVER STATE_DETAILS
--------------------------------------------------------------------------------
Cluster Resources
--------------------------------------------------------------------------------
ora.asm
1 ONLINE ONLINE jfp595901c Started
ora.cluster_interconnect.haip
1 ONLINE OFFLINE
ora.crf
1 OFFLINE OFFLINE
ora.crsd
1 OFFLINE OFFLINE
ora.cssd
1 ONLINE ONLINE jfp595901c
ora.cssdmonitor
1 ONLINE ONLINE jfp595901c
ora.ctssd
1 ONLINE ONLINE jfp595901c OBSERVER
ora.diskmon
1 OFFLINE OFFLINE
ora.drivers.acfs
1 ONLINE ONLINE jfp595901c
ora.evmd
1 OFFLINE OFFLINE
ora.gipcd
1 ONLINE ONLINE jfp595901c
ora.gpnpd
1 ONLINE ONLINE jfp595901c
ora.mdnsd
1 ONLINE ONLINE jfp595901c
再去查看dg 的状态:
$ sqlplus / as sysasm
SQL*Plus: Release 11.2.0.3.0 Production on Thu Apr 25 19:51:49 2013
Copyright (c) 1982, 2011, Oracle. All rights reserved.
Connected to:
Oracle Database 11g Enterprise Edition Release 11.2.0.3.0 - 64bit Production
With the Real Application Clusters and Automatic Storage Management options
SQL> desc v$asm_diskgroup
Name Null? Type
----------------------------------------- -------- ----------------------------
GROUP_NUMBER NUMBER
NAME VARCHAR2(30)
SECTOR_SIZE NUMBER
BLOCK_SIZE NUMBER
ALLOCATION_UNIT_SIZE NUMBER
STATE VARCHAR2(11)
TYPE VARCHAR2(6)
TOTAL_MB NUMBER
FREE_MB NUMBER
HOT_USED_MB NUMBER
COLD_USED_MB NUMBER
REQUIRED_MIRROR_FREE_MB NUMBER
USABLE_FILE_MB NUMBER
OFFLINE_DISKS NUMBER
COMPATIBILITY VARCHAR2(60)
DATABASE_COMPATIBILITY VARCHAR2(60)
VOTING_FILES VARCHAR2(1)
SQL> select name ,STATE from v$asm_diskgroup;
NAME STATE
------------------------------ -----------
DATA MOUNTED
SQL> !ps -ef | grep asm_
grid 8323296 1 0 17:34:04 - 0:00 asm_psp0_+ASM1
grid 8388698 1 0 17:34:06 - 0:00 asm_ping_+ASM1
grid 10092624 1 0 17:34:06 - 0:03 asm_diag_+ASM1
grid 10289216 1 0 17:34:05 - 0:03 asm_vktm_+ASM1
grid 10354750 1 0 17:34:04 - 0:00 asm_pmon_+ASM1
grid 10616900 1 0 17:34:06 - 0:00 asm_gen0_+ASM1
grid 10682438 1 0 17:34:06 - 0:35 asm_dia0_+ASM1
grid 10747976 1 1 17:34:06 - 0:10 asm_lmon_+ASM1
grid 10813514 1 0 17:34:06 - 0:04 asm_lmd0_+ASM1
grid 10879052 1 0 17:34:06 - 0:21 asm_lms0_+ASM1
grid 10944592 1 0 17:34:06 - 0:00 asm_mman_+ASM1
grid 11075666 1 0 17:34:06 - 0:00 asm_lmhb_+ASM1
grid 11141204 1 0 17:34:06 - 0:00 asm_dbw0_+ASM1
grid 11206742 1 0 17:34:06 - 0:00 asm_lgwr_+ASM1
grid 11272280 1 0 17:34:06 - 0:00 asm_ckpt_+ASM1
grid 11337818 1 0 17:34:06 - 0:00 asm_smon_+ASM1
grid 11403356 1 0 17:34:06 - 0:01 asm_rbal_+ASM1
grid 11468894 1 0 17:34:06 - 0:00 asm_gmon_+ASM1
grid 11534432 1 0 17:34:07 - 0:00 asm_mmon_+ASM1
grid 11599970 1 0 17:34:07 - 0:01 asm_mmnl_+ASM1
grid 11665508 1 0 17:34:07 - 0:00 asm_lck0_+ASM1
grid 11927798 10551392 0 19:56:44 pts/0 0:00 grep asm_
root@jfp595901c:/u01/app/grid/diag/asm/+asm/+ASM1/trace>cat alert_+ASM1.log
Thu Apr 25 17:33:59 2013
MEMORY_TARGET defaulting to 318767104.
* instance_number obtained from CSS = 1, checking for the existence of node 0...
* node 0 does not exist. instance_number = 1
Starting ORACLE instance (normal)
LICENSE_MAX_SESSION = 0
LICENSE_SESSIONS_WARNING = 0
Private Interface 'en7' configured from GPnP for use as a private interconnect.
[name='en7', type=1, ip=2.2.139.14, mac=00-11-25-be-62-ec, net=2.2.139.0/24, mask=255.255.255.0, use=cluster_interconnect/6]
Public Interface 'en6' configured from GPnP for use as a public interface.
[name='en6', type=1, ip=10.9.144.53, mac=00-11-25-08-cb-32, net=10.9.144.0/25, mask=255.255.255.128, use=public/1]
Shared memory segment for instance monitoring created
Picked latch-free SCN scheme 3
Using LOG_ARCHIVE_DEST_1 parameter default value as /u01/app/11.2.0/grid/dbs/arch
Autotune of undo retention is turned on.
LICENSE_MAX_USERS = 0
SYS auditing is disabled
NOTE: Volume support enabled
Starting up:
Oracle Database 11g Enterprise Edition Release 11.2.0.3.0 - 64bit Production
With the Real Application Clusters and Automatic Storage Management options.
ORACLE_HOME = /u01/app/11.2.0/grid
System name: AIX
Node name: jfp595901c
Release: 1
Version: 6
Machine: 00CE5DE34C00
Using parameter settings in client-side pfile /u01/app/11.2.0/grid/dbs/init+ASM1.ora on machine jfp595901c
System parameters with non-default values:
large_pool_size = 12M
instance_type = "asm"
remote_login_passwordfile= "EXCLUSIVE"
asm_power_limit = 1
diagnostic_dest = "/u01/app/grid"
Cluster communication is configured to use the following interface(s) for this instance
2.2.139.14
cluster interconnect IPC version:Oracle UDP/IP (generic)
IPC Vendor 1 proto 2
Thu Apr 25 17:34:04 2013
PMON started with pid=2, OS id=10354750
Thu Apr 25 17:34:04 2013
PSP0 started with pid=3, OS id=8323296
Thu Apr 25 17:34:06 2013
VKTM started with pid=4, OS id=10289216 at elevated priority
VKTM running at (10)millisec precision with DBRM quantum (100)ms
Thu Apr 25 17:34:06 2013
GEN0 started with pid=5, OS id=10616900
Thu Apr 25 17:34:06 2013
DIAG started with pid=6, OS id=10092624
Thu Apr 25 17:34:06 2013
PING started with pid=7, OS id=8388698
Thu Apr 25 17:34:06 2013
DIA0 started with pid=8, OS id=10682438
Thu Apr 25 17:34:06 2013
LMON started with pid=9, OS id=10747976
Thu Apr 25 17:34:06 2013
LMD0 started with pid=10, OS id=10813514
* System load used for high load check
* New Low - High Load Threshold Range = [55296 - 73728]
Thu Apr 25 17:34:06 2013
LMS0 started with pid=11, OS id=10879052 at elevated priority
Thu Apr 25 17:34:06 2013
LMHB started with pid=12, OS id=11075666
Thu Apr 25 17:34:06 2013
MMAN started with pid=13, OS id=10944592
Thu Apr 25 17:34:06 2013
DBW0 started with pid=14, OS id=11141204
Thu Apr 25 17:34:06 2013
LGWR started with pid=15, OS id=11206742
Thu Apr 25 17:34:06 2013
CKPT started with pid=16, OS id=11272280
Thu Apr 25 17:34:06 2013
SMON started with pid=17, OS id=11337818
Thu Apr 25 17:34:06 2013
RBAL started with pid=18, OS id=11403356
Thu Apr 25 17:34:07 2013
GMON started with pid=19, OS id=11468894
Thu Apr 25 17:34:07 2013
MMON started with pid=20, OS id=11534432
Thu Apr 25 17:34:07 2013
MMNL started with pid=21, OS id=11599970
lmon registered with NM - instance number 1 (internal mem no 0)
Reconfiguration started (old inc 0, new inc 2)
ASM instance
List of instances:
1 (myinst: 1)
Global Resource Directory frozen
* allocate domain 0, invalid = TRUE
Communication channels reestablished
Master broadcasted resource hash value bitmaps
Non-local Process blocks cleaned out
LMS 0: 0 GCS shadows cancelled, 0 closed, 0 Xw survived
Set master node info
Submitted all remote-enqueue requests
Dwn-cvts replayed, VALBLKs dubious
All grantable enqueues granted
Post SMON to start 1st pass IR
Submitted all GCS remote-cache requests
Post SMON to start 1st pass IR
Fix write in gcs resources
Reconfiguration complete
Thu Apr 25 17:34:07 2013
LCK0 started with pid=22, OS id=11665508
ORACLE_BASE from environment = /u01/app/grid
Thu Apr 25 17:34:08 2013
SQL> ALTER DISKGROUP ALL MOUNT
Diskgroup used for OCR is:DATA
NOTE: cache registered group DATA number=1 incarn=0x86a1151b
NOTE: cache began mount (first) of group DATA number=1 incarn=0x86a1151b
ERROR: no read quorum in group: required 2, found 0 disks
NOTE: cache dismounting (clean) group 1/0x86A1151B (DATA)
NOTE: messaging CKPT to quiesce pins Unix process pid: 10485834, image: oracle@jfp595901c (TNS V1-V3)
NOTE: dbwr not being msg'd to dismount
NOTE: lgwr not being msg'd to dismount
NOTE: cache dismounted group 1/0x86A1151B (DATA)
NOTE: cache ending mount (fail) of group DATA number=1 incarn=0x86a1151b
NOTE: cache deleting context for group DATA 1/0x86a1151b
GMON dismounting group 1 at 2 for pid 23, osid 10485834
ERROR: diskgroup DATA was not mounted
WARNING: Disk Group DATA containing configured OCR is not mounted
ORA-15032: not all alterations performed
ORA-15017: diskgroup "DATA" cannot be mounted
ORA-15063: ASM discovered an insufficient number of disks for diskgroup "DATA"
ERROR: ALTER DISKGROUP ALL MOUNT
Thu Apr 25 17:34:09 2013
NOTE: No asm libraries found in the system
Thu Apr 25 17:34:09 2013
SQL> CREATE DISKGROUP DATA EXTERNAL REDUNDANCY DISK '/dev/rhdiskpower10',
'/dev/rhdiskpower11',
'/dev/rhdiskpower12',
'/dev/rhdiskpower13',
'/dev/rhdiskpower6',
'/dev/rhdiskpower7',
'/dev/rhdiskpower8',
'/dev/rhdiskpower9' ATTRIBUTE 'compatible.asm'='11.2.0.0.0','au_size'='1M' /* ASMCA */
ASM Health Checker found 1 new failures
NOTE: Assigning number (1,0) to disk (/dev/rhdiskpower10)
NOTE: Assigning number (1,1) to disk (/dev/rhdiskpower11)
NOTE: Assigning number (1,2) to disk (/dev/rhdiskpower12)
NOTE: Assigning number (1,3) to disk (/dev/rhdiskpower13)
NOTE: Assigning number (1,4) to disk (/dev/rhdiskpower6)
NOTE: Assigning number (1,5) to disk (/dev/rhdiskpower7)
NOTE: Assigning number (1,6) to disk (/dev/rhdiskpower8)
NOTE: Assigning number (1,7) to disk (/dev/rhdiskpower9)
NOTE: initializing header on grp 1 disk DATA_0000
NOTE: initializing header on grp 1 disk DATA_0001
NOTE: initializing header on grp 1 disk DATA_0002
NOTE: initializing header on grp 1 disk DATA_0003
NOTE: initializing header on grp 1 disk DATA_0004
NOTE: initializing header on grp 1 disk DATA_0005
NOTE: initializing header on grp 1 disk DATA_0006
NOTE: initializing header on grp 1 disk DATA_0007
NOTE: initiating PST update: grp = 1
GMON updating group 1 at 3 for pid 23, osid 11927668
NOTE: group DATA: initial PST location: disk 0000 (PST copy 0)
NOTE: PST update grp = 1 completed successfully
NOTE: cache registered group DATA number=1 incarn=0x8f41151f
NOTE: cache began mount (first) of group DATA number=1 incarn=0x8f41151f
NOTE: cache opening disk 0 of grp 1: DATA_0000 path:/dev/rhdiskpower10
NOTE: cache opening disk 1 of grp 1: DATA_0001 path:/dev/rhdiskpower11
NOTE: cache opening disk 2 of grp 1: DATA_0002 path:/dev/rhdiskpower12
NOTE: cache opening disk 3 of grp 1: DATA_0003 path:/dev/rhdiskpower13
NOTE: cache opening disk 4 of grp 1: DATA_0004 path:/dev/rhdiskpower6
NOTE: cache opening disk 5 of grp 1: DATA_0005 path:/dev/rhdiskpower7
NOTE: cache opening disk 6 of grp 1: DATA_0006 path:/dev/rhdiskpower8
NOTE: cache opening disk 7 of grp 1: DATA_0007 path:/dev/rhdiskpower9
* allocate domain 1, invalid = TRUE
NOTE: attached to recovery domain 1
NOTE: cache creating group 1/0x8F41151F (DATA)
NOTE: cache mounting group 1/0x8F41151F (DATA) succeeded
NOTE: allocating F1X0 on grp 1 disk DATA_0000
NOTE: Created Used Space Directory for 1 threads
NOTE: diskgroup must now be re-mounted prior to first use
NOTE: cache dismounting (clean) group 1/0x8F41151F (DATA)
NOTE: messaging CKPT to quiesce pins Unix process pid: 11927668, image: oracle@jfp595901c (TNS V1-V3)
NOTE: lgwr not being msg'd to dismount
freeing rdom 1
NOTE: detached from domain 1
NOTE: cache dismounted group 1/0x8F41151F (DATA)
GMON dismounting group 1 at 4 for pid 23, osid 11927668
GMON dismounting group 1 at 5 for pid 23, osid 11927668
NOTE: Disk in mode 0x8 marked for de-assignment
NOTE: Disk in mode 0x8 marked for de-assignment
NOTE: Disk in mode 0x8 marked for de-assignment
NOTE: Disk in mode 0x8 marked for de-assignment
NOTE: Disk in mode 0x8 marked for de-assignment
NOTE: Disk in mode 0x8 marked for de-assignment
NOTE: Disk in mode 0x8 marked for de-assignment
NOTE: Disk in mode 0x8 marked for de-assignment
SUCCESS: diskgroup DATA was created
NOTE: cache deleting context for group DATA 1/0x8f41151f
NOTE: cache registered group DATA number=1 incarn=0xa4d11521
NOTE: cache began mount (first) of group DATA number=1 incarn=0xa4d11521
NOTE: Assigning number (1,0) to disk (/dev/rhdiskpower10)
NOTE: Assigning number (1,1) to disk (/dev/rhdiskpower11)
NOTE: Assigning number (1,2) to disk (/dev/rhdiskpower12)
NOTE: Assigning number (1,3) to disk (/dev/rhdiskpower13)
NOTE: Assigning number (1,4) to disk (/dev/rhdiskpower6)
NOTE: Assigning number (1,5) to disk (/dev/rhdiskpower7)
NOTE: Assigning number (1,6) to disk (/dev/rhdiskpower8)
NOTE: Assigning number (1,7) to disk (/dev/rhdiskpower9)
Thu Apr 25 17:34:19 2013
NOTE: GMON heartbeating for grp 1
GMON querying group 1 at 8 for pid 23, osid 11927668
NOTE: cache opening disk 0 of grp 1: DATA_0000 path:/dev/rhdiskpower10
NOTE: F1X0 found on disk 0 au 2 fcn 0.0
NOTE: cache opening disk 1 of grp 1: DATA_0001 path:/dev/rhdiskpower11
NOTE: cache opening disk 2 of grp 1: DATA_0002 path:/dev/rhdiskpower12
NOTE: cache opening disk 3 of grp 1: DATA_0003 path:/dev/rhdiskpower13
NOTE: cache opening disk 4 of grp 1: DATA_0004 path:/dev/rhdiskpower6
NOTE: cache opening disk 5 of grp 1: DATA_0005 path:/dev/rhdiskpower7
NOTE: cache opening disk 6 of grp 1: DATA_0006 path:/dev/rhdiskpower8
NOTE: cache opening disk 7 of grp 1: DATA_0007 path:/dev/rhdiskpower9
NOTE: cache mounting (first) external redundancy group 1/0xA4D11521 (DATA)
Thu Apr 25 17:34:19 2013
* allocate domain 1, invalid = TRUE
Thu Apr 25 17:34:19 2013
NOTE: attached to recovery domain 1
Thu Apr 25 17:34:19 2013
NOTE: cache recovered group 1 to fcn 0.0
NOTE: redo buffer size is 256 blocks (1053184 bytes)
Thu Apr 25 17:34:19 2013
NOTE: LGWR attempting to mount thread 1 for diskgroup 1 (DATA)
NOTE: LGWR found thread 1 closed at ABA 0.10750
NOTE: LGWR mounted thread 1 for diskgroup 1 (DATA)
NOTE: LGWR opening thread 1 at fcn 0.0 ABA 2.0
NOTE: setting 11.2 start ABA for group DATA thread 1 to 2.0
NOTE: cache mounting group 1/0xA4D11521 (DATA) succeeded
NOTE: cache ending mount (success) of group DATA number=1 incarn=0xa4d11521
GMON querying group 1 at 9 for pid 18, osid 11403356
Thu Apr 25 17:34:20 2013
NOTE: Instance updated compatible.asm to 11.2.0.0.0 for grp 1
SUCCESS: diskgroup DATA was mounted
SUCCESS: CREATE DISKGROUP DATA EXTERNAL REDUNDANCY DISK '/dev/rhdiskpower10',
'/dev/rhdiskpower11',
'/dev/rhdiskpower12',
'/dev/rhdiskpower13',
'/dev/rhdiskpower6',
'/dev/rhdiskpower7',
'/dev/rhdiskpower8',
'/dev/rhdiskpower9' ATTRIBUTE 'compatible.asm'='11.2.0.0.0','au_size'='1M' /* ASMCA */
Thu Apr 25 17:34:20 2013
WARNING: failed to online diskgroup resource ora.DATA.dg (unable to communicate with CRSD/OHASD)
ERROR: failed to update diskgroup resource ora.DATA.dg
NOTE: Advanced to new COD format for group DATA
NOTE: updated gpnp profile ASM diskstring:
NOTE: updated gpnp profile ASM SPFILE to +DATA/jfp5959-cluster/asmparameterfile/registry.253.813692061
待续