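This morning a friend told me that creating a database with dbca on their RAC failed with ORA-00443, and asked me to help work out the cause.
The error is shown in the screenshot.
Environment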
OS:LINUX REDHAT x86_64 5.7
kernel:2.6.18-194.el5
memory:100G
CPU:Intel(R) Xeon(R) CPU E7- 8837 @ 2.67GHz * 64
ORACLE:10.2.0.4
Errors in the alert log
Wed Sep 5 01:32:33 2012
cluster interconnect IPC version:Oracle UDP/IP (generic)
IPC Vendor 1 proto 2
PMON started with pid=2, OS id=17859
DIAG started with pid=7, OS id=17861
PSP0 started with pid=12, OS id=17863
LMON started with pid=17, OS id=17865
LMD0 started with pid=22, OS id=17867
MMAN started with pid=27, OS id=17869
DBW0 started with pid=32, OS id=17871
Wed Sep 5 01:32:33 2012
Errors in file /u01/app/oracle/admin/dtjcdb/bdump/dtjcdb1_ora_17873.trc:
ORA-00600: internal error code, arguments: [ksbmoveme4], [], [], [], [], [], [], []
ORA-27300: OS system dependent operation:run on node failed with status: 2
ORA-27301: OS failure message: No such file or directory
ORA-27302: failure occurred at: skgpmoveme:1
Wed Sep 5 01:32:34 2012
Trace dumping is performing id=[cdmp_20120905013234]
Wed Sep 5 01:32:34 2012
Process DBW1 died, see its trace file
USER: terminating instance due to error 443
Instance terminated by USER, pid = 17857
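When an alert log is long, it can help to pull out the ORA- codes and the error number that terminated the instance before reading the details. The following is only a minimal sketch: the function name summarize_alert_log and the regular expressions are my own assumptions, written against the excerpt above rather than any documented alert log format.

```python
import re

# Assumed patterns, derived only from the alert log excerpt in this post.
ORA_PATTERN = re.compile(r"\bORA-(\d{5})\b")
TERMINATE_PATTERN = re.compile(r"terminating instance due to error (\d+)")


def summarize_alert_log(text):
    """Return (ORA codes in first-seen order, terminating error or None)."""
    codes = []
    terminating = None
    for line in text.splitlines():
        for digits in ORA_PATTERN.findall(line):
            name = "ORA-" + digits
            if name not in codes:
                codes.append(name)
        match = TERMINATE_PATTERN.search(line)
        if match:
            # "error 443" in the alert log corresponds to ORA-00443
            terminating = "ORA-%05d" % int(match.group(1))
    return codes, terminating


sample = """\
ORA-00600: internal error code, arguments: [ksbmoveme4], [], [], [], [], [], [], []
ORA-27300: OS system dependent operation:run on node failed with status: 2
ORA-27301: OS failure message: No such file or directory
ORA-27302: failure occurred at: skgpmoveme:1
Process DBW1 died, see its trace file
USER: terminating instance due to error 443
"""

codes, terminating = summarize_alert_log(sample)
print(codes)
print(terminating)
```

For the excerpt above this lists ORA-00600 and the three ORA-273xx errors, and maps "error 443" back to the ORA-00443 reported by dbca.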
Contents of the trace file
*** 2012-09-05 01:32:33.996
ksedmp: internal or fatal error
ORA-00600: internal error code, arguments: [ksbmoveme4], [], [], [], [], [], [], []
ORA-27300: OS system dependent operation:run on node failed with status: 2
ORA-27301: OS failure message: No such file or directory
ORA-27302: failure occurred at: skgpmoveme:1
Current SQL information unavailable - no session.
----- Call Stack Trace -----
calling call entry argument values in hex
location type point (? means dubious value)
-------------------- -------- -------------------- ----------------------------
ksedst()+31 call ksedst1() 000000000 ? 000000001 ?
7FFFC4ABA4C0 ? 7FFFC4ABA520 ?
7FFFC4ABA460 ? 000000000 ?
ksedmp()+610 call ksedst() 000000000 ? 000000001 ?
7FFFC4ABA4C0 ? 7FFFC4ABA520 ?
7FFFC4ABA460 ? 000000000 ?
ksfdmp()+21 call ksedmp() 000000003 ? 000000001 ?
7FFFC4ABA4C0 ? 7FFFC4ABA520 ?
7FFFC4ABA460 ? 000000000 ?
kgerinv()+161 call ksfdmp() 000000003 ? 000000001 ?
7FFFC4ABA4C0 ? 7FFFC4ABA520 ?
7FFFC4ABA460 ? 000000000 ?
kgesinv()+33 call kgerinv() 0068966E0 ? 000000000 ?
7FFFC4ABA520 ? 7FFFC4ABA460 ?
000000000 ? 000000000 ?
ksesin()+211 call kgesinv() 0068966E0 ? 000000000 ?
7FFFC4ABA520 ? 7FFFC4ABA460 ?
000000000 ? 000000000 ?
ksbmoveme()+350 call ksesin() 00533D5C8 ? 000000000 ?
006896FA3 ? 000000001 ?
000000001 ? 000000000 ?
ksosp_set_current() call ksbmoveme() 000000001 ? 000000000 ?
+117 006896FA3 ? 000000001 ?
000000001 ? 000000000 ?
kso_init()+161 call ksosp_set_current() 151056D10 ? 000000000 ?
006896FA3 ? 000000001 ?
000000001 ? 000000000 ?
opirip()+523 call kso_init() 151056D10 ? 000000000 ?
006896FA3 ? 000000001 ?
000000001 ? 000000000 ?
opidrv()+582 call opirip() 000000032 ? 000000004 ?
7FFFC4ABC128 ? 000000001 ?
000000001 ? 000000000 ?
sou2o()+114 call opidrv() 000000032 ? 000000004 ?
7FFFC4ABC128 ? 000000001 ?
000000001 ? 000000000 ?
opimai_real()+317 call sou2o() 7FFFC4ABC100 ? 000000032 ?
000000004 ? 7FFFC4ABC128 ?
000000001 ? 000000000 ?
main()+116 call opimai_real() 000000003 ? 7FFFC4ABC190 ?
000000004 ? 7FFFC4ABC128 ?
000000001 ? 000000000 ?
__libc_start_main() call main() 000000003 ? 7FFFC4ABC190 ?
+244 000000004 ? 7FFFC4ABC128 ?
000000001 ? 000000000 ?
_start()+41 call __libc_start_main() 000723088 ? 000000001 ?
7FFFC4ABC2E8 ? 000000000 ?
000000001 ? 000000003 ?
--------------------- Binary Stack Dump ---------------------
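The call stack is easier to search on MOS once it is reduced to a chain of function names. The sketch below extracts (caller, callee) pairs from a pasted stack like the one above. The name extract_frames and the regular expression are assumptions that fit this particular paste, where each frame line has the form "caller()+offset call callee()"; they are not a general parser for Oracle trace files.

```python
import re

# Matches lines such as "ksbmoveme()+350 call ksesin() ..." and also
# "ksosp_set_current() call ksbmoveme() ..." where the offset wrapped
# onto the next line. Argument-only lines do not match.
FRAME_PATTERN = re.compile(r"^(\w+)\(\)(?:\+\d+)?\s+call\s+(\w+)\(\)")


def extract_frames(trace_text):
    """Return a list of (caller, callee) tuples in the order they appear."""
    frames = []
    for line in trace_text.splitlines():
        match = FRAME_PATTERN.match(line.strip())
        if match:
            frames.append((match.group(1), match.group(2)))
    return frames


stack_sample = """\
ksesin()+211 call kgesinv() 0068966E0 ? 000000000 ?
7FFFC4ABA520 ? 7FFFC4ABA460 ?
ksbmoveme()+350 call ksesin() 00533D5C8 ? 000000000 ?
006896FA3 ? 000000001 ?
ksosp_set_current() call ksbmoveme() 000000001 ? 000000000 ?
+117 006896FA3 ? 000000001 ?
kso_init()+161 call ksosp_set_current() 151056D10 ? 000000000 ?
"""

for caller, callee in extract_frames(stack_sample):
    print(caller, "->", callee)
```

In this trace the interesting part is the chain kso_init -> ksosp_set_current -> ksbmoveme -> ksesin, which is where the [ksbmoveme4] internal error is raised.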
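A search on MOS turned up note [ID 422908.1], which describes a similar error. That note, however, attributes it to too many CPUs being added to the system afterwards, which crashes the database with ORA-27300, ORA-27301 and ORA-27302. In this case the server had 64 cores from the start. From experience, on Windows with 10.2.0.4, dbca also hangs at 2% of database creation when the machine has more than 32 cores, so I suspected that this error was likewise caused by too many CPUs.
Solution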
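A quick way to check whether a Linux host falls into the suspected range is to count the logical CPUs the OS reports. This is a rough sketch under stated assumptions: the 32-CPU threshold is only the rule of thumb from my Windows experience mentioned above, not a documented Oracle limit, and the function names are made up for illustration. The parser expects x86 Linux /proc/cpuinfo text, where each logical CPU has a line starting with "processor".

```python
# Assumed threshold: a rule of thumb from experience, not an Oracle-documented limit.
SUSPECT_THRESHOLD = 32


def count_logical_cpus(cpuinfo_text):
    """Count 'processor : N' entries in /proc/cpuinfo-style text."""
    count = 0
    for line in cpuinfo_text.splitlines():
        key = line.split(":")[0].strip()
        if key == "processor":
            count += 1
    return count


def cpu_count_is_suspect(cpuinfo_text, threshold=SUSPECT_THRESHOLD):
    """Return True when the host reports more logical CPUs than the threshold."""
    return count_logical_cpus(cpuinfo_text) > threshold


# Synthetic input that mimics a 64-CPU host like the one in this case.
cpuinfo_sample = "".join(
    "processor\t: %d\nmodel name\t: example cpu\n\n" % i for i in range(64)
)

print(count_logical_cpus(cpuinfo_sample))
print(cpu_count_is_suspect(cpuinfo_sample))
```

On a real host the text would come from reading /proc/cpuinfo, for example with open("/proc/cpuinfo").read().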
To solve the problem:
1) apply patch:6471079
- or -
2) apply the 10.2.0.5 (when available)
- or -
3) upgrade to 11g
My friend applied patch:6471079, after which dbca created the database normally.