今天rac系统两实例crash,
查看问题崩溃原因:
IPC Send timeout detected.Sender: ospid 9241
Receiver: inst 2 binc 429430976 ospid 8907
Errors in file /opt/app/oracle/admin/PRI/bdump/pri1_diag_7793.trc:
ORA-00600: internal error code, arguments: [kjzcreaprqhq1], [], [], [], [], [], [], []
IPC Send timeout detected.Sender: ospid 9241
Receiver: inst 2 binc 429430976 ospid 8907
Errors in file /opt/app/oracle/admin/PRI/bdump/pri1_diag_7793.trc:
ORA-00600: internal error code, arguments: [kjzcreaprqhq1], [], [], [], [], [], [], []
实例没能自动回复:redo文件缺失
ORA-00313: open failed for members of log group 4 of thread 2
ORA-00312: online log 4 thread 2: '/home/oracle/onlinelog/redo4002.log'
ORA-27037: unable to obtain file status
ORA-00312: online log 4 thread 2: '/home/oracle/onlinelog/redo4002.log'
ORA-27037: unable to obtain file status
分析问题发生可能:oraclebug,恢复过程中redo异常
redo情况:每个实例有两个日志组,分别存放本地,这是实例无法恢复的原因,根据提示可见,实例恢复时需要所有的redo
解决方法:拷贝需要的redo文件,即用剩余的文件复制和修改成所需文件,尝试启动数据库。
经验教训:rac环境下redo文件不能存储在本地,还是应该同数据文件一样存储在共享存储上,否则一旦实例崩溃,smon就无法进行crash recovery。
来自 “ ITPUB博客 ” ,链接:http://blog.itpub.net/15702005/viewspace-624363/,如需转载,请注明出处,否则将追究法律责任。
转载于:http://blog.itpub.net/15702005/viewspace-624363/