Database version: 11.2.0.3 RAC
GoldenGate version: 11.1.1.1.2
This morning I found that data synchronization was abnormal. The status on the source side was as follows:
GGSCI (ulecardrac1) 3> info all
Program     Status      Group       Lag           Time Since Chkpt
MANAGER     RUNNING
EXTRACT     RUNNING     EXT232      00:00:00      06:32:33
EXTRACT     RUNNING     PUMP232     00:00:00      00:00:03
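The status still shows RUNNING, but the checkpoint has not been updated for six and a half hours; the process has in fact hung.
Checking the log ggserr.log shows OGG-01738 messages: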
2015-01-15 21:12:37 INFO OGG-01517 Position of first record processed Sequence 30, RBA 170907152, SCN 0.2262790, Jan 15, 2015 7:49:50 PM.
2015-01-16 01:12:41 INFO OGG-01738 BOUNDED RECOVERY: CHECKPOINT: for object pool 1: p6427_extr: start=SeqNo: 35, RBA: 45666320, SCN: 0.2287580 (2287580), Timestamp: 2015-01-16 01:12:39.000000, Thread: 1, end=SeqNo: 35, RBA: 45667328, SCN: 0.2287580 (2287580), Timestamp: 2015-01-16 01:12:39.000000, Thread: 1.
2015-01-16 05:12:43 INFO OGG-01738 BOUNDED RECOVERY: CHECKPOINT: for object pool 1: p6427_extr: start=SeqNo: 35, RBA: 58063376, SCN: 0.2298655 (2298655), Timestamp: 2015-01-16 05:12:26.000000, Thread: 1, end=SeqNo: 35, RBA: 58063872, SCN: 0.2298655 (2298655), Timestamp: 2015-01-16 05:12:26.000000, Thread: 1.
2015-01-16 09:12:52 INFO OGG-01738 BOUNDED RECOVERY: CHECKPOINT: for object pool 1: p6427_extr: start=SeqNo: 35, RBA: 101322256, SCN: 0.2310699 (2310699), Timestamp: 2015-01-16 09:12:44.000000, Thread: 1, end=SeqNo: 35, RBA: 101322752, SCN: 0.2310699 (2310699), Timestamp: 2015-01-16 09:12:44.000000, Thread: 1.
2015-01-16 10:17:23 INFO OGG-06508 Wildcard MAPTABLE resolved (entry scott.*): table "SCOTT"."EMP".
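To pull these entries out of a large ggserr.log, a small script can filter by message code and show how far apart the entries are. The following is a minimal Python sketch, assuming each entry sits on one line in the "timestamp severity code text" layout seen above. The function names find_messages and gaps_in_hours are made up for this illustration, and the sample text is abbreviated from the log above.

```python
import re
from datetime import datetime

# Leading "timestamp severity code" part of a ggserr.log entry, e.g.
# "2015-01-16 01:12:41 INFO OGG-01738 BOUNDED RECOVERY: ..."
# (layout assumed from the excerpt in this post).
ENTRY_RE = re.compile(
    r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2})\s+(\w+)\s+(OGG-\d+)\s+(.*)$"
)


def find_messages(log_text, code):
    """Return (timestamp, text) pairs for every entry with the given OGG code."""
    hits = []
    for line in log_text.splitlines():
        m = ENTRY_RE.match(line.strip())
        if m and m.group(3) == code:
            ts = datetime.strptime(m.group(1), "%Y-%m-%d %H:%M:%S")
            hits.append((ts, m.group(4)))
    return hits


def gaps_in_hours(hits):
    """Return the time between consecutive entries, in hours."""
    return [(b[0] - a[0]).total_seconds() / 3600 for a, b in zip(hits, hits[1:])]


if __name__ == "__main__":
    # Message text abbreviated; timestamps taken from the log above.
    sample = (
        "2015-01-16 01:12:41 INFO OGG-01738 BOUNDED RECOVERY: CHECKPOINT\n"
        "2015-01-16 05:12:43 INFO OGG-01738 BOUNDED RECOVERY: CHECKPOINT\n"
        "2015-01-16 09:12:52 INFO OGG-01738 BOUNDED RECOVERY: CHECKPOINT\n"
        "2015-01-16 10:17:23 INFO OGG-06508 Wildcard MAPTABLE resolved\n"
    )
    hits = find_messages(sample, "OGG-01738")
    print([round(g, 2) for g in gaps_in_hours(hits)])  # [4.0, 4.0]
```

On the excerpt above, the OGG-01738 entries are about four hours apart.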
There is an article on MOS about this error, note 1293772.1, which gives the following command:
GGSCI> start <extract_name> BRRESET
Because the Extract process ext232 had already hung, it could not be stopped. Even 'send ext232 forcestop' and 'stop mgr' failed to stop it.
In the end I had to kill the process from the shell and then run:
GGSCI> start ext232 BRRESET
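The kill step can be scripted. The following is a rough Python sketch, not the exact commands used here. It assumes the Extract's command line, as listed by ps, contains the tokens "PROCESSID <group>", which should be verified on your own system first. The function names are made up for this illustration, and SIGKILL is my choice for a process that ignores everything else; a plain kill may be enough.

```python
import os
import signal
import subprocess


def find_pids(ps_output, group):
    """Return PIDs whose command line contains 'PROCESSID <group>'.

    ps_output is expected in 'ps -eo pid,args' layout: PID first, then the
    command line. The PROCESSID token is an assumption about how the
    GoldenGate Extract appears in the process list.
    """
    pids = []
    for line in ps_output.splitlines():
        fields = line.split()
        if len(fields) < 2 or not fields[0].isdigit():
            continue  # skips the header row and blank lines
        args = [f.upper() for f in fields[1:]]
        for i, token in enumerate(args[:-1]):
            if token == "PROCESSID" and args[i + 1] == group.upper():
                pids.append(int(fields[0]))
                break
    return pids


def kill_group(group):
    """Find the process for the given group and send it SIGKILL."""
    out = subprocess.run(
        ["ps", "-eo", "pid,args"], capture_output=True, text=True, check=True
    ).stdout
    pids = find_pids(out, group)
    for pid in pids:
        os.kill(pid, signal.SIGKILL)
    return pids
```

Calling kill_group("ext232") would kill the matching process, after which the Extract can be started again from GGSCI with BRRESET.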
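After the restart the status was back to normal and synchronization had essentially no lag.
This bug only occurs on RAC, or on a single instance configured with multiple threads, and it has been fixed in later versions. For a permanent fix, consider upgrading OGG to 11.2.1.0.1.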
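Until an upgrade is possible, this kind of hang can be caught earlier by watching the Time Since Chkpt column instead of trusting the Status column. The following is a minimal Python sketch, assuming the five-column 'info all' layout shown earlier. The function names and the 30-minute threshold are made up for this illustration and are not GoldenGate settings.

```python
from datetime import timedelta


def parse_hms(text):
    """Convert a GGSCI 'HH:MM:SS' value into a timedelta."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return timedelta(hours=hours, minutes=minutes, seconds=seconds)


def stalled_groups(info_all_text, max_age=timedelta(minutes=30)):
    """Return groups that report RUNNING but have not checkpointed recently."""
    stalled = []
    for line in info_all_text.splitlines():
        fields = line.split()
        # Process rows look like: EXTRACT RUNNING EXT232 00:00:00 06:32:33
        if len(fields) != 5 or fields[0] not in ("EXTRACT", "REPLICAT"):
            continue  # header, MANAGER row, prompt, blank lines
        _program, status, group, _lag, since_chkpt = fields
        if status == "RUNNING" and parse_hms(since_chkpt) > max_age:
            stalled.append(group)
    return stalled


if __name__ == "__main__":
    sample = """\
Program Status Group Lag Time Since Chkpt
MANAGER RUNNING
EXTRACT RUNNING EXT232 00:00:00 06:32:33
EXTRACT RUNNING PUMP232 00:00:00 00:00:03
"""
    print(stalled_groups(sample))  # ['EXT232']
```

The threshold needs tuning per environment, since a long gap between checkpoints is not always a hang.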
Note: an article I found on Google says this error is a bug in OGG 11. In my case, however, the problem was resolved by starting the Extract with this command:
GGSCI > START EXTR_1 BRRESET