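DB2 and Oracle backup problem
Operating system: RedHat 5.8 for Z
Environment: TSM 6.2
There are two TSM servers; the one with the problem is TSM1.
TSM1 is configured with 8 drives and has 2 Oracle nodes and 4 DB2 nodes.
The two Oracle nodes are JTFXPSTA1 and RFIDPSTA1.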
tsm: TSM>q path f=d
(The Node Name, External Manager, LUN and Directory columns are empty in this output and are omitted here.)
Source Name  Source Type  Destination Name  Destination Type  Library     Device            Initiator  On-Line  Last Update by   Last Update Date/Time
-----------  -----------  ----------------  ----------------  ----------  ----------------  ---------  -------  ---------------  ---------------------
TSM          SERVER       TS3500LIB1        LIBRARY                       /dev/IBMchanger2  0          Yes      ADMIN            02/28/2015 12:30:47
TSM          SERVER       DRIVER11          DRIVE             TS3500LIB1  /dev/IBMtape11    0          Yes      SERVER_CONSOLE   02/28/2015 13:39:45
TSM          SERVER       DRIVER12          DRIVE             TS3500LIB1  /dev/IBMtape12    0          Yes      SERVER_CONSOLE   02/28/2015 13:03:13
TSM          SERVER       DRIVER13          DRIVE             TS3500LIB1  /dev/IBMtape13    0          Yes      SERVER_CONSOLE   02/28/2015 13:13:41
TSM          SERVER       DRIVER14          DRIVE             TS3500LIB1  /dev/IBMtape14    0          Yes      SERVER_CONSOLE   02/28/2015 13:29:34
TSM          SERVER       DRIVER15          DRIVE             TS3500LIB1  /dev/IBMtape15    0          Yes      SERVER_CONSOLE   02/28/2015 14:09:28
TSM          SERVER       DRIVER16          DRIVE             TS3500LIB1  /dev/IBMtape16    0          Yes      SERVER_CONSOLE   02/28/2015 14:04:17
TSM          SERVER       DRIVER17          DRIVE             TS3500LIB1  /dev/IBMtape17    0          Yes      SERVER_CONSOLE   02/28/2015 12:46:35
JTFXPSTA1    SERVER       DRIVER14          DRIVE             TS3500LIB1  /dev/IBMtape14    0          Yes      ADMIN            02/28/2015 14:22:27
JTFXPSTA1    SERVER       DRIVER15          DRIVE             TS3500LIB1  /dev/IBMtape15    0          Yes      ADMIN            02/28/2015 14:22:27
JTFXPSTA1    SERVER       DRIVER16          DRIVE             TS3500LIB1  /dev/IBMtape16    0          Yes      ADMIN            02/28/2015 14:22:27
JTFXPSTA1    SERVER       DRIVER17          DRIVE             TS3500LIB1  /dev/IBMtape17    0          Yes      ADMIN            02/28/2015 14:22:28
RFIDPSTA1    SERVER       DRIVER14          DRIVE             TS3500LIB1  /dev/IBMtape14    0          Yes      ADMIN            02/28/2015 13:37:34
RFIDPSTA1    SERVER       DRIVER15          DRIVE             TS3500LIB1  /dev/IBMtape15    0          Yes      ADMIN            02/28/2015 13:37:34
RFIDPSTA1    SERVER       DRIVER16          DRIVE             TS3500LIB1  /dev/IBMtape16    0          Yes      ADMIN            02/28/2015 13:37:34
RFIDPSTA1    SERVER       DRIVER17          DRIVE             TS3500LIB1  /dev/IBMtape17    0          Yes      ADMIN            02/28/2015 13:37:35
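Description of the situation:
When I came in to work yesterday I found that most of the backups had failed. Checking the paths showed that 6 of them were offline. After updating them with online=yes, I manually ran the arch.sh script on the Oracle server.
However, it looks as if only one drive can work at a time on the TSM side: arch.sh runs 2 channels in parallel and TSM did receive 2 sessions, but only 1 session was ever transferring data.
Here is the arch.sh log: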
released channel: t1
released channel: t2
RMAN-00571: ===========================================================
RMAN-00569: =============== ERROR MESSAGE STACK FOLLOWS ===============
RMAN-00571: ===========================================================
RMAN-03009: failure of backup command on t2 channel at 02/28/2015 13:34:19
ORA-19506: failed to create sequential file, name="arch_13709_1", parms=""
ORA-27028: skgfqcre: sbtbackup returned error
ORA-19511: Error received from media manager layer, error text:
ANU2503E Backup object '/JTFXPDB//arch_13709_1' already exists on TSM Server.
Recovery Manager complete.
This is the arch backup started at the same time on the other Oracle server node:
channel t2: starting piece 1 at 28-FEB-15
RMAN-03009: failure of backup command on t1 channel at 02/28/2015 13:57:04
ORA-19502: write error on file "arch_25104_1", blockno 449 (blocksize=4096)
ORA-27030: skgfwrt: sbtwrite2 returned error
ORA-19511: Error received from media manager layer, error text:
ANS1017E (RC-50) Session rejected: TCP/IP connection failure
channel t1 disabled, job failed on it will be run on another channel
released channel: t1
released channel: t2
RMAN-00571: ===========================================================
RMAN-00569: =============== ERROR MESSAGE STACK FOLLOWS ===============
RMAN-00571: ===========================================================
RMAN-03009: failure of backup command on t2 channel at 02/28/2015 13:57:04
ORA-19502: write error on file "arch_25105_1", blockno 449 (blocksize=4096)
ORA-27030: skgfwrt: sbtwrite2 returned error
ORA-19511: Error received from media manager layer, error text:
ANS1017E (RC-50) Session rejected: TCP/IP connection failure
Recovery Manager complete.
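To see at a glance which media manager errors the two runs hit, the logs can be summarized with a small script. This is only a sketch: the function name `media_manager_errors` and the embedded sample are my own, and it assumes the TSM message always sits on the line right after ORA-19511, as it does in the logs above.

```python
import re
from collections import Counter

# TSM-style message IDs such as ANU2503E or ANS1017E:
# "AN" plus one letter, four digits, then a severity letter.
TSM_MSG = re.compile(r"\b(AN[A-Z]\d{4}[IWES])\b")


def media_manager_errors(log_text):
    """Count the TSM message IDs found on the line after each ORA-19511."""
    codes = []
    lines = log_text.splitlines()
    for i, line in enumerate(lines):
        if line.startswith("ORA-19511") and i + 1 < len(lines):
            match = TSM_MSG.search(lines[i + 1])
            if match:
                codes.append(match.group(1))
    return Counter(codes)


# Sample lines copied from the arch.sh logs in this post.
sample = """\
ORA-19511: Error received from media manager layer, error text:
ANU2503E Backup object '/JTFXPDB//arch_13709_1' already exists on TSM Server.
ORA-19511: Error received from media manager layer, error text:
ANS1017E (RC-50) Session rejected: TCP/IP connection failure
ORA-19511: Error received from media manager layer, error text:
ANS1017E (RC-50) Session rejected: TCP/IP connection failure
"""

summary = media_manager_errors(sample)
for code, count in sorted(summary.items()):
    print(code, count)
```

On these logs it separates the two symptoms: one "object already exists" error on the first node and two TCP/IP session rejections on the second.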
Current session list:
tsm: TSM>q sess
Sess Number  Comm. Method  Sess State  Wait Time  Bytes Sent  Bytes Recvd  Sess Type  Platform       Client Name
-----------  ------------  ----------  ---------  ----------  -----------  ---------  -------------  -------------
16           Tcp/Ip        IdleW       2.0 H      446         371          Node       AIX            ECMRM2
17           Tcp/Ip        IdleW       1.1 H      1.6 K       2.0 K        Node       AIX            ECMRM2
18           Tcp/Ip        RecvW       39 S       1.4 K       6.5 G        Node       AIX            ECMRM2
19           Tcp/Ip        Run         0 S        629.8 K     11.4 K       Admin      Linux390       ADMIN
40           Tcp/Ip        IdleW       1.8 H      446         371          Node       AIX            ECMLS2
41           Tcp/Ip        IdleW       56.7 M     1.6 K       2.0 K        Node       AIX            ECMLS2
42           Tcp/Ip        RecvW       4 S        1.6 K       10.0 G       Node       AIX            ECMLS2
44           Tcp/Ip        RecvW       1.4 H      1.7 K       1.1 K        Node       TDPO LinuxZ64  JTFXPDB
53           Tcp/Ip        RecvW       1.3 H      1.7 K       1.1 K        Node       TDPO LinuxZ64  JTFXPDB
77           Tcp/Ip        RecvW       50.2 M     1.3 K       1.1 K        Node       TDPO LinuxZ64  RFIDPDB
78           Tcp/Ip        RecvW       1.0 H      1.3 K       1.1 K        Node       TDPO LinuxZ64  RFIDPDB
107          Tcp/Ip        RecvW       26.2 M     1.3 K       1.1 K        Node       TDPO LinuxZ64  RFIDPDB
108          Tcp/Ip        RecvW       21.1 M     1.3 K       1.1 K        Node       TDPO LinuxZ64  RFIDPDB
114          Tcp/Ip        Run         0 S        1.1 K       863          Node       DB2/LINUXZ64   $$_TSMDBMGR_$$
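The sessions that look stuck can be picked out of a listing like this with a short script. It is a rough sketch under my own assumptions: each session sits on one line (the wrapped Platform fragments rejoined), and the 10-minute threshold and the name `stalled_sessions` are arbitrary choices, not anything TSM defines.

```python
import re

UNIT_SECONDS = {"S": 1, "M": 60, "H": 3600}

# Session number, comm. method, state, wait value, wait unit, ..., client name.
ROW = re.compile(
    r"^\s*(\d+)\s+\S+\s+(\S+)\s+([\d.]+)\s+([SMH])\s+.*\s(\S+)\s*$"
)


def stalled_sessions(text, state="RecvW", min_wait_s=600):
    """Return (session, client, wait_seconds) for sessions stuck in `state`."""
    result = []
    for line in text.splitlines():
        m = ROW.match(line)
        if not m:
            continue  # header, separator or wrapped line
        sess, sess_state, value, unit, client = m.groups()
        wait_s = float(value) * UNIT_SECONDS[unit]
        if sess_state == state and wait_s >= min_wait_s:
            result.append((int(sess), client, wait_s))
    return result


# A few rows taken from the q sess output in this post.
sample = """\
18 Tcp/Ip RecvW 39 S 1.4 K 6.5 G Node AIX ECMRM2
42 Tcp/Ip RecvW 4 S 1.6 K 10.0 G Node AIX ECMLS2
44 Tcp/Ip RecvW 1.4 H 1.7 K 1.1 K Node TDPO LinuxZ64 JTFXPDB
77 Tcp/Ip RecvW 50.2 M 1.3 K 1.1 K Node TDPO LinuxZ64 RFIDPDB
108 Tcp/Ip RecvW 21.1 M 1.3 K 1.1 K Node TDPO LinuxZ64 RFIDPDB
"""

for sess, client, wait_s in stalled_sessions(sample):
    print(sess, client, round(wait_s / 60), "min")
```

In the listing above, the sessions it flags are all TDPO sessions for JTFXPDB and RFIDPDB, each with only about 1.1 K received, while the AIX sessions have received gigabytes and waited only seconds.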
Current drive mount status:
tsm: TSM>q mo
ANR8330I 3592 volume NJ0126 is mounted R/W in drive DRIVER10 (/dev/IBMtape28), status: IN USE.
ANR8330I 3592 volume NJ0109 is mounted R/W in drive DRIVER17 (/dev/IBMtape17), status: IN USE.
ANR8330I 3592 volume NJ0106 is mounted R/W in drive DRIVER16 (/dev/IBMtape16), status: IN USE.
ANR8330I 3592 volume NJ0081 is mounted R/W in drive DRIVER12 (/dev/IBMtape12), status: IN USE.
ANR8330I 3592 volume NJ0097 is mounted R/W in drive DRIVER13 (/dev/IBMtape13), status: IN USE.
ANR8330I 3592 volume NJ0054 is mounted R/W in drive DRIVER11 (/dev/IBMtape11), status: IN USE.
ANR8330I 3592 volume NJ0021 is mounted R/W in drive DRIVER14 (/dev/IBMtape14), status: IN USE.
ANR8330I 3592 volume NJ0129 is mounted R/W in drive DRIVER15 (/dev/IBMtape15), status: IN USE.
ANR8334I 8 matches found.
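The mount output can be cross-checked against the path listing with a few lines of script. Again just a sketch: `parse_mounts` and `PATH_DRIVES` are names I made up, and `PATH_DRIVES` is simply the set of drives that appear as destinations in the q path output as pasted above, which may not be the complete listing.

```python
import re

MOUNT = re.compile(
    r"ANR8330I \S+ volume (\S+) is mounted (\S+) in drive (\S+) "
    r"\((\S+)\), status: (.+?)\.\s*$"
)


def parse_mounts(text):
    """Turn ANR8330I lines into dicts with volume, mode, drive, device, status."""
    keys = ("volume", "mode", "drive", "device", "status")
    return [
        dict(zip(keys, m.groups()))
        for m in (MOUNT.search(line) for line in text.splitlines())
        if m
    ]


# Drives listed as path destinations in the q path output of this post.
PATH_DRIVES = {"DRIVER%d" % n for n in range(11, 18)}

# The q mo output from this post.
sample = """\
ANR8330I 3592 volume NJ0126 is mounted R/W in drive DRIVER10 (/dev/IBMtape28), status: IN USE.
ANR8330I 3592 volume NJ0109 is mounted R/W in drive DRIVER17 (/dev/IBMtape17), status: IN USE.
ANR8330I 3592 volume NJ0106 is mounted R/W in drive DRIVER16 (/dev/IBMtape16), status: IN USE.
ANR8330I 3592 volume NJ0081 is mounted R/W in drive DRIVER12 (/dev/IBMtape12), status: IN USE.
ANR8330I 3592 volume NJ0097 is mounted R/W in drive DRIVER13 (/dev/IBMtape13), status: IN USE.
ANR8330I 3592 volume NJ0054 is mounted R/W in drive DRIVER11 (/dev/IBMtape11), status: IN USE.
ANR8330I 3592 volume NJ0021 is mounted R/W in drive DRIVER14 (/dev/IBMtape14), status: IN USE.
ANR8330I 3592 volume NJ0129 is mounted R/W in drive DRIVER15 (/dev/IBMtape15), status: IN USE.
"""

mounts = parse_mounts(sample)
in_use = [m["drive"] for m in mounts if m["status"] == "IN USE"]
not_in_paths = {m["drive"] for m in mounts} - PATH_DRIVES

print("drives in use:", len(in_use))
print("mounted but not in the path listing:", sorted(not_in_paths))
```

It shows that all eight mounted drives report IN USE, and that DRIVER10 (/dev/IBMtape28) is mounted even though it does not appear in the path listing pasted earlier.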
The actlog is attached.