昨天nbu的几个备份策略突然报错
(98) error requesting media (tpreq)
具体的信息是这样的
06/18/2008 12:00:24 - requesting resource bwdb-hcart2 06/18/2008 12:00:24 - requesting resource bkup.NBU_CLIENT.MAXJOBS.bwdb 06/18/2008 12:00:24 - requesting resource bkup.NBU_POLICY.MAXJOBS.bw_db_arch 06/18/2008 12:00:25 - awaiting resource bkup.NBU_POLICY.MAXJOBS.bw_db_arch. Will retry logical later. 06/18/2008 12:24:19 - awaiting resource bwdb-hcart2. Waiting for resources. Reason: Robotic library is down on server, Media server: bwdb, Robot Type(Number): TLD(0), Media ID: N/A, Drive Name: N/A, Volume Pool: DB_bw_arch, Storage Unit: bwdb-hcart2, Drive Scan Host: N/A 06/18/2008 12:37:14 - granted resource bkup.NBU_CLIENT.MAXJOBS.bwdb 06/18/2008 12:37:14 - granted resource bkup.NBU_POLICY.MAXJOBS.bw_db_arch 06/18/2008 12:37:14 - granted resource U525L2 06/18/2008 12:37:14 - granted resource HPUltrium2-SCSI1 06/18/2008 12:37:14 - granted resource bwdb-hcart2 06/18/2008 12:37:17 - started process bpbrm (pid=15576) 06/18/2008 12:37:17 - connecting 06/18/2008 12:37:25 - connected; connect time: 0:00:00 06/18/2008 12:37:30 - mounting U525L2 06/18/2008 12:51:48 - Error bptm (pid=15589) error requesting media, TpErrno = Robot operation failed 06/18/2008 12:54:57 - Warning bptm (pid=15589) media id U525L2 load operation reported an error 06/18/2008 12:54:58 - current media U525L2 complete, requesting next media Any 06/18/2008 13:29:16 - Error bptm (pid=15589) NBEMM returned an extended error status: invalid error number (2005023) 06/18/2008 13:29:19 - end writing error requesting media (tpreq) (98)
手工发起备份报错如下
06/19/2008 09:02:29 - requesting resource r3prd-hcart2-robot-tld-0 06/19/2008 09:02:29 - requesting resource bkup.NBU_CLIENT.MAXJOBS.r3prd 06/19/2008 09:02:29 - requesting resource bkup.NBU_POLICY.MAXJOBS.r3_db_arch 06/19/2008 09:02:30 - awaiting resource r3prd-hcart2-robot-tld-0. Waiting for resources. Reason: Robotic library is down on server, Media server: r3prd, Robot Type(Number): TLD(0), Media ID: N/A, Drive Name: N/A, Volume Pool: DB_r3_arch, Storage Unit: r3prd-hcart2-robot-tld-0, Drive Scan Host: N/A
提示机械臂down掉了,使用ioscan检查带库状态
#[/]ioscan -fnCtape Class I H/W Path Driver S/W State H/W Type Description ========================================================================== tape 0 0/1/1/1.2.0 stape CLAIMED DEVICE HP C5683A /dev/rmt/0m /dev/rmt/0mnb /dev/rmt/c3t2d0BESTn /dev/rmt/c3t2d0DDSb /dev/rmt/0mb /dev/rmt/c3t2d0BEST /dev/rmt/c3t2d0BESTnb /dev/rmt/c3t2d0DDSn /dev/rmt/0mn /dev/rmt/c3t2d0BESTb /dev/rmt/c3t2d0DDS /dev/rmt/c3t2d0DDSnb tape 3 0/4/1/0.97.26.255.1.3.0 stape CLAIMED DEVICE HP Ultrium 2-SCSI /dev/rmt/3m /dev/rmt/3mn /dev/rmt/c18t3d0BEST /dev/rmt/c18t3d0BESTn /dev/rmt/3mb /dev/rmt/3mnb /dev/rmt/c18t3d0BESTb /dev/rmt/c18t3d0BESTnb tape 7 0/4/1/0.97.26.255.1.3.1 stape CLAIMED DEVICE HP Ultrium 2-SCSI /dev/rmt/4m /dev/rmt/4mn /dev/rmt/c18t3d1BEST /dev/rmt/c18t3d1BESTn /dev/rmt/4mb /dev/rmt/4mnb /dev/rmt/c18t3d1BESTb /dev/rmt/c18t3d1BESTnb tape 5 0/4/1/1.97.25.255.1.3.1 stape NO_HW DEVICE HP Ultrium 2-SCSI /dev/rmt/5m /dev/rmt/5mn /dev/rmt/c13t3d1BEST /dev/rmt/c13t3d1BESTn /dev/rmt/5mb /dev/rmt/5mnb /dev/rmt/c13t3d1BESTb /dev/rmt/c13t3d1BESTnb tape 6 0/4/1/1.97.25.255.1.3.2 stape NO_HW DEVICE HP Ultrium 2-SCSI /dev/rmt/6m /dev/rmt/6mn /dev/rmt/c13t3d2BEST /dev/rmt/c13t3d2BESTn /dev/rmt/6mb /dev/rmt/6mnb /dev/rmt/c13t3d2BESTb /dev/rmt/c13t3d2BESTnb
发现其中一块光纤卡上的两个设备连接不上了,再查看光纤卡的状态
#[/]ioscan -fnCfc Class I H/W Path Driver S/W State H/W Type Description ================================================================= fc 0 0/4/1/0 fcd CLAIMED INTERFACE HP 2Gb Dual Port PCI/PCI-X Fibre Channel Adapter (Port 1) /dev/fcd0 fc 1 0/4/1/1 fcd CLAIMED INTERFACE HP 2Gb Dual Port PCI/PCI-X Fibre Channel Adapter (Port 2) /dev/fcd1
光纤卡状态正常。到机房查看硬件设备状态,发现有一块SCSI卡无信号灯闪烁,于是重启带库,带库重启后状态正常,再扫描带机
#[/]ioscan -fnCtape Class I H/W Path Driver S/W State H/W Type Description ========================================================================== tape 0 0/1/1/1.2.0 stape CLAIMED DEVICE HP C5683A /dev/rmt/0m /dev/rmt/0mnb /dev/rmt/c3t2d0BESTn /dev/rmt/c3t2d0DDSb /dev/rmt/0mb /dev/rmt/c3t2d0BEST /dev/rmt/c3t2d0BESTnb /dev/rmt/c3t2d0DDSn /dev/rmt/0mn /dev/rmt/c3t2d0BESTb /dev/rmt/c3t2d0DDS /dev/rmt/c3t2d0DDSnb tape 3 0/4/1/0.97.26.255.1.3.0 stape CLAIMED DEVICE HP Ultrium 2-SCSI /dev/rmt/3m /dev/rmt/3mn /dev/rmt/c18t3d0BEST /dev/rmt/c18t3d0BESTn /dev/rmt/3mb /dev/rmt/3mnb /dev/rmt/c18t3d0BESTb /dev/rmt/c18t3d0BESTnb tape 7 0/4/1/0.97.26.255.1.3.1 stape CLAIMED DEVICE HP Ultrium 2-SCSI /dev/rmt/4m /dev/rmt/4mn /dev/rmt/c18t3d1BEST /dev/rmt/c18t3d1BESTn /dev/rmt/4mb /dev/rmt/4mnb /dev/rmt/c18t3d1BESTb /dev/rmt/c18t3d1BESTnb tape 5 0/4/1/1.97.25.255.1.3.1 stape CLAIMED DEVICE HP Ultrium 2-SCSI /dev/rmt/5m /dev/rmt/5mn /dev/rmt/c13t3d1BEST /dev/rmt/c13t3d1BESTn /dev/rmt/5mb /dev/rmt/5mnb /dev/rmt/c13t3d1BESTb /dev/rmt/c13t3d1BESTnb tape 6 0/4/1/1.97.25.255.1.3.2 stape CLAIMED DEVICE HP Ultrium 2-SCSI /dev/rmt/6m /dev/rmt/6mn /dev/rmt/c13t3d2BEST /dev/rmt/c13t3d2BESTn /dev/rmt/6mb /dev/rmt/6mnb /dev/rmt/c13t3d2BESTb /dev/rmt/c13t3d2BESTnb
OK,故障解决。