故障现象:
客户在某天正常完成NBU磁带出库后,过了3天后再进行出库,发现vault的速度特别慢,但是提示没有任何报错。
故障背景:
客户备份设备出问题,暂时停掉了备份,客户之前vault出库是正常的。
相关日志如下:
2021-7-6 18:09:49 - vault waiting for global lock
2021-7-6 18:09:49 - requesting resource nbuser.NBVAULT.MAXJOBS
2021-7-6 18:09:51 - vault global lock acquired
2021-7-6 18:09:51 - vault waiting for session ID lock
2021-7-6 18:09:51 - granted resource nbuser.NBVAULT.MAXJOBS
2021-7-6 18:09:51 - requesting resource nbuser.VAULT_CREATE_SESSION_ID.LOCK_TLD(0)_MyVault
2021-7-6 18:09:56 - vault session ID lock acquired
2021-7-6 18:09:56 - vault session ID lock released
2021-7-6 18:09:56 - granted resource nbuser.VAULT_CREATE_SESSION_ID.LOCK_TLD(0)_MyVault
2021-7-6 18:10:01 - vault waiting for duplication lock
2021-7-6 18:10:01 - requesting resource nbuser.VAULT_DUPLICATION.LOCK_TLD(0)_MyVault
2021-7-6 18:10:03 - vault duplication lock acquired
2021-7-6 18:10:03 - granted resource nbuser.VAULT_DUPLICATION.LOCK_TLD(0)_MyVault
2021-7-6 18:10:11 - begin Duplicating Images
2021-7-6 18:10:11 - vault duplication validation of 4 images failed: Already duplicated
2021-7-6 18:10:11 - end Duplicating Images; elapsed time 0:00:00
2021-7-6 18:10:12 - vault duplication lock released
2021-7-6 18:10:12 - Catalog Backup skipped
2021-7-6 18:10:12 - Eject skipped
2021-7-6 18:10:12 - Reports deferred
2021-7-6 18:10:12 - vault global lock released
the requested operation was successfully completed (0)
故障原因分析:
nbu 在vault出库时,是基于backup images来的,比如出库设置为出进8天某系统备份。举例如下:
7.6 日进行了正常的vault 出库操作。
7.7日 进行了再次出库(实际生产环境,如果nbu运行正常,数据库归档备份正常,每4小时对归档做一次备份,肯定会产生新的备份images),此时出库vault会正常出库。
但是:如果7.6-7.7之间nbu 故障,在7.6日至7.7之间nbu 没有进行备份作业,此时如果再进行vault
出库操作,虽然vault 作业正常,并同时启动了duplicate复制的作业,会发生再次vault的时候会特别慢,最后提示success,这种是正常现象。
vault的工作原因如此,无法进行修改,在vault的日志中也会给出一些信息vault duplication validation of 4 images failed: Already duplicated
以上,特此记录。