While running a restore task today, I ran into the following NetBackup problem:
1. The restore operation never finished, so I checked the jobs:
# bpdbjobs
JobID Type State Status Policy Schedule Client Dest Media Svr Active PID FATPipe
7129 Restore Active sol2 22233
7128 Image Delete Done 0 20130
Job 7129 stayed in the Active state indefinitely.
2. I first checked the tape media; there was plenty of free space:
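Spotting a restore that never leaves the Active state can be scripted. The following is a minimal sketch that parses the plain-text listing exactly as pasted above; the helper names and the set of job states are assumptions made for illustration, not part of NetBackup, and the state list is likely incomplete.

```python
# Hypothetical helpers: parse the plain-text bpdbjobs listing shown in this post.
# KNOWN_STATES is an assumed, non-exhaustive set of job states.
KNOWN_STATES = {"Queued", "Active", "Done", "Suspended", "Incomplete"}


def parse_jobs(text):
    """Return a list of dicts (job_id, type, state), one per job row."""
    jobs = []
    for line in text.splitlines():
        tokens = line.split()
        # Job rows start with a numeric job ID; this skips the prompt and header.
        if not tokens or not tokens[0].isdigit():
            continue
        # The job type may span several words ("Image Delete"), so take the
        # first token that is a known state and treat what precedes it as the type.
        for i in range(1, len(tokens)):
            if tokens[i] in KNOWN_STATES:
                jobs.append({
                    "job_id": int(tokens[0]),
                    "type": " ".join(tokens[1:i]),
                    "state": tokens[i],
                })
                break
    return jobs


def active_restores(text):
    """Return the IDs of restore jobs that are still Active."""
    return [j["job_id"] for j in parse_jobs(text)
            if j["type"] == "Restore" and j["state"] == "Active"]


sample = """# bpdbjobs
JobID Type State Status Policy Schedule Client Dest Media Svr Active PID FATPipe
7129 Restore Active sol2 22233
7128 Image Delete Done 0 20130
"""
print(active_restores(sample))
```

Calling this periodically and comparing the results would show whether the same job ID keeps appearing, which is the symptom described here.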
# available_media
media   media   robot   robot   robot   side/   ret     size        status
ID      type    type    #       slot    face    level   KBytes
----------------------------------------------------------------------------
CatalogBackup pool
0018L4  HCART   TLD     0       2       -       5       40311168    ACTIVE
0019L4  HCART   TLD     0       10      -       -       -           AVAILABLE
DataStore pool
NetBackup pool
None pool
NU03CU  HC_CLN  TLD     0       23      -       -       -           AVAILABLE
NU04CU  HC_CLN  TLD     0       16      -       -       -           AVAILABLE
Oracle pool
0002L4  HCART   TLD     0       21      -       3       2698358560  ACTIVE
0003L4  HCART   TLD     0       6       -       3       2297925856  ACTIVE
0000L4  HCART   TLD     0       19      -       -       -           AVAILABLE
0001L4  HCART   TLD     0       7       -       -       -           AVAILABLE
0004L4  HCART   TLD     0       15      -       -       -           AVAILABLE
0005L4  HCART   TLD     0       5       -       -       -           AVAILABLE
0006L4  HCART   TLD     0       18      -       -       -           AVAILABLE
0007L4  HCART   TLD     0       13      -       -       -           AVAILABLE
0008L4  HCART   TLD     0       14      -       -       -           AVAILABLE
0009L4  HCART   TLD     0       8       -       -       -           AVAILABLE
0010L4  HCART   TLD     0       12      -       -       -           AVAILABLE
0011L4  HCART   TLD     0       1       -       -       -           AVAILABLE
0012L4  HCART   TLD     0       22      -       -       -           AVAILABLE
0013L4  HCART   TLD     0       3       -       -       -           AVAILABLE
0014L4  HCART   TLD     0       4       -       -       -           AVAILABLE
0015L4  HCART   TLD     0       17      -       -       -           AVAILABLE
0016L4  HCART   TLD     0       9       -       -       -           AVAILABLE
0017L4  HCART   TLD     0       11      -       -       -           AVAILABLE
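To confirm at a glance that a shortage of tapes is not the cause, the listing can be summarized per volume pool. This is a rough sketch that assumes the whitespace-separated layout shown above (a line of the form "<name> pool" followed by nine-field media rows); the function names are made up for this example.

```python
def media_by_pool(text):
    """Group (media_id, media_type, status) tuples by volume pool."""
    pools = {}
    current = None
    for line in text.splitlines():
        tokens = line.split()
        if len(tokens) == 2 and tokens[1] == "pool":
            # A pool heading such as "Oracle pool"; register empty pools too.
            current = tokens[0]
            pools.setdefault(current, [])
        elif current is not None and len(tokens) == 9:
            # Media row: ID, media type, robot type, robot #, slot,
            # side/face, retention level, size in KBytes, status.
            pools[current].append((tokens[0], tokens[1], tokens[-1]))
    return pools


def available_data_tapes(text):
    """Count AVAILABLE tapes per pool, ignoring cleaning tapes (HC_CLN)."""
    return {
        pool: sum(1 for _, mtype, status in rows
                  if status == "AVAILABLE" and mtype != "HC_CLN")
        for pool, rows in media_by_pool(text).items()
    }


sample = """media media robot robot robot side/ ret size status
ID type type # slot face level KBytes
CatalogBackup pool
0018L4 HCART TLD 0 2 - 5 40311168 ACTIVE
0019L4 HCART TLD 0 10 - - - AVAILABLE
DataStore pool
None pool
NU03CU HC_CLN TLD 0 23 - - - AVAILABLE
Oracle pool
0002L4 HCART TLD 0 21 - 3 2698358560 ACTIVE
0000L4 HCART TLD 0 19 - - - AVAILABLE
0001L4 HCART TLD 0 7 - - - AVAILABLE
"""
print(available_data_tapes(sample))
```

Cleaning tapes are excluded because they show up as AVAILABLE in the None pool but cannot hold backup or restore data.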
3. The detailed error information for the job was:
awaiting resource cbshp02-hcart3-robot-tld-0. Waiting for resources.
Reason: Tape media server is not active, Media server: sol1
Robot Type(Number): TLD(0), Media ID: N/A, Drive Name: N/A,
Volume Pool: unix, Storage Unit: cbshp02-hcart3-robot-tld-0, Drive Scan Host: N/A
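The key facts in this message are the reason and the media server it names. A small sketch for pulling them out of the job detail text follows; it assumes the wording matches the message above and uses a hypothetical function name.

```python
import re


def parse_wait_reason(detail):
    """Extract reason, media server and storage unit from a job detail message.

    Returns None when the text has no "Reason: ..., Media server: ..." part.
    """
    match = re.search(
        r"Reason:\s*(?P<reason>[^,]+),\s*Media server:\s*(?P<server>\S+)",
        detail,
    )
    if match is None:
        return None
    unit = re.search(r"Storage Unit:\s*([^,\s]+)", detail)
    return {
        "reason": match.group("reason").strip(),
        "media_server": match.group("server"),
        "storage_unit": unit.group(1) if unit else None,
    }


detail = """awaiting resource cbshp02-hcart3-robot-tld-0. Waiting for resources.
Reason: Tape media server is not active, Media server: sol1
Robot Type(Number): TLD(0), Media ID: N/A, Drive Name: N/A,
Volume Pool: unix, Storage Unit: cbshp02-hcart3-robot-tld-0, Drive Scan Host: N/A
"""
print(parse_wait_reason(detail))
```

Here the result points at media server sol1 rather than at the tapes, which is why the next step looks at the media server itself.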
4. I checked the media server daemon: it was not running, for reasons unknown.
# vmoprcmd -d
oprd returned abnormal status (96)
IPC Error: Daemon may not be running
# vmoprcmd -activate_host -h sol1
oprd returned abnormal status (96)
IPC Error: Daemon may not be running
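Both commands fail with the same oprd error, so a monitoring script could look for that line in captured output. The sketch below only inspects text that has already been captured; the function name is hypothetical, and the pattern is taken from the two messages above.

```python
import re


def oprd_failure_status(output):
    """Return the oprd abnormal status code found in the output, or None."""
    match = re.search(r"oprd returned abnormal status \((\d+)\)", output)
    return int(match.group(1)) if match else None


failed = """oprd returned abnormal status (96)
IPC Error: Daemon may not be running
"""
print(oprd_failure_status(failed))
```

A return value of None means the error line was not present, not necessarily that the daemon is healthy.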
5. Reference documents:
http://www.symantec.com/business/support/index?page=content&id=TECH70502
http://www.symantec.com/connect/forums/media-server-not-active
http://www.ansonnotes.com/?p=106
6. I followed the steps given there:
Solution/Workaround:
1) Ran nbrbutil -resetMediaServer (preferably nbrbutil -resetall)
2) Restarted NetBackup daemons on media server - now tldd is running again
3) vmoprcmd now works fine, and the volume pools "came back."
7. Restart the media server (resetting its resource allocations first):
# nbrbutil -resetall
# nbrbutil -resetMediaServer all
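The recovery sequence can be written down as a small script so it is repeatable. The sketch below is an assumption-laden illustration: the command lines are copied from this post as typed, the daemon restart from the quoted workaround is left out because the post does not show the command used, and the function names are invented. Resetting resource allocations may disturb other jobs that are running, so the example only prints the commands unless a real runner is passed in.

```python
import subprocess


def recovery_commands(media_server):
    """Return the commands from this post, in the order they were run."""
    return [
        ["nbrbutil", "-resetall"],
        ["nbrbutil", "-resetMediaServer", "all"],  # argument as typed in the post
        ["vmoprcmd", "-activate_host", "-h", media_server],
    ]


def run_all(commands, runner=subprocess.run):
    """Run each command in order; stop at the first failure (check=True)."""
    for cmd in commands:
        runner(cmd, check=True)


def print_only(cmd, check):
    """Dry-run runner: show the command instead of executing it."""
    print(" ".join(cmd))


# Dry run: nothing is executed on the system.
run_all(recovery_commands("sol1"), runner=print_only)
```

Leaving out the runner argument would execute the commands through subprocess.run, which should only be done on the master server and with the consequences understood.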
8. Check that the media server is now up:
# vmoprcmd -activate_host -h sol1
# vmoprcmd
HOST STATUS
Host Name                                 Version Host Status
========================================= ======= ===========
sol1                                      655000  ACTIVE
PENDING REQUESTS
DRIVE STATUS
Drive Name              Label   Ready   RecMID  ExtMID  Wr.Enbl.  Type
    Host                    DrivePath                   Status
=============================================================================
HP.ULTRIUM4-SCSI.000    No      No                      No        hcart
    sol1                    /dev/rmt/1cbn               TLD
HP.ULTRIUM4-SCSI.001    No      No                      No        hcart
    sol1                    /dev/rmt/0cbn               TLD
After that, the restore completed.
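The check that the host is ACTIVE again can also be automated. This is a minimal sketch that reads the HOST STATUS section of vmoprcmd output in the layout shown above; the function name is hypothetical, and it assumes the version column is purely numeric, as it is here.

```python
def host_states(text):
    """Map host name to status using the HOST STATUS section of the output."""
    states = {}
    in_section = False
    for line in text.splitlines():
        stripped = line.strip()
        if stripped == "HOST STATUS":
            in_section = True
        elif stripped in ("PENDING REQUESTS", "DRIVE STATUS"):
            in_section = False
        elif in_section:
            tokens = stripped.split()
            # Data rows look like "sol1 655000 ACTIVE"; the column header
            # and the "=====" ruler do not have a numeric second field.
            if len(tokens) == 3 and tokens[1].isdigit():
                states[tokens[0]] = tokens[2]
    return states


sample = """HOST STATUS
Host Name Version Host Status
========================================= ======= ===========
sol1 655000 ACTIVE
PENDING REQUESTS
DRIVE STATUS
Drive Name Label Ready RecMID ExtMID Wr.Enbl. Type
Host DrivePath Status
HP.ULTRIUM4-SCSI.000 No No No hcart
sol1 /dev/rmt/1cbn TLD
"""
print(host_states(sample))
```

Restricting the parse to the HOST STATUS section keeps the drive rows, which also mention sol1, from being mistaken for host entries.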
Source: ITPUB blog, http://blog.itpub.net/7417660/viewspace-1055094/. Please credit the source when reposting.