Not all drives back in use after library failure?
Hi,
My StorageTek SL500 lost 2 PSUs and started telling me that it only had 2 drives at invalid addresses instead of 4 drives. Our remote team went into the data centre and shuffled the PSUs so we are now running with 3 ot of 4 PSUs and all 4 tape drives show online with valid addresses.
In NBU I gave up on the Device Monitor section of the GUI, and instead ran the commands:
/usr/openv/volmgr/bin/vmoprcmd -up 0
/usr/openv/volmgr/bin/vmoprcmd -up 1
/usr/openv/volmgr/bin/vmoprcmd -up 2
/usr/openv/volmgr/bin/vmoprcmd -up 3
and now it looks like everything is OK. I've started a few manual backups using at least 4 different volume pools, and then monitored the drives with:
/usr/openv/volmgr/bin/vmoprcmd -dPENDING REQUESTS<NONE>DRIVE STATUSDrv Type Control User Label RecMID ExtMID Ready Wr.Enbl. ReqId0 hcart TLD - No - 00 hcart TLD - No - 01 hcart TLD Yes DUB050 DUB050 Yes Yes 01 hcart TLD Yes DUB050 DUB050 Yes Yes 02 hcart TLD Yes DUB055 DUB055 Yes Yes 02 hcart TLD Yes DUB055 DUB055 Yes Yes 03 hcart TLD Yes DUB060 DUB060 Yes Yes 03 hcart TLD Yes DUB060 DUB060 Yes Yes 0ADDITIONAL DRIVE STATUSDrv DriveName Shared Assigned Comment0 HP.ULTRIUM4-SCSI.002 Yes -0 HP.ULTRIUM4-SCSI.002 Yes -1 HP.ULTRIUM4-SCSI.003 Yes nbu-master011 HP.ULTRIUM4-SCSI.003 Yes nbu-master012 HP.ULTRIUM4-SCSI.001 Yes nbu-master012 HP.ULTRIUM4-SCSI.001 Yes nbu-master013 HP.ULTRIUM4-SCSI.000 Yes nbu-master013 HP.ULTRIUM4-SCSI.000 Yes nbu-master01
A reboot would fix issues at OS level.
The only way to know why NBU would not/could not use the drive is to look at logs. If logs did not exist at the time that the issue was experienced, there is unfortunately no way to tell.
/var/adm/messages may contain some device info/errors.For future troubleshooting, enable logging as follows:
Add VERBOSE entry to /usr/openv/volmgr/vm.conf, then restart ltid.
Additional media manager/device info will now be logged to /var/adm/messagesCreate /usr/openv/netbackup/logs/bptm folder to log NBU I/O activity.

