Greetings,
Due to a hardware failure, I have recently replaced our Dell PowerVault 122T (LTO2) with a PowerVault 124T (LTO3). I then removed all instances of the LTO2 drive/robot from Netbackup and configured for the new robot/drive.
So, here are the details regarding the current configuration:
RHEL4 (Update 8)
Dell PowerVault 124T Autoloader (LTO3) at /dev/sg1
Quantum Ultrium 3 Drive (Internal to robot) at /dev/nst0
NetBackup 6.0 MP7
I have everything configured to the point that I can successfully carry out most operations (namely inventory, inquiry, label, drive and robot diagnostics). However, when I attempt to run a backup process, everything goes sour. In particular, the following happens:
02/11/2010 15:41:20 - requesting resource storage-unit-hcart3
02/11/2010 15:41:20 - requesting resource hemi.NBU_CLIENT.MAXJOBS.cygnus
02/11/2010 15:41:20 - requesting resource hemi.NBU_POLICY.MAXJOBS.http_mail
02/11/2010 15:41:21 - granted resource hemi.NBU_CLIENT.MAXJOBS.cygnus
02/11/2010 15:41:21 - granted resource hemi.NBU_POLICY.MAXJOBS.http_mail
02/11/2010 15:41:21 - granted resource 0002L3
02/11/2010 15:41:21 - granted resource QUANTUM.ULTRIUM3.000
02/11/2010 15:41:21 - granted resource storage-unit-hcart3
02/11/2010 15:41:22 - started process bpbrm (pid=2466)
02/11/2010 15:41:22 - connecting
02/11/2010 15:41:22 - connected; connect time: 0:00:00
02/11/2010 15:41:26 - mounting 0002L3
02/11/2010 15:43:00 - mounted 0002L3; mount time: 0:01:34
02/11/2010 15:43:00 - positioning 0002L3 to file 1
02/11/2010 15:43:09 - positioned 0002L3; position time: 0:00:09
02/11/2010 15:43:09 - begin writing
02/11/2010 16:13:14 - Error bptm (pid=2467) media manager terminated by parent process
02/11/2010 16:21:24 - Error bptm (pid=2467) ioctl (MTBSF) failed on media id 0002L3, drive index 0, Input/output error (bptm.c.8229)
02/11/2010 16:21:24 - end writing; write time: 0:38:15
network connection timed out (41)
Following this, the devices produce the following:
The Robot gets listed as "Enabled - No" and diagnostics produces "Failed - Communication failure with robot control daemon." across the board.
The Drive is listed as "Enabled - Yes" with a Drive Path of "MISSING_DRIVE:<serial>" and diagnostics fail with "Drive already in use, aborting test."
To resolve this, it seems I have to remove and readd the drive, and a power cycle of the robot seems to allow the robot to be communicated with once again.
I have been Googling to no end with very little success. If anyone has any insight, I would be most appreciative. Please do not hesitate to let me know if more information is required.
Thanks!