01-31-2011 07:38 AM
Hi,
I've an Netbackup 6.5.6 environment with a separate Master server and media servers.
The problem is that if i try to backup one of these media servers i receive following details of the job.
1/31/2011 3:51:04 PM - requesting resource Server01N2-hcart-robot-tld0
1/31/2011 3:51:04 PM - requesting resource Server03.eu.jnj.com.NBU_CLIENT.MAXJOBS.Server01n2.eu.jnj.com
1/31/2011 3:51:04 PM - requesting resource Server03.eu.jnj.com.NBU_POLICY.MAXJOBS.Server01n2
1/31/2011 3:51:04 PM - granted resource Server03.eu.jnj.com.NBU_CLIENT.MAXJOBS.Server01n2.eu.jnj.com
1/31/2011 3:51:04 PM - granted resource Server03.eu.jnj.com.NBU_POLICY.MAXJOBS.Server01n2
1/31/2011 3:51:04 PM - granted resource LN3328
1/31/2011 3:51:04 PM - granted resource R0-B1-DT011
1/31/2011 3:51:04 PM - granted resource Server01N2-hcart-robot-tld0
1/31/2011 3:51:11 PM - estimated 56869729 kbytes needed
1/31/2011 3:51:16 PM - mounting LN3328
1/31/2011 3:51:21 PM - current media LN3328 complete, requesting next resource Any
1/31/2011 3:51:35 PM - granted resource LN3328
1/31/2011 3:51:35 PM - granted resource R0-B1-DT012
1/31/2011 3:51:35 PM - granted resource Server01N2-hcart-robot-tld0
1/31/2011 3:51:38 PM - mounting LN3328
1/31/2011 3:51:54 PM - connecting
1/31/2011 3:51:54 PM - connected; connect time: 00:00:00
1/31/2011 3:52:02 PM - Error bptm(pid=5700) error requesting media, TpErrno = Robot operation failed
1/31/2011 3:52:02 PM - Warning bptm(pid=5700) media id LN3328 load operation reported an error
1/31/2011 3:52:07 PM - current media LN3328 complete, requesting next resource Any
1/31/2011 3:52:21 PM - granted resource LN3210
1/31/2011 3:52:21 PM - granted resource R0-B1-DT006
1/31/2011 3:52:21 PM - granted resource Server-hcart-robot-tld0
1/31/2011 3:52:22 PM - mounting LN3210
1/31/2011 3:52:26 PM - current media LN3210 complete, requesting next resource Any
1/31/2011 3:52:39 PM - granted resource LN3210
1/31/2011 3:52:39 PM - granted resource R0-B1-DT000
1/31/2011 3:52:39 PM - granted resource Server01N2-hcart-robot-tld0
1/31/2011 3:52:40 PM - mounting LN3210
1/31/2011 3:52:44 PM - current media LN3210 complete, requesting next resource Any
1/31/2011 3:52:58 PM - granted resource LN3176
1/31/2011 3:52:58 PM - granted resource R0-B1-DT011
1/31/2011 3:52:58 PM - granted resource Server01N2-hcart-robot-tld0
1/31/2011 3:52:59 PM - mounting LN3176
if i do not stop this job, it freezes all my volumes.
The attached drives seems to be Ok and i can not see where it comes from.
This wat bpcar logfile tells me:
4:23:31.796 PM: [2304.7704] <16> dtcp_write: TCP - failure: send socket (1852) (TCP 10054: Connection reset by peer)
4:23:31.796 PM: [2304.7704] <16> dtcp_write: TCP - failure: attempted to send 84 bytes
---- a lot of these entries
....
My other clients do not have this problem.
Thx in advance! :)
Kind regards,
Loko32
Solved! Go to Solution.
01-31-2011 09:58 PM
I also used to face invalid element address in ESL tape library
This generally indicates misplaced barcode in tape or misplaced tape in library that causes robot to be unable to move the tape in drive
Only soultion given by HP for this issue is to reboot the tape library and once reboot is completed ufreeze tapes through netbackup
There is no problem with nbu in this case , just reboot your library and things will be fine
Do you see any errors in front panel in library?
01-31-2011 08:12 AM
01-31-2011 10:22 AM
Create bptm log on this media server. Also add VERBOSE entry to vm.conf and stop/start NBU on media server. More info will be logged in bptm as well as OS logs (Event Viewer on Windows and Syslog on Unix)
Please also share which OS on this Media Server. Is this an SSO environment? Can you verify that drives on Media Server show up as Shared?
Have you verified that device mapping is correct? i.e OS device name mapped to correct robotic drive number.
01-31-2011 10:40 AM
Hi,
Thx for the quick response.
I 've checked the Library:
Model: ESL712e
FWvers.: 7.0
Drives: 20 LTO 4 drives
In the log files i can see following warnings:
01-31-2011 11:04 AM
"Invalid element address" points to a configuration mismatch.
Have you tried to delete devices and rescan again on this media server?
Does this media server have its own robot or sharing drives in a robot controlled by another media server?
01-31-2011 11:06 AM
Hi,
i've added the "VERBOSE" line to vm.conf
BPTM enabled. (attached)
Stop start NB
Start job
Os: Windows 2003 R2 Enterprise Edition
Service pack 2
The drives are shared.
Thx for helping me!
grts,
Loko32
01-31-2011 11:21 AM
Hi,
I've just done.
I've deleted the drives and run the wizard + checked the mappings.
drives are all up and running.
Start job same issue!
1 Robot and several media servers with shared drives.
01-31-2011 11:40 AM
Any errors in Event Viewer System and/or Application log?
Have you verified pbx-level comms with robot control host?
01-31-2011 09:58 PM
I also used to face invalid element address in ESL tape library
This generally indicates misplaced barcode in tape or misplaced tape in library that causes robot to be unable to move the tape in drive
Only soultion given by HP for this issue is to reboot the tape library and once reboot is completed ufreeze tapes through netbackup
There is no problem with nbu in this case , just reboot your library and things will be fine
Do you see any errors in front panel in library?
02-01-2011 03:01 AM
Hi karia,
Great, Lib rebooted and all ok! :)
Thx Karia and thx to all who spend time to support me.
Thumbs up for this forum!
Kind regards,
Loko32
02-01-2011 03:27 AM
Good to know issue sorted out
If you have installed command view TL, you can check out on event viewer on exactly what caused the error and find out media which caused the issue.
Also check out firmware version of tape library.. if new f/w is available you may have to conact HP