
Storage Unit issue!

loko32
Level 5
Partner Accredited

Hi,

 

I have a NetBackup 6.5.6 environment with a separate master server and media servers.

The problem is that when I try to back up one of these media servers, I get the following job details:

1/31/2011 3:51:04 PM - requesting resource Server01N2-hcart-robot-tld0
1/31/2011 3:51:04 PM - requesting resource Server03.eu.jnj.com.NBU_CLIENT.MAXJOBS.Server01n2.eu.jnj.com
1/31/2011 3:51:04 PM - requesting resource Server03.eu.jnj.com.NBU_POLICY.MAXJOBS.Server01n2
1/31/2011 3:51:04 PM - granted resource Server03.eu.jnj.com.NBU_CLIENT.MAXJOBS.Server01n2.eu.jnj.com
1/31/2011 3:51:04 PM - granted resource Server03.eu.jnj.com.NBU_POLICY.MAXJOBS.Server01n2
1/31/2011 3:51:04 PM - granted resource LN3328
1/31/2011 3:51:04 PM - granted resource R0-B1-DT011
1/31/2011 3:51:04 PM - granted resource Server01N2-hcart-robot-tld0
1/31/2011 3:51:11 PM - estimated 56869729 kbytes needed
1/31/2011 3:51:16 PM - mounting LN3328
1/31/2011 3:51:21 PM - current media LN3328 complete, requesting next resource Any
1/31/2011 3:51:35 PM - granted resource LN3328
1/31/2011 3:51:35 PM - granted resource R0-B1-DT012
1/31/2011 3:51:35 PM - granted resource Server01N2-hcart-robot-tld0
1/31/2011 3:51:38 PM - mounting LN3328
1/31/2011 3:51:54 PM - connecting
1/31/2011 3:51:54 PM - connected; connect time: 00:00:00
1/31/2011 3:52:02 PM - Error bptm(pid=5700) error requesting media, TpErrno = Robot operation failed    
1/31/2011 3:52:02 PM - Warning bptm(pid=5700) media id LN3328 load operation reported an error    
1/31/2011 3:52:07 PM - current media LN3328 complete, requesting next resource Any
1/31/2011 3:52:21 PM - granted resource LN3210
1/31/2011 3:52:21 PM - granted resource R0-B1-DT006
1/31/2011 3:52:21 PM - granted resource Server-hcart-robot-tld0
1/31/2011 3:52:22 PM - mounting LN3210
1/31/2011 3:52:26 PM - current media LN3210 complete, requesting next resource Any
1/31/2011 3:52:39 PM - granted resource LN3210
1/31/2011 3:52:39 PM - granted resource R0-B1-DT000
1/31/2011 3:52:39 PM - granted resource Server01N2-hcart-robot-tld0
1/31/2011 3:52:40 PM - mounting LN3210
1/31/2011 3:52:44 PM - current media LN3210 complete, requesting next resource Any
1/31/2011 3:52:58 PM - granted resource LN3176
1/31/2011 3:52:58 PM - granted resource R0-B1-DT011
1/31/2011 3:52:58 PM - granted resource Server01N2-hcart-robot-tld0
1/31/2011 3:52:59 PM - mounting LN3176
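
In the job details above, each tape is mounted, marked complete a few seconds later, and then another resource is requested. The following is a minimal Python sketch for tallying mounts and load errors per media ID from such a log. `summarize_job_log` is a hypothetical helper, not a NetBackup tool, and the regular expressions assume the exact line format shown in this post.

```python
import re
from collections import Counter

# Mount and error lines copied from the job details in this post.
JOB_LOG = """\
1/31/2011 3:51:16 PM - mounting LN3328
1/31/2011 3:51:21 PM - current media LN3328 complete, requesting next resource Any
1/31/2011 3:51:38 PM - mounting LN3328
1/31/2011 3:52:02 PM - Error bptm(pid=5700) error requesting media, TpErrno = Robot operation failed
1/31/2011 3:52:02 PM - Warning bptm(pid=5700) media id LN3328 load operation reported an error
1/31/2011 3:52:07 PM - current media LN3328 complete, requesting next resource Any
1/31/2011 3:52:22 PM - mounting LN3210
1/31/2011 3:52:26 PM - current media LN3210 complete, requesting next resource Any
1/31/2011 3:52:40 PM - mounting LN3210
1/31/2011 3:52:44 PM - current media LN3210 complete, requesting next resource Any
1/31/2011 3:52:59 PM - mounting LN3176
"""

# Assumed patterns, based only on the lines shown above.
MOUNT_RE = re.compile(r" - mounting (\S+)")
LOAD_ERROR_RE = re.compile(r"media id (\S+) load operation reported an error")


def summarize_job_log(log_text):
    """Return (mounts, load_errors): per-media counts as plain dicts."""
    mounts = Counter()
    load_errors = Counter()
    for line in log_text.splitlines():
        mount = MOUNT_RE.search(line)
        if mount:
            mounts[mount.group(1)] += 1
            continue
        error = LOAD_ERROR_RE.search(line)
        if error:
            load_errors[error.group(1)] += 1
    return dict(mounts), dict(load_errors)


mounts, load_errors = summarize_job_log(JOB_LOG)
for media_id, count in mounts.items():
    print(media_id, "mounts:", count, "load errors:", load_errors.get(media_id, 0))
```

Several tapes mounted repeatedly within about two minutes, with no data written in between, suggests the trouble lies in the robot or drive path and not in any single tape.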

 

If I do not stop this job, it freezes all my volumes.

The attached drives seem to be OK, and I cannot see where the problem comes from.

This is what the bpbkar log file tells me:

4:23:31.796 PM: [2304.7704] <16> dtcp_write: TCP - failure: send socket (1852) (TCP 10054: Connection reset by peer)

4:23:31.796 PM: [2304.7704] <16> dtcp_write: TCP - failure: attempted to send 84 bytes

---- a lot of these entries

....

My other clients do not have this problem.

 

 

Thx in advance! :)

 

Kind regards,

Loko32


Marianne
Level 6
Partner    VIP    Accredited Certified

Create a bptm log on this media server. Also add a VERBOSE entry to vm.conf and stop/start NBU on the media server. More information will then be logged in bptm as well as in the OS logs (Event Viewer on Windows, syslog on Unix).

Please also tell us which OS this media server runs. Is this an SSO environment? Can you verify that the drives on the media server show up as Shared?

Have you verified that the device mapping is correct, i.e. that each OS device name is mapped to the correct robotic drive number?
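
The logging steps above could be scripted roughly as follows. This is only a sketch: `enable_bptm_logging` is a hypothetical helper, and it assumes the usual Windows layout in which the bptm log folder is `NetBackup\logs\bptm` and vm.conf is `Volmgr\vm.conf` under the install path. Check the paths on your own media server before using anything like it.

```python
from pathlib import Path


def enable_bptm_logging(install_path):
    """Create the bptm log folder and add VERBOSE to vm.conf if missing.

    install_path is the NetBackup install directory (an assumption; pass
    whatever applies on your media server).
    """
    install = Path(install_path)

    # bptm writes a legacy log only if this folder exists.
    log_dir = install / "NetBackup" / "logs" / "bptm"
    log_dir.mkdir(parents=True, exist_ok=True)

    # Append a VERBOSE line to vm.conf unless one is already present.
    vm_conf = install / "Volmgr" / "vm.conf"
    lines = vm_conf.read_text().splitlines() if vm_conf.exists() else []
    if not any(line.strip() == "VERBOSE" for line in lines):
        lines.append("VERBOSE")
        vm_conf.parent.mkdir(parents=True, exist_ok=True)
        vm_conf.write_text("\n".join(lines) + "\n")

    return log_dir, vm_conf
```

The sketch does not restart anything; NetBackup still has to be stopped and started on the media server afterwards for the VERBOSE entry to take effect.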

loko32
Level 5
Partner Accredited

Hi,

 

Thx for the quick response.

I've checked the library:

Model: ESL712e

FWvers.: 7.0

Drives: 20 LTO 4 drives

In the log files I can see the following warnings:

The storage units are configured for LTO4 drives.
 
The labels on the Tapes are correct:
LN4062 LN4062L4 HCART TLD   0 Server03.eu.jnj.com  77 000_00000_TLD SCRATCH   3   - NetBackup   -  ---  Active
 
Any further ideas?
Thx in advance for any help!
 
grts,
Loko32

Marianne
Level 6
Partner    VIP    Accredited Certified

"Invalid element address" points to a configuration mismatch.

Have you tried to delete devices and rescan again on this media server?

Does this media server have its own robot or sharing drives in a robot controlled by another media server?

loko32
Level 5
Partner Accredited

Hi,

 

I've added the "VERBOSE" line to vm.conf.

bptm logging is enabled (log attached).

Stopped and started NetBackup.

Started the job.

Os: Windows 2003 R2 Enterprise Edition

Service pack 2

The drives are shared.

 

Thx for helping me!

 

grts,

Loko32

loko32
Level 5
Partner Accredited

Hi,

 

I've just done that.

I deleted the drives, ran the wizard and checked the mappings.

The drives are all up and running.

Started the job: same issue!

There is one robot and several media servers with shared drives.

Marianne
Level 6
Partner    VIP    Accredited Certified

Any errors in Event Viewer System and/or Application log?

Have you verified PBX-level communication with the robot control host?

Amit_Karia
Level 6

I also used to get "invalid element address" errors on an ESL tape library.

This generally indicates a misplaced barcode on a tape or a misplaced tape in the library, which leaves the robot unable to move the tape into the drive.

The only solution HP gave for this issue is to reboot the tape library and, once the reboot has completed, unfreeze the tapes through NetBackup.

There is no problem with NBU in this case. Just reboot your library and things will be fine.

Do you see any errors on the front panel of the library?
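
For the unfreeze step after the library reboot, a small Python sketch can build one `bpmedia -unfreeze -m <media_id>` command per tape. `unfreeze_commands` is a hypothetical helper, and the media IDs below are just the ones that appear in the job details earlier in this thread; substitute whichever tapes are actually frozen. The sketch only prints the commands and does not run them.

```python
def unfreeze_commands(media_ids, bpmedia="bpmedia"):
    """Build one bpmedia unfreeze command (as an argument list) per media ID.

    bpmedia may be a full path to the executable; the default assumes it is
    on PATH, which may not be true on your server.
    """
    return [[bpmedia, "-unfreeze", "-m", media_id] for media_id in media_ids]


# Example media IDs taken from the job details in this thread.
for command in unfreeze_commands(["LN3328", "LN3210", "LN3176"]):
    print(" ".join(command))
```

Printing the commands first makes it easy to review the list before running each one on the appropriate server.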

loko32
Level 5
Partner Accredited

Hi karia,

 

Great, library rebooted and all OK! :)

Thx Karia, and thx to all who spent time supporting me.

Thumbs up for this forum!

 

Kind regards,

Loko32

Amit_Karia
Level 6

Good to know the issue is sorted out.

If you have Command View TL installed, you can check its event viewer to see exactly what caused the error and which media caused the issue.

Also check the firmware version of the tape library. If new firmware is available, you may have to contact HP.