Incremental backups for HP-UX client fail intermittently with status 40

WayneLackey
Level 5

Environment:

Master and media servers: Windows Server 2008 running NBU 7.6.0.1

Client: HP-UX B.11.31 running NBU client 7.0

Problem: One HP-UX client has recently started having intermittent backup failures on its incremental backups. The incremental backup kicks off as normal, it starts writing for a bit, then fails. Full backups are successful. I am sometimes able to re-run the incremental backup in the morning with success, sometimes not. Backup job detail follows:

2/4/2014 8:38:43 AM - Info nbjm(pid=20084) starting backup job (jobid=2911697) for client bkod4mishpv01, policy MIS_FF_SuperDome_Weekly, schedule Differential 
2/4/2014 8:38:43 AM - Info nbjm(pid=20084) requesting STANDARD_RESOURCE resources from RB for backup job (jobid=2911697, request id:{1A65565B-42D4-4C9B-A135-DDAAF3272B9C})
2/4/2014 8:38:43 AM - requesting resource Leveraged_DSSU
2/4/2014 8:38:43 AM - requesting resource bkusolcwbdcs001.NBU_CLIENT.MAXJOBS.bkod4mishpv01
2/4/2014 8:38:43 AM - requesting resource bkusolcwbdcs001.NBU_POLICY.MAXJOBS.MIS_FF_SuperDome_Weekly
2/4/2014 8:39:11 AM - granted resource bkusolcwbdcs001.NBU_CLIENT.MAXJOBS.bkod4mishpv01
2/4/2014 8:39:11 AM - granted resource bkusolcwbdcs001.NBU_POLICY.MAXJOBS.MIS_FF_SuperDome_Weekly
2/4/2014 8:39:11 AM - granted resource MediaID=@aaad1;DiskVolume=W:\;DiskPool=DCS006-W-R6;Path=W:\;StorageServer=bkusolpwbdcs006;MediaServe...
2/4/2014 8:39:11 AM - granted resource DCS006-W-R6
2/4/2014 8:39:13 AM - estimated 41281958 Kbytes needed
2/4/2014 8:39:13 AM - Info nbjm(pid=20084) started backup (backupid=bkod4mishpv01_1391521152) job for client bkod4mishpv01, policy MIS_FF_SuperDome_Weekly, schedule Differential on storage unit DCS006-W-R6
2/4/2014 8:39:15 AM - started process bpbrm (16244)
2/4/2014 8:39:16 AM - connecting
2/4/2014 8:39:17 AM - Info bpbrm(pid=16244) bkod4mishpv01 is the host to backup data from    
2/4/2014 8:39:17 AM - Info bpbrm(pid=16244) reading file list for client       
2/4/2014 8:39:19 AM - Info bpbrm(pid=16244) starting bpbkar32 on client        
2/4/2014 8:39:19 AM - connected; connect time: 0:00:03
2/4/2014 8:39:20 AM - Info bpbkar32(pid=0) Backup started          
2/4/2014 8:39:20 AM - Info bptm(pid=10196) start           
2/4/2014 8:39:20 AM - Info bptm(pid=10196) using 524288 data buffer size       
2/4/2014 8:39:20 AM - Info bptm(pid=10196) setting receive network buffer to 2098176 bytes     
2/4/2014 8:39:20 AM - Info bptm(pid=10196) using 48 data buffers        
2/4/2014 8:39:21 AM - Info bptm(pid=10196) start backup          
2/4/2014 8:39:23 AM - Info bptm(pid=10196) backup child process is pid 13924.13980      
2/4/2014 8:39:23 AM - Info bptm(pid=13924) start           
2/4/2014 8:39:23 AM - begin writing
2/4/2014 8:40:00 AM - Info bpbrm(pid=16244) from client bkod4mishpv01: TRV - /var/hpsrp/usolinsp514/var/spool/sockets/pwgr/client21297 is a socket special file. Skipping.
2/4/2014 8:40:00 AM - Info bpbrm(pid=16244) from client bkod4mishpv01: TRV - /var/hpsrp/usolinsp514/var/spool/sockets/pwgr/client21463 is a socket special file. Skipping.
2/4/2014 8:41:26 AM - Info bpbrm(pid=16244) from client bkod4mishpv01: TRV - /var/hpsrp/mishp1ap/export/home/oracle is in a different file system from /var/hpsrp/mishp1ap/export/home. Skipping.
2/4/2014 8:43:13 AM - Info bpbrm(pid=16244) from client bkod4mishpv01: TRV - /var/hpsrp/mishp1ap/u01/app/oracle is in a different file system from /var/hpsrp/mishp1ap/u01/app. Skipping.
2/4/2014 8:43:57 AM - Info bpbrm(pid=16244) from client bkod4mishpv01: TRV - /var/hpsrp/mishp1ap/var/spool/sockets/pwgr/client16831 is a socket special file. Skipping.
... (cut out more skipped files/folder for the sake of brevity)
2/4/2014 9:07:48 AM - Info bpbrm(pid=16244) from client bkod4mishpv01: TRV - /var/hpsrp/mishp1db/u27/oracle/oradata is in a different file system from /var/hpsrp/mishp1db. Skipping.
2/4/2014 9:07:48 AM - Info bpbrm(pid=16244) from client bkod4mishpv01: TRV - /var/hpsrp/mishp1db/tmpsort is in a different file system from /var/hpsrp/mishp1db. Skipping.
2/4/2014 9:07:48 AM - Info bpbrm(pid=16244) from client bkod4mishpv01: TRV - /var/hpsrp/mishp1db/dbtmp is in a different file system from /var/hpsrp/mishp1db. Skipping.
2/4/2014 9:40:10 AM - Info bpbrm(pid=16244) from client bkod4mishpv01: TRV - /var/hpsrp/mishp0ap/opt/oracle is in a different file system from /var/hpsrp/mishp0ap. Skipping.
2/4/2014 9:50:30 AM - Info bpbkar32(pid=0) done. status: 40: network connection broken      
2/4/2014 9:50:30 AM - end writing; write time: 1:11:07
network connection broken(40)

Any ideas as to what I should look at first?

Thanks,

Wayne

2 REPLIES

sri_vani
Level 6
Partner

I think the issue in both of your posts will be resolved if you exclude the /dev directory.

Try the backup with the /dev directory excluded, as it is not required,

OR add IGNORE_XATTR = YES to bp.conf

Ref links:

http://www.symantec.com/business/support/index?page=content&id=TECH73719

http://www.symantec.com/business/support/index?page=content&id=TECH71070

Mark_Solutions
Level 6
Partner Accredited Certified

Because of the files it is reading and skipping, the client is taking a long time to prepare its data stream to pass to the media server.

You will see that the job fails about 10 minutes after the last message from the client (9:40:10 to 9:50:30), which is longer than 300 seconds.

300 seconds is the default client read timeout, so try changing that to 1800 initially. Do this on the Timeout tab of the media server's Host Properties (not the client's).
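
If you want to check the timeout theory against the job details, a rough Python sketch like the one below measures the quiet period just before the status 40 line. It is only an illustration: the helper names (parse_ts, gap_before_failure) are made up here, and it assumes the "M/D/YYYY h:mm:ss AM" timestamp layout shown in the job detail above. Bear in mind that the job details only show when messages were logged, not whether data was actually flowing, so treat the result as a hint rather than proof.

```python
from datetime import datetime

# Timestamp layout used in the job details above, e.g. "2/4/2014 9:50:30 AM".
TS_FORMAT = "%m/%d/%Y %I:%M:%S %p"


def parse_ts(line):
    """Return the leading timestamp of a job-detail line, or None if absent."""
    head, sep, _ = line.partition(" - ")
    if not sep:
        return None
    try:
        return datetime.strptime(head.strip(), TS_FORMAT)
    except ValueError:
        return None


def gap_before_failure(lines, marker="status: 40"):
    """Seconds between the line containing `marker` and the previous stamped line."""
    previous = None
    for line in lines:
        stamp = parse_ts(line)
        if stamp is None:
            continue  # continuation lines and the final summary carry no timestamp
        if marker in line and previous is not None:
            return (stamp - previous).total_seconds()
        previous = stamp
    return None


# The last two stamped lines from the job detail posted above.
sample = [
    "2/4/2014 9:40:10 AM - Info bpbrm(pid=16244) from client bkod4mishpv01: "
    "TRV - /var/hpsrp/mishp0ap/opt/oracle is in a different file system "
    "from /var/hpsrp/mishp0ap. Skipping.",
    "2/4/2014 9:50:30 AM - Info bpbkar32(pid=0) done. status: 40: "
    "network connection broken",
]

gap = gap_before_failure(sample)
print(f"quiet period before failure: {gap:.0f} s")
for timeout in (300, 1800):  # default and suggested client read timeout
    print(f"  exceeds {timeout} s timeout: {gap > timeout}")
```

For the two lines quoted here the gap comes to 620 seconds, which is over the 300 second default but well under 1800. Paste in the full job detail to run the same check on other failed jobs.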

Hope this helps