unable to backup particular folder
we have couple of WAN backups in my environment.I have enabled multi streaming.usually it runs quite long time but completes with in schedule/some times after manual re run.
But from last one week two clients unable to take f drive folders failing with 13read file error.
I have increased time settings that dint help me to fix this issue.i reverted the changes now and tried again ..dint wrk out.
Client A:detail log:recent:
9/19/2012 4:29:50 PM - requesting resource ppme307-hcart2-robot-tld-0
9/19/2012 4:29:50 PM - requesting resource ppme304.NBU_CLIENT.MAXJOBS.fps-pst-01
9/19/2012 4:29:50 PM - requesting resource ppme304.NBU_POLICY.MAXJOBS.win_pst
9/19/2012 4:29:50 PM - Info nbrb(pid=7864518) Limit has been reached for the logical resource ppme304.NBU_CLIENT.MAXJOBS.fps-pst-01
9/19/2012 4:30:21 PM - awaiting resource ppme307-hcart2-robot-tld-0 - No drives are available
9/19/2012 4:36:02 PM - granted resource ppme304.NBU_CLIENT.MAXJOBS.fps-pst-01
9/19/2012 4:36:02 PM - granted resource ppme304.NBU_POLICY.MAXJOBS.win_pst
9/19/2012 4:36:02 PM - granted resource 0066L5
9/19/2012 4:36:02 PM - granted resource IBM.ULT3580-TD5.001
9/19/2012 4:36:02 PM - granted resource ppme307-hcart2-robot-tld-0
9/19/2012 4:36:02 PM - estimated 99010686 Kbytes needed
9/19/2012 4:36:03 PM - started process bpbrm (23527534)
9/19/2012 4:36:12 PM - mounting 0066L5
9/19/2012 4:36:15 PM - connecting
9/19/2012 4:36:17 PM - connected; connect time: 00:00:02
9/19/2012 4:36:26 PM - Warning bpbrm(pid=4063288) from client fps-pst-01: WRN - can't open file: F:\Spool\FP04824.SHD (WIN32 32: The process cannot access the file because it is being used by another process. )
9/19/2012 4:36:27 PM - Warning bpbrm(pid=4063288) from client fps-pst-01: WRN - can't open file: F:\System Volume Information\SRM\quota.md (WIN32 32: The process cannot access the file because it is being used by another process. )
9/19/2012 4:36:43 PM - mounted; mount time: 00:00:31
9/19/2012 4:36:43 PM - positioning 0066L5 to file 1196
9/19/2012 4:37:34 PM - positioned 0066L5; position time: 00:00:51
9/19/2012 4:37:34 PM - begin writing
9/19/2012 9:19:29 PM - Error bpbrm(pid=4063288) socket read failed: errno = 119 - System call timed out
9/19/2012 9:19:32 PM - end writing; write time: 04:41:58
file read failed(13)
9/20/2012 10:34:05 AM - job 52868 was restarted as job 53002
Client B Activty log:
9/19/2012 10:54:01 PM - requesting resource ppme307-hcart2-robot-tld-0
9/19/2012 10:54:01 PM - requesting resource ppme304.NBU_CLIENT.MAXJOBS.inf-pst-01
9/19/2012 10:54:01 PM - requesting resource ppme304.NBU_POLICY.MAXJOBS.win_pst
9/19/2012 10:54:01 PM - granted resource ppme304.NBU_CLIENT.MAXJOBS.inf-pst-01
9/19/2012 10:54:01 PM - granted resource ppme304.NBU_POLICY.MAXJOBS.win_pst
9/19/2012 10:54:01 PM - granted resource 0066L5
9/19/2012 10:54:01 PM - granted resource IBM.ULT3580-TD5.002
9/19/2012 10:54:01 PM - granted resource ppme307-hcart2-robot-tld-0
9/19/2012 10:54:01 PM - estimated 217546741 Kbytes needed
9/19/2012 10:54:02 PM - started process bpbrm (10420398)
9/19/2012 10:54:07 PM - mounting 0066L5
9/19/2012 10:54:07 PM - connecting
9/19/2012 10:54:07 PM - connected; connect time: 00:00:00
9/19/2012 10:54:40 PM - mounted; mount time: 00:00:33
9/19/2012 10:54:40 PM - positioning 0066L5 to file 1277
9/19/2012 10:55:40 PM - positioned 0066L5; position time: 00:01:00
9/19/2012 10:55:40 PM - begin writing
9/20/2012 2:14:59 AM - Error bpbrm(pid=23068702) socket read failed: errno = 119 - System call timed out
9/20/2012 2:21:33 AM - Error bptm(pid=12451988) read of TIR file failed: the file list is incomplete
9/20/2012 2:21:33 AM - end writing; write time: 03:25:53
file read failed(13)
9/20/2012 10:34:46 AM - job 52882 was restarted as job 53003
All we can see in the log is a break in network connection:
4:51:54.533 PM: [908.5244] <2> tar_base::V_vTarMsgM: DIR - 316 132540 45811565 31 16832 root root 0 1348644992 1315517488 1315517487 /F/u1/s9014079/New Restore PST/ 4:55:02.895 PM: [908.5244] <16> dtcp_write: TCP - failure: send socket (1848) (TCP 10054: Connection reset by peer) 4:55:02.895 PM: [908.5244] <16> dtcp_write: TCP - failure: attempted to send 1 bytes (Lots of these 2 lines......) 4:59:16.648 PM: [908.5244] <2> tar_base::V_vTarMsgW: FTL - tar file write error (40) 4:59:16.664 PM: [908.5244] <2> tar_base::V_vTarMsgW: INF - EXIT STATUS 14: file write failed
Have a look at troubleshooting steps listed for Status 14:■ On Windows systems, the client bpbkar log may contain a 10054 ConnectionReset Error (usually indicates a hardware error). Somewhere between theNetBackup client and server, the connection was reset. When NetBackupreceives this error, it cannot continue the backup. This error has been attributedto the following:■ A hiccup in the network.■ A bad network interface card on a NetBackup client.■ A bad network interface card on the NetBackup server.■ Faulty routers.■ Any other applications that interfere with NetBackup connections.You may want to upgrade to NBU 7.5 and read up on the new Wan Resilient feature.
See this White Paper: http://www.symantec.com/docs/TECH183552
Prior to NetBackup 7.5 release, any disconnection in network over WAN environment will cause backups to fail....
NetBackup 7.5 includes a feature, Resilient Network, which will help data transfers over a WAN environment by becoming resilient to WAN interruptions.
Refer attached document for more details.