Forum Discussion

sri_vani's avatar
sri_vani
Level 6
12 years ago

unable to backup particular folder

we have couple of WAN backups in my environment.I have enabled multi streaming.usually it runs quite long time but completes with in schedule/some times after manual re run.

But from last one week two clients unable to take f drive folders failing with 13read file error.

I have increased time settings that dint help me to fix this issue.i reverted the changes now and tried again ..dint wrk out.

Client A:detail log:recent:

9/19/2012 4:29:50 PM - requesting resource ppme307-hcart2-robot-tld-0
9/19/2012 4:29:50 PM - requesting resource ppme304.NBU_CLIENT.MAXJOBS.fps-pst-01
9/19/2012 4:29:50 PM - requesting resource ppme304.NBU_POLICY.MAXJOBS.win_pst
9/19/2012 4:29:50 PM - Info nbrb(pid=7864518) Limit has been reached for the logical resource ppme304.NBU_CLIENT.MAXJOBS.fps-pst-01    
9/19/2012 4:30:21 PM - awaiting resource ppme307-hcart2-robot-tld-0 - No drives are available
9/19/2012 4:36:02 PM - granted resource ppme304.NBU_CLIENT.MAXJOBS.fps-pst-01
9/19/2012 4:36:02 PM - granted resource ppme304.NBU_POLICY.MAXJOBS.win_pst
9/19/2012 4:36:02 PM - granted resource 0066L5
9/19/2012 4:36:02 PM - granted resource IBM.ULT3580-TD5.001
9/19/2012 4:36:02 PM - granted resource ppme307-hcart2-robot-tld-0
9/19/2012 4:36:02 PM - estimated 99010686 Kbytes needed
9/19/2012 4:36:03 PM - started process bpbrm (23527534)
9/19/2012 4:36:12 PM - mounting 0066L5
9/19/2012 4:36:15 PM - connecting
9/19/2012 4:36:17 PM - connected; connect time: 00:00:02
9/19/2012 4:36:26 PM - Warning bpbrm(pid=4063288) from client fps-pst-01: WRN - can't open file: F:\Spool\FP04824.SHD (WIN32 32: The process cannot access the file because it is being used by another process. )
9/19/2012 4:36:27 PM - Warning bpbrm(pid=4063288) from client fps-pst-01: WRN - can't open file: F:\System Volume Information\SRM\quota.md (WIN32 32: The process cannot access the file because it is being used by another process. )
9/19/2012 4:36:43 PM - mounted; mount time: 00:00:31
9/19/2012 4:36:43 PM - positioning 0066L5 to file 1196
9/19/2012 4:37:34 PM - positioned 0066L5; position time: 00:00:51
9/19/2012 4:37:34 PM - begin writing
9/19/2012 9:19:29 PM - Error bpbrm(pid=4063288) socket read failed: errno = 119 - System call timed out  
9/19/2012 9:19:32 PM - end writing; write time: 04:41:58
file read failed(13)
9/20/2012 10:34:05 AM - job 52868 was restarted as job 53002
 

Client B Activty log:

 

 

9/19/2012 10:54:01 PM - requesting resource ppme307-hcart2-robot-tld-0

9/19/2012 10:54:01 PM - requesting resource ppme304.NBU_CLIENT.MAXJOBS.inf-pst-01

9/19/2012 10:54:01 PM - requesting resource ppme304.NBU_POLICY.MAXJOBS.win_pst

9/19/2012 10:54:01 PM - granted resource ppme304.NBU_CLIENT.MAXJOBS.inf-pst-01

9/19/2012 10:54:01 PM - granted resource ppme304.NBU_POLICY.MAXJOBS.win_pst

9/19/2012 10:54:01 PM - granted resource 0066L5

9/19/2012 10:54:01 PM - granted resource IBM.ULT3580-TD5.002

9/19/2012 10:54:01 PM - granted resource ppme307-hcart2-robot-tld-0

9/19/2012 10:54:01 PM - estimated 217546741 Kbytes needed

9/19/2012 10:54:02 PM - started process bpbrm (10420398)

9/19/2012 10:54:07 PM - mounting 0066L5

9/19/2012 10:54:07 PM - connecting

9/19/2012 10:54:07 PM - connected; connect time: 00:00:00

9/19/2012 10:54:40 PM - mounted; mount time: 00:00:33

9/19/2012 10:54:40 PM - positioning 0066L5 to file 1277

9/19/2012 10:55:40 PM - positioned 0066L5; position time: 00:01:00

9/19/2012 10:55:40 PM - begin writing

9/20/2012 2:14:59 AM - Error bpbrm(pid=23068702) socket read failed: errno = 119 - System call timed out 

9/20/2012 2:21:33 AM - Error bptm(pid=12451988) read of TIR file failed: the file list is incomplete  

9/20/2012 2:21:33 AM - end writing; write time: 03:25:53

file read failed(13)

9/20/2012 10:34:46 AM - job 52882 was restarted as job 53003

  • All we can see in the log is a break in network connection:

     4:51:54.533 PM: [908.5244] <2> tar_base::V_vTarMsgM: DIR - 316 132540 45811565 31 16832 root root 0 1348644992 1315517488 1315517487 /F/u1/s9014079/New Restore PST/
    4:55:02.895 PM: [908.5244] <16> dtcp_write: TCP - failure: send socket (1848) (TCP 10054: Connection reset by peer)
    4:55:02.895 PM: [908.5244] <16> dtcp_write: TCP - failure: attempted to send 1 bytes
    (Lots of these 2 lines......)
    4:59:16.648 PM: [908.5244] <2> tar_base::V_vTarMsgW: FTL - tar file write error (40)
    4:59:16.664 PM: [908.5244] <2> tar_base::V_vTarMsgW: INF - EXIT STATUS 14: file write failed
     
    Have a look at troubleshooting steps listed for Status 14:
     
    ■ On Windows systems, the client bpbkar log may contain a 10054 Connection
    Reset Error (usually indicates a hardware error). Somewhere between the
    NetBackup client and server, the connection was reset. When NetBackup
    receives this error, it cannot continue the backup. This error has been attributed
    to the following:
    ■ A hiccup in the network.
    ■ A bad network interface card on a NetBackup client.
    ■ A bad network interface card on the NetBackup server.
    ■ Faulty routers.
    ■ Any other applications that interfere with NetBackup connections.

     

    You may want to upgrade to NBU 7.5 and read up on the new Wan Resilient feature.

    See this White Paper: http://www.symantec.com/docs/TECH183552

    Prior to NetBackup 7.5 release, any disconnection in network over WAN environment will cause backups to fail.
    ...

     NetBackup 7.5 includes a feature, Resilient Network, which will help data transfers over a WAN environment by becoming resilient to WAN interruptions.

    Refer attached document for more details.

     

15 Replies