Forum Discussion

CamaroSS's avatar
CamaroSS
Level 3
10 years ago

socket operation failed - 10054 (at child.c.1293)

Hello,

I need some help. I am trying to backup CIFS. I have created the windows policy, selected a client as host, enter UNC path, created an AD account with rights to the CIFS shares and started the Netbackup client with it. When I start the job, I get the error socket operation failed - 10054 (at child.c.1293). It also says "file read failed (13)

Can anyone see what I'm doing wrong?

thanks

  • How do you specify your UNC path? 

    Check this out: http://www.symantec.com/docs/TECH198175

  • Is NBU Client Service on the client started with the domain account?

    After how long does it fail?

    Is any data actually transferred?

    Please show us all text in Activity Monitor Details tab as well as policy config:

    bppllist <policy-name> -U

    You may want to ask your network team to monitor comms between NAS -> Client -> Media server.

  • It seems this is the actual problem : Waiting for mount of media id  000033 (copy 1) on server caharrisstlap02.ca.state.sbu. 9/22/2015 4:08:28 PM - Info bptm(pid=4776) start            9/22/2015 4:08:28 PM - mounting 000033 9/22/2015 4:08:30 PM - Error bptm(pid=3924) error requesting media, TpErrno =  Robot operation failed     You need to troubleshoot this tape mount issue first to see if this is the actual problem causing the rest of the processes to fail with status 13.
  • thank you. to answer your questions:

    yes, the client service is started with domain account.

    it fails right away after start up of job.

    no data is being transferred.

    in the backup policy, I have selected backup network drives.

    The activity monitor is below:

    9/22/2015 4:08:12 PM - Info nbjm(pid=3760) starting backup job (jobid=2943) for client Act2, policy CANSSSTORE2_WEEKLY, schedule Full 
    9/22/2015 4:08:12 PM - Info nbjm(pid=3760) requesting STANDARD_RESOURCE resources from RB for backup job (jobid=2943, request id:{2C9067C3-002D-4497-AC33-8177BF3DBB9B}) 
    9/22/2015 4:08:12 PM - requesting resource caharriss2-hcart-robot-tld-0
    9/22/2015 4:08:12 PM - requesting resource caharris..NBU_CLIENT.MAXJOBS.Act2
    9/22/2015 4:08:12 PM - requesting resource caharrisstlap01.ca.state.sbu.NBU_POLICY.MAXJOBS.CANSSSTORE2_WEEKLY
    9/22/2015 4:08:13 PM - granted resource caharriss..NBU_CLIENT.MAXJOBS.Act2
    9/22/2015 4:08:13 PM - granted resource caharrisstlap01.ca.state.sbu.NBU_POLICY.MAXJOBS.CANSSSTORE2_WEEKLY
    9/22/2015 4:08:13 PM - granted resource 000033
    9/22/2015 4:08:13 PM - granted resource HP.ULTRIUM4-SCSI.001
    9/22/2015 4:08:13 PM - granted resource caharriss-hcart-robot-tld-0
    9/22/2015 4:08:13 PM - estimated 0 Kbytes needed
    9/22/2015 4:08:13 PM - Info nbjm(pid=3760) started backup job for client Act2, policy CANSSSTORE2_WEEKLY, schedule Full on storage unit caharrisstlap02-hcart-robot-tld-0
    9/22/2015 4:08:14 PM - started process bpbrm (3508)
    9/22/2015 4:08:16 PM - Info bpbrm(pid=3508) Act2 is the host to backup data from    
    9/22/2015 4:08:21 PM - Info bpbrm(pid=3508) reading file list from client       
    9/22/2015 4:08:21 PM - connecting
    9/22/2015 4:08:24 PM - Info bpbrm(pid=3508) starting bpbkar32 on client        
    9/22/2015 4:08:24 PM - connected; connect time: 00:00:03
    9/22/2015 4:08:27 PM - Info bpbkar32(pid=4504) Backup started          
    9/22/2015 4:08:27 PM - Info bptm(pid=3924) start           
    9/22/2015 4:08:27 PM - Info bptm(pid=3924) using 65536 data buffer size       
    9/22/2015 4:08:27 PM - Info bptm(pid=3924) setting receive network buffer to 263168 bytes     
    9/22/2015 4:08:27 PM - Info bptm(pid=3924) using 30 data buffers        
    9/22/2015 4:08:28 PM - Info bptm(pid=3924) start backup          
    9/22/2015 4:08:28 PM - Info bptm(pid=3924) backup child process is pid 4776.920      
    9/22/2015 4:08:28 PM - Info bptm(pid=3924) Waiting for mount of media id 000033 (copy 1) on server caharrisstlap02.ca.state.sbu.
    9/22/2015 4:08:28 PM - Info bptm(pid=4776) start           
    9/22/2015 4:08:28 PM - mounting 000033
    9/22/2015 4:08:30 PM - Error bptm(pid=3924) error requesting media, TpErrno = Robot operation failed    
    9/22/2015 4:08:30 PM - Warning bptm(pid=3924) media id 000033 load operation reported an error    
    9/22/2015 4:08:30 PM - current media 000033 complete, requesting next resource Any
    9/22/2015 4:08:34 PM - Error bpbrm(pid=3508) socket read failed, An existing connection was forcibly closed by the remote host.  (10054)
    9/22/2015 4:08:34 PM - Error bptm(pid=4776) socket operation failed - 10054 (at child.c.1293)     
    9/22/2015 4:08:35 PM - Error bptm(pid=4776) unable to perform read from client socket, connection may have been broken
    9/22/2015 4:08:36 PM - Error bpbrm(pid=3508) could not send server status message      
    9/22/2015 4:08:36 PM - granted resource 000027
    9/22/2015 4:08:36 PM - granted resource HP.ULTRIUM4-SCSI.000
    9/22/2015 4:08:36 PM - granted resource caharrisstlap02-hcart-robot-tld-0
    9/22/2015 4:08:38 PM - end writing
    file read failed(13)

  • Hello,

    I found the issue with the tape and fixed it, but it did not resolve the fail with status 13. Here is the activity details:

    9/23/2015 1:51:47 PM - Info nbjm(pid=3760) starting backup job (jobid=2958) for client Act2, policy CANSSSTORE2_WEEKLY, schedule Full 
    9/23/2015 1:51:47 PM - Info nbjm(pid=3760) requesting STANDARD_RESOURCE resources from RB for backup job (jobid=2958, request id:{E27F8AEA-EB9C-47C9-9A28-2D217D710453}) 
    9/23/2015 1:51:47 PM - requesting resource caha-hcart-robot-tld-0
    9/23/2015 1:51:47 PM - requesting resource cah.NBU_CLIENT.MAXJOBS.Act2
    9/23/2015 1:51:47 PM - requesting resource cah.NBU_POLICY.MAXJOBS.CANSSSTORE2_WEEKLY
    9/23/2015 1:51:47 PM - granted resource cah.NBU_CLIENT.MAXJOBS.Act2
    9/23/2015 1:51:47 PM - granted resource caha.NBU_POLICY.MAXJOBS.CANSSSTORE2_WEEKLY
    9/23/2015 1:51:47 PM - granted resource 000033
    9/23/2015 1:51:47 PM - granted resource HP.ULTRIUM4-SCSI.001
    9/23/2015 1:51:47 PM - granted resource cah-hcart-robot-tld-0
    9/23/2015 1:51:47 PM - estimated 0 Kbytes needed
    9/23/2015 1:51:47 PM - Info nbjm(pid=3760) started backup job for client Act2, policy CANSSSTORE2_WEEKLY, schedule Full on storage unit caharrisstlap02-hcart-robot-tld-0
    9/23/2015 1:51:49 PM - started process bpbrm (676)
    9/23/2015 1:51:51 PM - Info bpbrm(pid=676) Act2 is the host to backup data from    
    9/23/2015 1:51:55 PM - Info bpbrm(pid=676) reading file list from client       
    9/23/2015 1:51:55 PM - connecting
    9/23/2015 1:51:58 PM - Info bpbrm(pid=676) starting bpbkar32 on client        
    9/23/2015 1:51:59 PM - connected; connect time: 00:00:04
    9/23/2015 1:52:00 PM - Info bpbkar32(pid=4004) Backup started          
    9/23/2015 1:52:00 PM - Info bptm(pid=3008) start           
    9/23/2015 1:52:01 PM - Info bptm(pid=3008) using 65536 data buffer size       
    9/23/2015 1:52:01 PM - Info bptm(pid=3008) setting receive network buffer to 263168 bytes     
    9/23/2015 1:52:01 PM - Info bptm(pid=3008) using 30 data buffers        
    9/23/2015 1:52:01 PM - Info bptm(pid=3008) start backup          
    9/23/2015 1:52:01 PM - Info bptm(pid=3008) backup child process is pid 3184.3164      
    9/23/2015 1:52:01 PM - Info bptm(pid=3008) Waiting for mount of media id 000033 (copy 1) on server caha.
    9/23/2015 1:52:01 PM - Info bptm(pid=3184) start           
    9/23/2015 1:52:01 PM - mounting 000033
    9/23/2015 1:52:06 PM - Error bptm(pid=3184) socket operation failed - 10054 (at child.c.1293)     
    9/23/2015 1:52:06 PM - Error bpbrm(pid=676) socket read failed, An existing connection was forcibly closed by the remote host.  (10054)
    9/23/2015 1:52:07 PM - Error bptm(pid=3184) unable to perform read from client socket, connection may have been broken
    9/23/2015 1:52:07 PM - Error bpbrm(pid=676) could not send server status message      
    9/23/2015 1:52:09 PM - end writing
    file read failed(13)

  • Check network between your media server with the NAS. Make sure there is no packet loss.

    Try to backup a smaller dataset to see if it works.

  • Hi,

    I tried what you suggested and I get the same error. I can access the shares thru windows explorer with the AD domain account, but I get the error when going thru Netbackup.

  • When configuring CIFS usually we can access the shares thru windows explorer with no problem, but that does not mean it's good for backup. 

    As suggested earlier, you will need to get your network team to check the network quality in between. Have you ever contacted Netbackup support? They can help you with running a tool called AppCritial (sas) to check the network quality.