socket operation failed - 10054 (at child.c.1293)

CamaroSS
Level 3

Hello,

I need some help. I am trying to back up CIFS shares. I have created the Windows policy, selected a client as the host, entered the UNC path, created an AD account with rights to the CIFS shares, and started the NetBackup client service with it. When I start the job, I get the error "socket operation failed - 10054 (at child.c.1293)". It also says "file read failed (13)".

Can anyone see what I'm doing wrong?

thanks

9 Replies

watsons
Level 6

How do you specify your UNC path? 

Check this out: http://www.symantec.com/docs/TECH198175

Marianne
Moderator

Is NBU Client Service on the client started with the domain account?

After how long does it fail?

Is any data actually transferred?

Please show us all text in Activity Monitor Details tab as well as policy config:

bppllist <policy-name> -U

You may want to ask your network team to monitor comms between NAS -> Client -> Media server.

CamaroSS
Level 3

Hi,

thank you.

In the policy, I checked that the client is selected. The NetBackup Client Service was restarted with the AD domain account. In the backup selections, I entered the path of the CIFS share: \\SANname\admin_shares$\snapshots\folder name

Marianne
Moderator
It seems this is the actual problem:

Waiting for mount of media id 000033 (copy 1) on server caharrisstlap02.ca.state.sbu.
9/22/2015 4:08:28 PM - Info bptm(pid=4776) start
9/22/2015 4:08:28 PM - mounting 000033
9/22/2015 4:08:30 PM - Error bptm(pid=3924) error requesting media, TpErrno = Robot operation failed

You need to troubleshoot this tape mount issue first to see whether it is what causes the rest of the processes to fail with status 13.
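To make this kind of reading repeatable, here is a minimal sketch that pulls the Warning and Error lines out of pasted Activity Monitor details in log order, so the earliest failure is easy to spot. The regular expression is an assumption based only on the line format shown in this thread, and `problems` and `SAMPLE` are illustrative names, not part of NetBackup.

```python
import re
from datetime import datetime

# Assumed line shape, taken from the job details pasted in this thread:
# 9/22/2015 4:08:30 PM - Error bptm(pid=3924) error requesting media, ...
LINE_RE = re.compile(
    r"^(?P<ts>\d{1,2}/\d{1,2}/\d{4} \d{1,2}:\d{2}:\d{2} [AP]M) - "
    r"(?P<sev>Info|Warning|Error) (?P<proc>\w+)\(pid=(?P<pid>\d+)\) (?P<msg>.*)$"
)


def problems(job_details):
    """Return (timestamp, severity, process, message) for each
    Warning/Error line, in the order the lines appear."""
    found = []
    for line in job_details.splitlines():
        m = LINE_RE.match(line.strip())
        if m and m.group("sev") != "Info":
            ts = datetime.strptime(m.group("ts"), "%m/%d/%Y %I:%M:%S %p")
            found.append((ts, m.group("sev"), m.group("proc"), m.group("msg").strip()))
    return found


# A few lines copied from the job details posted in this thread.
SAMPLE = """\
9/22/2015 4:08:28 PM - mounting 000033
9/22/2015 4:08:30 PM - Error bptm(pid=3924) error requesting media, TpErrno = Robot operation failed
9/22/2015 4:08:34 PM - Error bpbrm(pid=3508) socket read failed, An existing connection was forcibly closed by the remote host.  (10054)
9/22/2015 4:08:34 PM - Error bptm(pid=4776) socket operation failed - 10054 (at child.c.1293)
"""

if __name__ == "__main__":
    for ts, sev, proc, msg in problems(SAMPLE):
        print(ts.time(), sev, proc, msg)
```

The first row returned is the robot/media error, which comes before both 10054 socket errors. That ordering is the reason to look at the tape mount first.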

CamaroSS
Level 3

Thank you. To answer your questions:

Yes, the client service is started with the domain account.

It fails right away after the job starts.

No data is being transferred.

In the backup policy, I have selected "Backup network drives".

The activity monitor is below:

9/22/2015 4:08:12 PM - Info nbjm(pid=3760) starting backup job (jobid=2943) for client Act2, policy CANSSSTORE2_WEEKLY, schedule Full 
9/22/2015 4:08:12 PM - Info nbjm(pid=3760) requesting STANDARD_RESOURCE resources from RB for backup job (jobid=2943, request id:{2C9067C3-002D-4497-AC33-8177BF3DBB9B}) 
9/22/2015 4:08:12 PM - requesting resource caharriss2-hcart-robot-tld-0
9/22/2015 4:08:12 PM - requesting resource caharris..NBU_CLIENT.MAXJOBS.Act2
9/22/2015 4:08:12 PM - requesting resource caharrisstlap01.ca.state.sbu.NBU_POLICY.MAXJOBS.CANSSSTORE2_WEEKLY
9/22/2015 4:08:13 PM - granted resource caharriss..NBU_CLIENT.MAXJOBS.Act2
9/22/2015 4:08:13 PM - granted resource caharrisstlap01.ca.state.sbu.NBU_POLICY.MAXJOBS.CANSSSTORE2_WEEKLY
9/22/2015 4:08:13 PM - granted resource 000033
9/22/2015 4:08:13 PM - granted resource HP.ULTRIUM4-SCSI.001
9/22/2015 4:08:13 PM - granted resource caharriss-hcart-robot-tld-0
9/22/2015 4:08:13 PM - estimated 0 Kbytes needed
9/22/2015 4:08:13 PM - Info nbjm(pid=3760) started backup job for client Act2, policy CANSSSTORE2_WEEKLY, schedule Full on storage unit caharrisstlap02-hcart-robot-tld-0
9/22/2015 4:08:14 PM - started process bpbrm (3508)
9/22/2015 4:08:16 PM - Info bpbrm(pid=3508) Act2 is the host to backup data from    
9/22/2015 4:08:21 PM - Info bpbrm(pid=3508) reading file list from client       
9/22/2015 4:08:21 PM - connecting
9/22/2015 4:08:24 PM - Info bpbrm(pid=3508) starting bpbkar32 on client        
9/22/2015 4:08:24 PM - connected; connect time: 00:00:03
9/22/2015 4:08:27 PM - Info bpbkar32(pid=4504) Backup started          
9/22/2015 4:08:27 PM - Info bptm(pid=3924) start           
9/22/2015 4:08:27 PM - Info bptm(pid=3924) using 65536 data buffer size       
9/22/2015 4:08:27 PM - Info bptm(pid=3924) setting receive network buffer to 263168 bytes     
9/22/2015 4:08:27 PM - Info bptm(pid=3924) using 30 data buffers        
9/22/2015 4:08:28 PM - Info bptm(pid=3924) start backup          
9/22/2015 4:08:28 PM - Info bptm(pid=3924) backup child process is pid 4776.920      
9/22/2015 4:08:28 PM - Info bptm(pid=3924) Waiting for mount of media id 000033 (copy 1) on server caharrisstlap02.ca.state.sbu.
9/22/2015 4:08:28 PM - Info bptm(pid=4776) start           
9/22/2015 4:08:28 PM - mounting 000033
9/22/2015 4:08:30 PM - Error bptm(pid=3924) error requesting media, TpErrno = Robot operation failed    
9/22/2015 4:08:30 PM - Warning bptm(pid=3924) media id 000033 load operation reported an error    
9/22/2015 4:08:30 PM - current media 000033 complete, requesting next resource Any
9/22/2015 4:08:34 PM - Error bpbrm(pid=3508) socket read failed, An existing connection was forcibly closed by the remote host.  (10054)
9/22/2015 4:08:34 PM - Error bptm(pid=4776) socket operation failed - 10054 (at child.c.1293)     
9/22/2015 4:08:35 PM - Error bptm(pid=4776) unable to perform read from client socket, connection may have been broken
9/22/2015 4:08:36 PM - Error bpbrm(pid=3508) could not send server status message      
9/22/2015 4:08:36 PM - granted resource 000027
9/22/2015 4:08:36 PM - granted resource HP.ULTRIUM4-SCSI.000
9/22/2015 4:08:36 PM - granted resource caharrisstlap02-hcart-robot-tld-0
9/22/2015 4:08:38 PM - end writing
file read failed(13)

CamaroSS
Level 3

Hello,

I found the issue with the tape and fixed it, but that did not resolve the failure with status 13. Here are the activity details:

9/23/2015 1:51:47 PM - Info nbjm(pid=3760) starting backup job (jobid=2958) for client Act2, policy CANSSSTORE2_WEEKLY, schedule Full 
9/23/2015 1:51:47 PM - Info nbjm(pid=3760) requesting STANDARD_RESOURCE resources from RB for backup job (jobid=2958, request id:{E27F8AEA-EB9C-47C9-9A28-2D217D710453}) 
9/23/2015 1:51:47 PM - requesting resource caha-hcart-robot-tld-0
9/23/2015 1:51:47 PM - requesting resource cah.NBU_CLIENT.MAXJOBS.Act2
9/23/2015 1:51:47 PM - requesting resource cah.NBU_POLICY.MAXJOBS.CANSSSTORE2_WEEKLY
9/23/2015 1:51:47 PM - granted resource cah.NBU_CLIENT.MAXJOBS.Act2
9/23/2015 1:51:47 PM - granted resource caha.NBU_POLICY.MAXJOBS.CANSSSTORE2_WEEKLY
9/23/2015 1:51:47 PM - granted resource 000033
9/23/2015 1:51:47 PM - granted resource HP.ULTRIUM4-SCSI.001
9/23/2015 1:51:47 PM - granted resource cah-hcart-robot-tld-0
9/23/2015 1:51:47 PM - estimated 0 Kbytes needed
9/23/2015 1:51:47 PM - Info nbjm(pid=3760) started backup job for client Act2, policy CANSSSTORE2_WEEKLY, schedule Full on storage unit caharrisstlap02-hcart-robot-tld-0
9/23/2015 1:51:49 PM - started process bpbrm (676)
9/23/2015 1:51:51 PM - Info bpbrm(pid=676) Act2 is the host to backup data from    
9/23/2015 1:51:55 PM - Info bpbrm(pid=676) reading file list from client       
9/23/2015 1:51:55 PM - connecting
9/23/2015 1:51:58 PM - Info bpbrm(pid=676) starting bpbkar32 on client        
9/23/2015 1:51:59 PM - connected; connect time: 00:00:04
9/23/2015 1:52:00 PM - Info bpbkar32(pid=4004) Backup started          
9/23/2015 1:52:00 PM - Info bptm(pid=3008) start           
9/23/2015 1:52:01 PM - Info bptm(pid=3008) using 65536 data buffer size       
9/23/2015 1:52:01 PM - Info bptm(pid=3008) setting receive network buffer to 263168 bytes     
9/23/2015 1:52:01 PM - Info bptm(pid=3008) using 30 data buffers        
9/23/2015 1:52:01 PM - Info bptm(pid=3008) start backup          
9/23/2015 1:52:01 PM - Info bptm(pid=3008) backup child process is pid 3184.3164      
9/23/2015 1:52:01 PM - Info bptm(pid=3008) Waiting for mount of media id 000033 (copy 1) on server caha.
9/23/2015 1:52:01 PM - Info bptm(pid=3184) start           
9/23/2015 1:52:01 PM - mounting 000033
9/23/2015 1:52:06 PM - Error bptm(pid=3184) socket operation failed - 10054 (at child.c.1293)     
9/23/2015 1:52:06 PM - Error bpbrm(pid=676) socket read failed, An existing connection was forcibly closed by the remote host.  (10054)
9/23/2015 1:52:07 PM - Error bptm(pid=3184) unable to perform read from client socket, connection may have been broken
9/23/2015 1:52:07 PM - Error bpbrm(pid=676) could not send server status message      
9/23/2015 1:52:09 PM - end writing
file read failed(13)

watsons
Level 6

Check the network between your media server and the NAS. Make sure there is no packet loss.

Try to back up a smaller dataset to see if it works.
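As a rough first check before involving the network team, a sketch like the one below repeatedly opens a TCP connection to the NAS on port 445 (the standard SMB/CIFS port) and counts failures. This only tests reachability. It does not measure packet loss and is no substitute for a proper network trace. The `probe` function is a hypothetical helper, and the host name is whatever you pass on the command line.

```python
import socket
import sys


def probe(host, port=445, attempts=5, timeout=3.0):
    """Try `attempts` TCP connections to host:port.

    Returns (successes, failures), where failures is a list of the
    error messages from the attempts that did not connect.
    """
    ok = 0
    failures = []
    for _ in range(attempts):
        try:
            # The connection is closed again as soon as it is established.
            with socket.create_connection((host, port), timeout=timeout):
                ok += 1
        except OSError as exc:  # refused, timed out, name not resolved, ...
            failures.append(str(exc))
    return ok, failures


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python probe.py <nas-hostname>
    ok, failures = probe(sys.argv[1])
    print(f"{ok} succeeded, {len(failures)} failed")
    for message in failures:
        print("  ", message)
```

Run it from the NetBackup client that reads the share, under the same domain account if possible, since that is the machine that actually talks to the NAS.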

CamaroSS
Level 3

Hi,

I tried what you suggested and I get the same error. I can access the shares through Windows Explorer with the AD domain account, but I get the error when going through NetBackup.

watsons
Level 6

When configuring CIFS, we can usually access the shares through Windows Explorer with no problem, but that does not mean the connection is good enough for backup.

As suggested earlier, you will need to get your network team to check the network quality in between. Have you contacted NetBackup support? They can help you run a tool called AppCritical (sas) to check the network quality.