cancel
Showing results for 
Search instead for 
Did you mean: 

24 - Socket write failed

ivanlmj
Not applicable

Hi, everyone!

 

I'm having a problem with an specific server wich I'm trying to backup its data. Here are some described information about one of its backup jobs.

....

....
08/22/2013 18:10:45 - granted resource  STU_MEDIA01_LTO_SL01
08/22/2013 18:10:45 - estimated 4572 kbytes needed
08/22/2013 18:10:45 - Info nbjm (pid=5777) started backup job for client code.tpn.terra.com, policy FS-Arqportal-code, schedule Cumulative-Inc on storage unit STU_MEDIA01_LTO_SL01
08/22/2013 18:10:45 - started process bpbrm (pid=11855)
08/22/2013 18:10:46 - Info bpbrm (pid=11855) code.tpn.terra.com is the host to backup data from
08/22/2013 18:10:46 - Info bpbrm (pid=11855) telling media manager to start backup on client
08/22/2013 18:10:46 - Info bptm (pid=11858) using 1048576 data buffer size
08/22/2013 18:10:46 - Info bptm (pid=11858) using 32 data buffers
08/22/2013 18:10:46 - Info bptm (pid=11858) start backup
08/22/2013 18:10:46 - Info bptm (pid=11858) Waiting for mount of media id BL0393 (copy 1) on server bkp-media01-poa.terra.com.br.
08/22/2013 18:10:46 - Info bpbrm (pid=11855) spawning a brm child process
08/22/2013 18:10:46 - Info bpbrm (pid=11855) child pid: 11860
08/22/2013 18:10:46 - mounting BL0393
08/22/2013 18:10:47 - Info bpbrm (pid=11855) sending bpsched msg: CONNECTING TO CLIENT FOR code.tpn.terra.com_1377205845
08/22/2013 18:10:47 - connecting
08/22/2013 18:10:48 - Info bpbrm (pid=11855) start bpbkar on client
08/22/2013 18:10:48 - Info bpbkar (pid=24856) Backup started
08/22/2013 18:10:48 - Info bpbrm (pid=11855) Sending the file list to the client
08/22/2013 18:10:48 - Info bptm (pid=11859) setting receive network buffer to 262144 bytes
08/22/2013 18:10:48 - connected; connect time: 0:00:00
08/22/2013 18:11:18 - Info bptm (pid=11858) media id BL0393 mounted on drive index 21, drivepath /dev/nst0, drivename SL3000_01_DRV02, copy 1
08/22/2013 18:11:18 - mounted BL0393; mount time: 0:00:32
08/22/2013 18:11:18 - positioning BL0393 to file 1595
08/22/2013 18:11:47 - positioned BL0393; position time: 0:00:29
08/22/2013 18:11:47 - begin writing
08/23/2013 04:11:47 - Error bpbrm (pid=11860) from client code.tpn.terra.com: ERR - Cannot write to STDOUT. Errno = 110: Connection timed out
08/23/2013 04:11:47 - Info bpbrm (pid=11855) sending message to media manager: STOP BACKUP code.tpn.terra.com_1377205845
08/23/2013 04:11:57 - Info bpbrm (pid=11855) media manager for backup id code.tpn.terra.com_1377205845 exited with status 150: termination requested by administrator
08/23/2013 04:11:57 - end writing; write time: 10:00:10
socket write failed  (24)

 

This situation is ocurring since last week and I steel can't figure what's going on.

I've made some tests from the master server with "bptesbpcd" and some telnet connections to 13782 and 13724, and everything is ok. Also, it seems that there's is no problema with the media servers too, because this situation is ocurring with all of ourr media servers, and furthermore, it's just ocurring with this server.

The policy's server has 3 selections configured, and it seems that just one of its selections is not possible to backup.

/root - ok

/home/git - ERR 24

/etc - ok

 

Also, the schedules from the policy are configured to "Allow multiple data streams" and Media Multiplexing "4".

 

Does anyone have any idea about what is the root cause of this problem? Is that possible that there's something wrong with the filesystem EXT4 of /home/git?

 

Thanks a lot, Symantec :).

Ivan Leon

1 REPLY 1

Nicolai
Moderator
Moderator
Partner    VIP   

Try to adjust the CLIENT_READ_TIMEOUT first to 1800

http://www.symantec.com/docs/HOWTO33199

If directory /home/git contain thousand of files it may simple be a processing issue. Alternative clean up /home/git for unwanted files.

Best Regards

Nicolai