Forum Discussion

MG11
9 years ago

WAS backup failing intermittently with error 13

Hi All,

I know this error is common, but I have tried most of the available workarounds with no luck.

The backup fails with the error below on the NetBackup server:
06/15/2016 10:04:52 - Info bptm (pid=2227) media id 4086L6 mounted on drive index 0, drivepath /dev/rmt/1cbn, drivename HP.ULTRIUM6-SCSI.000, copy 1 
06/15/2016 10:04:52 - mounted 4086L6; mount time: 0:01:07 
06/15/2016 10:04:52 - positioning 4086L6 to file 1791 
06/15/2016 10:05:11 - positioned 4086L6; position time: 0:00:19 
06/15/2016 10:05:11 - begin writing 
06/15/2016 12:05:44 - Error bpbrm (pid=1809) socket read failed: errno = 131 - Connection reset by peer 
06/15/2016 12:05:46 - Error bptm (pid=2227) media manager terminated by parent process 
06/15/2016 12:06:24 - Error bpbrm (pid=1809) could not send server status message 
06/15/2016 12:06:25 - Info bpbkar (pid=7096) done. status: 13: file read failed 
06/15/2016 12:06:25 - end writing; write time: 2:01:14 
file read failed (13)

Error on the client side:
12:12:12.701 [5004.7812] <2> TransporterRemote::write[2](): DBG - | An Exception of type [SocketWriteException] has occured at: | Module: @(#) $Source: src/ncf/tfi/lib/TransporterRemote.cpp,v $ $Revision: 1.55 $ , Function: TransporterRemote::write[2](), Line: 338 | Local Address: [::]:0 | Remote Address: [::]:0 | OS Error: 10060 (A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.) | Expected bytes: 131072 | (../TransporterRemote.cpp:338)

12:12:12.701 [5004.7812] <16> tar_tfi::processException:
An Exception of type [SocketWriteException] has occured at:
Module: @(#) $Source: src/ncf/tfi/lib/TransporterRemote.cpp,v $ $Revision: 1.55 $ , Function: TransporterRemote::write[2](), Line: 338
Module: @(#) $Source: src/ncf/tfi/lib/Packer.cpp,v $ $Revision: 1.91 $ , Function: Packer::getBuffer(), Line: 652
Module: tar_tfi::getBuffer, Function: D:\NB\NB_7.6.0.1\src\cl\clientpc\util\tar_tfi.cpp, Line: 311
Local Address: [::]:0
Remote Address: [::]:0
OS Error: 10060 (A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.)

  • At first glance, it looks like a network problem between the master/media server and the client. I would suggest checking your network.

    Start a ping to the client from the master/media server and vice versa, redirect the output to a text file, and then check how many request timeouts (RTOs) you observe overnight.

     

  • Hi pats729, I have already checked that, and it is working fine. Moreover, the backups do not fail consistently: sometimes they succeed and sometimes they fail, for both manual and scheduled runs.

  • You should probably run a more thorough network test with the help of your network admin.

    Or try rebooting the client once.
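
For anyone who wants to run the overnight ping test suggested earlier in the thread, here is a minimal Python sketch of one way to do it. It is not a NetBackup tool, just an illustration. The function names (`ping_once`, `monitor`, `count_timeouts`) and the log line format are made up for this example, and the ping flags assume Windows or Linux-style `ping`; Solaris `ping` takes different arguments, so adjust the command there. Exit codes from `ping` are only a rough signal, so treat the counts as a hint rather than proof.

```python
import datetime
import platform
import subprocess
import time


def ping_once(host, timeout_s=5):
    """Run the system ping once; return True if it exits with status 0."""
    if platform.system() == "Windows":
        cmd = ["ping", "-n", "1", host]
    else:
        # Assumption: Linux-style flags. Solaris ping uses different arguments.
        cmd = ["ping", "-c", "1", host]
    try:
        result = subprocess.run(cmd, stdout=subprocess.DEVNULL,
                                stderr=subprocess.DEVNULL, timeout=timeout_s)
    except (subprocess.TimeoutExpired, OSError):
        # No answer within timeout_s, or ping could not be started.
        return False
    return result.returncode == 0


def monitor(host, count, interval_s=1.0, probe=ping_once, log_path=None):
    """Probe `host` `count` times; return a list of (timestamp, ok) tuples."""
    results = []
    for i in range(count):
        ok = probe(host)
        stamp = datetime.datetime.now().isoformat(timespec="seconds")
        results.append((stamp, ok))
        if log_path is not None:
            # Append one timestamped line per probe so failures can be
            # lined up against the backup job log afterwards.
            with open(log_path, "a") as log:
                log.write(f"{stamp} {host} {'OK' if ok else 'TIMEOUT'}\n")
        if i < count - 1:
            time.sleep(interval_s)
    return results


def count_timeouts(results):
    """Number of probes that did not get a reply."""
    return sum(1 for _, ok in results if not ok)
```

A possible way to use it, with a placeholder hostname: run `results = monitor("client01.example.com", count=3600, interval_s=10, log_path="ping_client01.log")` on the master/media server and the same thing on the client pointing back at the server, then compare `count_timeouts(results)` and the timestamps in the log file with the times the backup failed.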