NetBackup for VMware error 13
Hi All,
Our master server runs RHEL 6.1 with NB 7.7.1, and the media server runs Windows 2008 with NB 7.7.1. VMware is 5.5 (the client is RHEL 5.6). We have a couple of hundred VMware clients that were running fine. A little more than a week ago some of the clients began to fail, while others in the same folder still run fine. The failures affect less than 1% of the clients, but they still have to be resolved.
During the error 13 failure some files had apparently already been backed up successfully (?), judging by the log:
02/15/2016 11:36:05 - Info bpbkar32 (pid=8236) 285000 entries sent to bpdbm
02/15/2016 11:36:11 - Info bpbkar32 (pid=8236) 301604 entries sent to bpdbm
02/15/2016 11:36:11 - Info bpbkar32 (pid=8236) 301605 entries sent to bpdbm
02/15/2016 11:36:15 - Error bpbrm (pid=8132) socket read failed, An existing connection was forcibly closed by the remote host. (10054)
02/15/2016 11:36:23 - Error bpbrm (pid=8132) could not send server status message
02/15/2016 11:36:25 - Info bpbkar32 (pid=0) done. status: 13: file read failed
02/15/2016 11:36:25 - end writing; write time: 0:01:44
file read failed (13)
(It fails after "Current File Written" reaches 148817. The odd thing is that after I consolidated the snapshot and reran the job, it failed at exactly 148817 again, and again on a third attempt. I don't know if that means something.)
I checked /var/log/hostd.log on the ESX server.
There are some errors:
2016-02-15T03:34:12.993Z [3B580B70 verbose 'Vmsvc.vm:/vmfs/volumes/561361d9-cf3232d4-10da-0017a4770404/vascasms01s/vascasms01s.vmx' opID=4978794e-b6 user=vpxuser] Create Snapshot: NBU_SNAPSHOT ebs5-bck 1455507248, memory=false, quiescent=true state=4
2016-02-15T03:34:12.993Z [3B580B70 info 'Vmsvc.vm:/vmfs/volumes/561361d9-cf3232d4-10da-0017a4770404/vascasms01s/vascasms01s.vmx' opID=4978794e-b6 user=vpxuser] State Transition (VM_STATE_ON -> VM_STATE_CREATE_SNAPSHOT)
2016-02-15T03:34:15.070Z [3B580B70 info 'Vimsvc.ha-eventmgr'] Event 73292 : The dvPort 68 link was down in the vSphere Distributed Switch in ha-datacenter
2016-02-15T03:34:15.071Z [3B580B70 info 'Vimsvc.ha-eventmgr'] Event 73293 : The dvPort 68 was not in passthrough mode in the vSphere Distributed Switch in ha-datacenter.
2016-02-15T03:34:15.507Z [39DC2B70 info 'Vimsvc.ha-eventmgr'] Event 73294 : The dvPort 68 was not in passthrough mode in the vSphere Distributed Switch in ha-datacenter.
2016-02-15T03:34:15.508Z [39DC2B70 info 'Vimsvc.ha-eventmgr'] Event 73295 : The dvPort 68 was unblocked in the vSphere Distributed Switch in ha-datacenter.
2016-02-15T03:34:15.509Z [39DC2B70 info 'Vimsvc.ha-eventmgr'] Event 73296 : The dvPort 68 was not in passthrough mode in the vSphere Distributed Switch in ha-datacenter.
2016-02-15T03:34:15.510Z [39DC2B70 info 'Vimsvc.ha-eventmgr'] Event 73297 : The dvPort 68 link was up in the vSphere Distributed Switch in ha-datacenter
2016-02-15T03:34:15.512Z [39DC2B70 info 'Vimsvc.ha-eventmgr'] Event 73298 : The dvPort 68 was not in passthrough mode in the vSphere Distributed Switch in ha-datacenter.
2016-02-15T03:34:16.148Z [FFCEEB70 info 'Hostsvc' opID=hostd-ab43] VsanSystemVmkProvider : GetConfig: Start
2016-02-15T03:34:16.148Z [FFCEEB70 info 'Hostsvc' opID=hostd-ab43] VsanSystemVmkProvider : GetConfig: Complete
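A side note on lining up the two logs: the hostd.log timestamps are in UTC (the trailing Z), while the job details use the media server's local time. The trailing number in the snapshot name "NBU_SNAPSHOT ebs5-bck 1455507248" looks like a Unix epoch timestamp. That is an assumption, not something the logs state, but decoding it gives a time a few seconds before the Create Snapshot entry, which is at least consistent. A minimal Python sketch:

```python
from datetime import datetime, timezone

# Trailing number from the snapshot name in hostd.log.
# Assumption: it is a Unix epoch timestamp (seconds since 1970-01-01 UTC).
snapshot_epoch = 1455507248

snapshot_utc = datetime.fromtimestamp(snapshot_epoch, tz=timezone.utc)
print(snapshot_utc.isoformat())  # 2016-02-15T03:34:08+00:00

# Timestamp of the "Create Snapshot" entry in hostd.log (already UTC).
hostd_entry = datetime(2016, 2, 15, 3, 34, 12, 993000, tzinfo=timezone.utc)

# Seconds between the decoded snapshot name and the hostd entry.
delta_seconds = (hostd_entry - snapshot_utc).total_seconds()
print(delta_seconds)
```

If the assumption holds, the dvPort link down/up events at 03:34:15Z fall within a few seconds of the snapshot being created. The job detail times would need converting to UTC with the media server's offset before comparing them with these entries.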
Does it mean something?
Does anyone know how to solve this?
Regards,
Iwan
@Iwan - it is highly likely that the Winsock errors are an "artefact" of whatever the real problem is. When another process involved in the backup dies unexpectedly, the surviving processes can report Winsock errors because the 'far side' of the TCP conversation has gone away: the process that held the connection no longer exists. IMO, I wouldn't look for TCP issues right now.
So, Iwan, there are plenty of topics in this forum regarding how to capture the logs.
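To treat the Winsock error as a symptom rather than the cause, it can help to see which process reported an error first and what the last normal activity before it was. A rough Python sketch, assuming the job details keep the format shown in the post; the regular expression and the helper name parse_job_details are made up for illustration and are not part of NetBackup:

```python
import re
from datetime import datetime

# Matches lines such as:
# 02/15/2016 11:36:15 - Error bpbrm (pid=8132) socket read failed, ...
LINE_RE = re.compile(
    r"^(\d{2}/\d{2}/\d{4} \d{2}:\d{2}:\d{2}) - (Info|Error|Warning) "
    r"(\S+) \(pid=(\d+)\) (.*)$"
)

def parse_job_details(lines):
    """Return (timestamp, level, process, pid, message) for each matching line."""
    events = []
    for line in lines:
        m = LINE_RE.match(line.strip())
        if not m:
            continue  # e.g. "end writing; write time: ..." has no process/pid
        ts, level, proc, pid, msg = m.groups()
        when = datetime.strptime(ts, "%m/%d/%Y %H:%M:%S")
        events.append((when, level, proc, int(pid), msg))
    return events

# Job detail lines copied from the post.
job_details = [
    "02/15/2016 11:36:05 - Info bpbkar32 (pid=8236) 285000 entries sent to bpdbm",
    "02/15/2016 11:36:11 - Info bpbkar32 (pid=8236) 301604 entries sent to bpdbm",
    "02/15/2016 11:36:11 - Info bpbkar32 (pid=8236) 301605 entries sent to bpdbm",
    "02/15/2016 11:36:15 - Error bpbrm (pid=8132) socket read failed, An existing connection was forcibly closed by the remote host. (10054)",
    "02/15/2016 11:36:23 - Error bpbrm (pid=8132) could not send server status message",
    "02/15/2016 11:36:25 - Info bpbkar32 (pid=0) done. status: 13: file read failed",
    "02/15/2016 11:36:25 - end writing; write time: 0:01:44",
]

events = parse_job_details(job_details)
errors = [e for e in events if e[1] == "Error"]
first_error = errors[0]

# Last Info line logged at or before the first error.
earlier_info = [e for e in events if e[1] == "Info" and e[0] <= first_error[0]]
last_info = earlier_info[-1]
gap_seconds = (first_error[0] - last_info[0]).total_seconds()

print("first error from:", first_error[2], "pid", first_error[3])
print("last activity before it:", last_info[2], "pid", last_info[3])
print("gap in seconds:", gap_seconds)
```

On the lines from the post, the first error comes from bpbrm a few seconds after the last bpbkar32 progress line, which fits the idea that bpbkar32 stopped first and bpbrm only noticed the closed connection. The bpbkar log on the backup host around that time is therefore the more useful place to look.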