Reboot of MS Server 2008 R2 Standard while the backup runs makes the backup job stall
Hi,
I run NetBackup 7.0.1, and the 7.0.1 backup client is installed on the servers.
I have some MS Server 2008 R2 Standard servers, and the daily backup runs at 22:00 hours.
When Windows Update releases new updates, they are installed at 03:00 and the server reboots afterwards.
When the server is online again, the bpbkar32.exe process runs on the server and uses less than 10% CPU.
But in the NetBackup Administration Console, the job for the server shows something like the following properties:
Percent Complete: 1%
51298 minutes remaining.
Normally the server backup completes in less than 6 hours, but when the server reboots, it seems that it never completes.
If I terminate the bpbkar32.exe process, I get a file read failed (13):
15-08-2012 23:52:26 - started
16-08-2012 05:28:47 - requesting resource nbp_msdp1
16-08-2012 05:28:47 - requesting resource nbp.silk.local.NBU_CLIENT.MAXJOBS.al1.silk.dk
16-08-2012 05:28:47 - requesting resource nbp.silk.local.NBU_POLICY.MAXJOBS.Windows
16-08-2012 05:28:47 - granted resource nbp.silk.local.NBU_CLIENT.MAXJOBS. al1.silk.dk
16-08-2012 05:28:47 - granted resource nbp.silk.local.NBU_POLICY.MAXJOBS.Windows
16-08-2012 05:28:47 - granted resource MediaID=@aaaay;DiskVolume=PureDiskVolume;iskPool=nbp_ms1;Path=PureDiskVolume;StorageServer=nbp.silk.local;MediaServer=nbp.silk.local
16-08-2012 05:28:47 - granted resource nbp_msdp1
16-08-2012 05:28:47 - estimated 14309874 Kbytes needed
16-08-2012 05:28:48 - started process bpbrm (7428)
16-08-2012 05:28:53 - connecting
16-08-2012 05:28:56 - connected; connect time: 00:00:03
16-08-2012 05:29:05 - begin writing
16-08-2012 08:17:03 - Error bpbrm(pid=7428) socket read failed, An existing connection was forcibly closed by the remote host. (10054)
16-08-2012 08:17:03 - Error bptm(pid=9092) socket operation failed - 10054 (at child.c.1294)
16-08-2012 08:17:04 - Error bptm(pid=9092) unable to perform read from client socket, connection may have been broken
16-08-2012 08:17:10 - Info nbp.silk.local(pid=5056) StorageServer=PureDisk:nbp.silk.local; Report=PDDO Stats for (nbp.silk.local): scanned: 528399 KB, stream rate: 159.31 MB/sec, CR sent: 10469 KB, dedup: 98.0%, cache hits: 2209 (52.3%)
16-08-2012 08:17:11 - Error bpbrm(pid=7428) could not send server status message
16-08-2012 08:17:13 - end writing; write time: 02:48:08
file read failed(13)
Is it true that the job cannot complete if the server is rebooted while the backup job is running on it?
Kind regards,
Carl-Marius
Perfectly normal. NetBackup will appear to be 'hanging' for the duration of the 'Client Read Timeout'.
NetBackup is REPORTING the error here: Broken Connection.
unable to perform read from client socket, connection may have been broken
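If you want to spot this condition without reading the job details by eye, here is a minimal sketch in Python. It assumes the job details have been saved as plain text in the same "DD-MM-YYYY HH:MM:SS - message" layout as the log posted above. The function name find_broken_connection is made up for illustration and is not part of NetBackup.

```python
import re
from datetime import datetime

# Matches job-detail lines in the layout shown in the question, e.g.
# "16-08-2012 08:17:03 - Error bpbrm(pid=7428) socket read failed, ... (10054)"
LINE_RE = re.compile(r"^(\d{2}-\d{2}-\d{4} \d{2}:\d{2}:\d{2}) - (.*)$")


def find_broken_connection(lines):
    """Return (timestamp, message) for every Error line that mentions 10054.

    Hypothetical helper: 10054 is the Windows socket error for a connection
    that was forcibly closed by the remote host.
    """
    hits = []
    for line in lines:
        match = LINE_RE.match(line.strip())
        if not match:
            continue  # skip lines without a timestamp, e.g. "file read failed(13)"
        stamp, message = match.groups()
        if message.startswith("Error") and "10054" in message:
            when = datetime.strptime(stamp, "%d-%m-%Y %H:%M:%S")
            hits.append((when, message))
    return hits


# Lines copied from the job details in the question.
sample = [
    "16-08-2012 05:29:05 - begin writing",
    "16-08-2012 08:17:03 - Error bpbrm(pid=7428) socket read failed, "
    "An existing connection was forcibly closed by the remote host. (10054)",
    "16-08-2012 08:17:03 - Error bptm(pid=9092) socket operation failed - 10054 (at child.c.1294)",
    "16-08-2012 08:17:13 - end writing; write time: 02:48:08",
    "file read failed(13)",
]

hits = find_broken_connection(sample)
for when, message in hits:
    print(when, message)
```

For the sample above this reports the two 08:17:03 error lines from bpbrm and bptm.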
Depending on the retry parameters in the Master Server configuration, the backup may be retried afterwards.
Best to disable automatic updates in a production environment.
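If the updates have to stay enabled, it is worth checking whether the install time falls inside the backup window. A rough sketch in Python, using only the times stated in the question (backup starts at 22:00, normally finishes in under 6 hours, updates install at 03:00). The calendar dates are placeholders and the function name reboot_hits_backup is invented for this example.

```python
from datetime import datetime, timedelta


def reboot_hits_backup(backup_start, backup_duration, reboot_time):
    """True if reboot_time falls inside [backup_start, backup_start + backup_duration)."""
    return backup_start <= reboot_time < backup_start + backup_duration


# Times from the question; the dates themselves are only placeholders.
backup_start = datetime(2012, 8, 15, 22, 0)   # daily backup at 22:00
worst_case = timedelta(hours=6)               # "less than 6 hours"
reboot = datetime(2012, 8, 16, 3, 0)          # updates install at 03:00, then reboot

overlaps = reboot_hits_backup(backup_start, worst_case, reboot)
print(overlaps)  # True: 03:00 is before 22:00 + 6 h = 04:00
```

With these numbers the reboot can land inside the backup, so moving the update install time after the backup window is an alternative to disabling automatic updates altogether.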