Master media communication interruption during duplication to tape
Hi,
I 'm having an issue in our netbackup environment when a duplication job to tape is running on the media server and a network interruption happens between the master and the media server. I will try to explain.
We have a master server (7.7.3) on windows 2008 R2 in our main datacenter and a media server (7.7.3) also on Windows 2008 R2 in our second domain center. There is a tape robot directly connected to the media server. Both centers are on the same company domain so there is 40GB lan connection between both centers.
I'm using an SLP job to duplicate images between both centers and put a final copy on tape. The SLP job has 4 stages. The first one is backup to Puredisk volume on the master server, then we duplicate this image to an other Puredisk volume on the media server in the second data center. Then a rehydratation job occurs to an advanced disk on this media server and finally this image is put on tape.
When a duplication job is running as final stage of a SLP (Copy to tape from advanced disk pool) on the media server and there is a network interruption of a few seconds between the master and the media server. The duplication job errors, the bptm processes keeps running, the tape gets stuck in the robot,it's sitting idle and does nothing anymore and the duplication job gets in a loop aksing for a new tape. When this happens, I have to manually put the tape back in his slot, stop the bptm and bpdm process of that job before any other duplication job can start.
My question, why is the job interrupted when this network hickup happens? The job is running on the media server but when the master looses it's "view" of the processes on the media server for a few seconds every duplication job to tape gets stuck. Is there a way to prevent this?
Regards,
Erwin