06-29-2012 02:06 AM
Hi, we are experiencing a problem with VM backups. In the policy, Existing snapshot handling is set to Remove NBU, but on a few machines the snapshot is still not being deleted. The number of snapshots keeps increasing, and every time we have to delete them manually. This is degrading the performance of the VM and also creating a risk of the VM crashing. Please advise.
06-29-2012 02:08 AM
Query was originally tagged to a Blog.
Have moved it to a new discussion for greater exposure, i.e. you're more likely to get a response here!
Probably a good idea to provide a bit more info regarding your environment.
07-03-2012 03:51 AM
We discussed this with Symantec. The snapshots that are not getting deleted have names of the form consolidate_helper_number, whereas NBU creates its snapshot with the name NBU_Snapshot, so these are not our snapshots. NBU also deletes the last NBU_Snapshot if one is there. In any case, if there is another snapshot with a different name after the NBU_Snapshot, NBU won't be able to delete any of them and backups will fail. In that case we have to delete these snapshots manually.
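The rule described above can be sketched in a few lines of Python. This is only an illustration of the logic as stated in this post, not NetBackup's actual implementation: the function name `blocking_snapshots` and the oldest-first list of snapshot names are assumptions, and in practice the names would have to be collected from vCenter by some other means.

```python
NBU_SNAPSHOT_NAME = "NBU_Snapshot"  # snapshot name NBU uses, per the post above


def blocking_snapshots(snapshot_names):
    """Return the non-NBU snapshots that sit after an NBU_Snapshot.

    snapshot_names is a hypothetical oldest-first list of snapshot names
    for one VM. Per the rule in the post, any snapshot with a different
    name taken after an NBU_Snapshot prevents NBU from cleaning up, so
    those snapshots need manual removal.
    """
    seen_nbu = False
    blockers = []
    for name in snapshot_names:
        if name == NBU_SNAPSHOT_NAME:
            seen_nbu = True
        elif seen_nbu:
            blockers.append(name)
    return blockers


# Illustrative name only; the real suffix varies per VM.
print(blocking_snapshots(["NBU_Snapshot", "consolidate_helper_0"]))
```

An empty result means NBU should be able to remove its own snapshot without manual cleanup, at least according to the rule as Symantec described it to us.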
Detail of environment:
Master: Sun Solaris 10, NBU 7.1.0.2
Media: Sun Solaris 9, NBU 6.5.6
Backup Proxy Host: VM, NBU 7.1.0.4 (recommended by Symantec), Windows 2003
07-03-2012 03:56 AM
Now we are facing EC 156 on one of the VMs. Its backups are failing because NBU is not able to quiesce the VM. When we disable this feature in the policy, backups for this VM run fine, but this practice is not recommended by Symantec.
PFA the bpfis logs; please suggest.
07-03-2012 04:22 AM
Can you clarify:
Your Backup Proxy Host is running 7.1.0.4
Your Master is running prior release 7.1.0.2
If so install 7.1.0.4 on the master ASAP
Then rerun the backup.
Also consider the requirements for hotadd as your transfer method as you have chosen to use a VM as your backup proxy. Please make sure your setup complies with the section in the NetBackup for VMware Guide on Notes on hotadd transfer type.
VMware Transport Modes: Best practices and troubleshooting
http://www.symantec.com/docs/TECH183072
snippet>>>
Troubleshooting for some common transport mode related failures
Backups/Restores failing with status 6 or status 13 or status 11 with following indication in Activity monitor might indicate that there is some issue with transport modes:-
Here are some tips on handling this kind of error:
Also consult VMware's hotadd documentation for background.
http://www.vmware.com/support/developer/vddk/VDDK-1.2.1-Relnotes.html
07-03-2012 04:23 AM
Version 7.1.0.4 was suggested by Symantec because with 7.1.0.2 we were facing issues with restores.
Also, if this is the reason, then I am a little confused, because backups of the rest of the VMs are running fine.
07-03-2012 04:34 AM
Looking at the bpfis log:
I hope the technotes below are helpful to you.
status 156 - snapshot failed - vixapi freeze failed with 36 - vc error: creating quiesced snapshot failed because snapshot operation exceeded timeout limit.
http://www.symantec.com/docs/TECH137001
Vmware backup fails 'unable to quiesce file system'
http://www.symantec.com/docs/TECH145960
156:Receive "snapshot creation failed, status 156" when attempting to backup a VM.
http://www.symantec.com/docs/TECH154889
Not sure if the 3rd one applies in your case, but hopefully the workaround helps.
07-03-2012 04:37 AM
Also found this error: Cannot create a quiesced snapshot because the snapshot operation exceeded the time limit for holding off I/O in the frozen virtual machine
Refer to the VMware technote for this:
07-03-2012 04:38 AM
OK, it's just normal best practice to upgrade the Master first, then the Media Servers, and then the clients, whatever the version + maintenance release. (I may stand corrected in this instance.)
07-03-2012 04:41 AM
I found the above KB as well.
You did not say what these problem VMs are running. It could be a high-transaction application with a DBMS running in it (MS SQL/Oracle/Exchange?). That would mean a high I/O load, so the snapshot cannot commit back.
07-03-2012 04:42 AM
NetBackup Master Server must be at the highest version level within the NetBackup domain
NetBackup Media servers and Clients must be at the same or a lower version compared to the NetBackup master server
This is technically no longer true.
If the master, media and client are all at 7.1.0.x, for example, it doesn't matter if the client's "x" is higher than the servers', so long as they're all at the same minor version (in this case, 7.1).
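As a rough illustration of the "same minor version" rule described in this reply, here is a small Python sketch. The helper names are made up for the example, and it only encodes the rule as stated here, not Symantec's official compatibility matrix.

```python
def minor_release(version):
    """Return (major, minor) from a dotted version string such as '7.1.0.4'."""
    parts = version.split(".")
    return int(parts[0]), int(parts[1])


def same_minor_release(*versions):
    """True if every given version shares the same major.minor release."""
    return len({minor_release(v) for v in versions}) == 1


# Master at 7.1.0.2 and backup proxy host at 7.1.0.4, as in this thread.
print(same_minor_release("7.1.0.2", "7.1.0.4"))
```

Both versions here are 7.1 releases, so the check passes even though the proxy's maintenance level is higher than the master's. A 6.5.6 media server would not pass this particular check, but that says nothing about whether an older media server is supported; the rule in this reply only covers hosts within the same minor release.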
07-03-2012 05:23 AM
Also consider with hot-add:-
VM-based backups impact the host ESX server. For a severely overtaxed ESX environment, physical host-based backups should be considered as an alternative to VM-based backups.
So the ESX server that these specific VMs are on may be being pushed hard in terms of resources and performance.
I recommend using the vCenter performance charts to rule this out while the backup(s) are running.
Try vMotioning the VMs to another 'beefier' ESX host, but keep in mind that the ESX host must still be able to access the datastores that the VM being backed up is on.
07-03-2012 12:00 PM
NetBackup Master Server must be at the highest version level within the NetBackup domain
NetBackup Media servers and Clients must be at the same or a lower version compared to the NetBackup master server
This is technically no longer true.
If the master, media and client are all at 7.1.0.x, for example, it doesn't matter if the client's "x" is higher than the servers', so long as they're all at the same minor version (in this case, 7.1).
EDIT: Sorry, I hit Edit instead of Reply!
07-03-2012 01:53 PM
Chris. Thanks for clearing that up.
07-04-2012 09:19 AM
So if I understood correctly, it means that if the Backup Proxy host is at NBU 7.1.0.4 and the master is at NBU 7.1.0.2, this hardly causes any problem?
Also, regarding this issue:
>> The IP of the problematic VM is static.
>> We can't use pre/post scripts to stop/freeze applications, as that would create downtime for the application, which is not possible. Disabling quiescing for the whole system is not recommended by Symantec, and if we disable quiescing for the application then there is no use in backing up that VM, because recovery of the application is not guaranteed, which is required in this case.
>> We have tried restarting the VMware Tools services; it was of no use.
>> We have increased the snapshot timeout value on the proxy host; again, that was of no use.
>> Reinstalled VMware Tools and still the same issue.
But here is one catch:
Backups have been failing for the last week and there were no VSS-related errors in Event Viewer until today, but today we found an error in Event Viewer regarding it. I am puzzled: if everything was fine earlier, why was it not working then, and now that it is still not working, how come it has encountered an error with the VSS writers?
Also, please let me know if there is any way to increase the timeout value for deleting a snapshot in NBU.