12-07-2010 06:20 PM
Hi Guys,
My netbackup image cleanup failed with status code 174. I have no idea why is this problem suddenly happened as it was okay previously. Anyone can help me on this?
I am running on netbackup 6.5 btw.
rgds,
Ronny
12-07-2010 08:56 PM
I've seen similar errors when disk images cannot be found.
See if there's anything helpful in admin log on master.
12-08-2010 11:57 PM
Is there any particular messages I need to look at? There's alot of messages inside the log and I dun understand some.
12-09-2010 12:02 AM
Locate the time in the log that corresponds with the time of the image cleanup failure.
Look for <8> (Warning), <16> (Error) and <32> (Severe error).
Once you have located an error, read the couple of lines above the error to see actions leading up to the error.
12-09-2010 12:46 AM
This is what I got matching the time and job ID:
07:01:12.652 [3151] <2> logparams: /usr/openv/netbackup/bin/admincmd/nbdelete -allvolumes -jobid 152591
Also I got this error in the log:
03:11:43.801 [18742] <8> bpstsinfo/DPSPROXY WARNING: error on sts getcred (3/220)
531 03:11:46.014 [18752] <16> emmlib_ListStorageUnit: (0) LIST_STORAGE_UNIT failed, emmError = 4005006, nbError = 0
532 03:11:46.014 [18752] <16> db_getSTUNITlist: (-) Translating EMM_ERROR_DBServerDown(4005006) to 220 in the NetBackup context
533 03:11:46.016 [18752] <16> bpstsinfo/update ERROR: unable to retrieve stu list from server (220)
1681 03:16:46.219 [19303] <16> emmlib_ListStorageUnit: (0) LIST_STORAGE_UNIT failed, emmError = 4005006, nbError = 0
1682 03:16:46.219 [19303] <16> db_getSTUNITlist: (-) Translating EMM_ERROR_DBServerDown(4005006) to 220 in the NetBackup context
1683 03:16:46.220 [19303] <16> bpstsinfo/update ERROR: unable to retrieve stu list from server (220)
2769 03:21:49.953 [19730] <16> nbdelete: RemoveAllVolumes: Activity monitor job id = 152577
2792 03:21:52.238 [19730] <16> RequestInitialResources: MultiResReq.cpp:2268 resource request failed [800]
2795 03:21:52.239 [19730] <16> nbdelete: RemoveFragments: Cannot obtain resources for this job : error [800]
3436 06:51:46.206 [2230] <16> emmlib_ListStorageUnit: (0) LIST_STORAGE_UNIT failed, emmError = 4005006, nbError = 0
3437 06:51:46.206 [2230] <16> db_getSTUNITlist: (-) Translating EMM_ERROR_DBServerDown(4005006) to 220 in the NetBackup context
3438 06:51:46.208 [2230] <16> bpstsinfo/update ERROR: unable to retrieve stu list from server (220)
4578 06:56:46.387 [2748] <16> emmlib_ListStorageUnit: (0) LIST_STORAGE_UNIT failed, emmError = 4005006, nbError = 0
4579 06:56:46.388 [2748] <16> db_getSTUNITlist: (-) Translating EMM_ERROR_DBServerDown(4005006) to 220 in the NetBackup context
4580 06:56:46.389 [2748] <16> bpstsinfo/update ERROR: unable to retrieve stu list from server (220)
5555 07:01:12.972 [3151] <16> nbdelete: RemoveAllVolumes: Activity monitor job id = 152591
5578 07:01:15.099 [3151] <16> RequestInitialResources: MultiResReq.cpp:2268 resource request failed [800]
5581 07:01:15.099 [3151] <16> nbdelete: RemoveFragments: Cannot obtain resources for this job : error [800]
Any idea?
12-09-2010 01:26 AM
531 03:11:46.014 [18752] <16> emmlib_ListStorageUnit: (0) LIST_STORAGE_UNIT failed, emmError = 4005006, nbError = 0
532 03:11:46.014 [18752] <16> db_getSTUNITlist: (-) Translating EMM_ERROR_DBServerDown(4005006) to 220 in the NetBackup context
Check daemons on your master?
12-12-2010 09:19 PM
I assume that Image Cleanup jobs runs with offline catalog backup simultaneously.Check job activity.
If you configured offline catalog backup, I recommend you to change catalog backup method to online catalog backup.