10-19-2010 02:31 PM
Hi,
Please tell me whether I am misreading the output, whether I misconfigured NetBackup, or whether NetBackup 6.5 (2009.11.06) has a bug. I noticed something unexpected in the event log reported by /usr/openv/netbackup/bin/admincmd/bpdbjobs, running on Solaris 10 10/09 s10x_u8wos_08a X86 (SunOS 5.10 Generic_141445-09 i86pc i386 i86pc), for one particular job (a failure, exit code 48). NetBackup attempted the job 15 times over the course of about one hour.
What bothers me is that the event log message identifying the error that killed a given attempt appears among the messages of the previous attempt. It looks to me as if NetBackup assigns the events to the wrong attempts. Am I interpreting this wrong, or is the output of bpdbjobs wrong? Below is one line of output, split on ',' for easier reading: the first 16 fields, then the tail starting at field 290. Please pay special attention to the time stamps, and to the pid in each Error message compared with the pid of the attempt it is listed under.
Please advise,
Keith Cascio
keithcascio@master0:~$ sudo /usr/openv/netbackup/bin/admincmd/bpdbjobs -all_columns -jobid 140162 2>&1 | tr ',' '\n' | grep -v '^$' | gsed -nre '1,16p;17s/.*/\nSNIP!!!!\n/p;290,$p'
140162
0
3
48
happy-policy
Daily_Full
sad-client3.acme.com
master0
1287425114
0000000005
1287425119
master0
14
100
9862
root

SNIP!!!!

0
0
6287
master0
master0
1287424512
0000000007
1287424519
48
client hostname could not be found
12
10/18/10 19:55:12 - requesting resource master0
10/18/10 19:55:12 - requesting resource master0.NBU_CLIENT.MAXJOBS.sad-client3.acme.com
10/18/10 19:55:12 - requesting resource master0.NBU_POLICY.MAXJOBS.happy-policy
10/18/10 19:55:12 - granted resource master0.NBU_CLIENT.MAXJOBS.sad-client3.acme.com
10/18/10 19:55:12 - granted resource master0.NBU_POLICY.MAXJOBS.happy-policy
10/18/10 19:55:12 - granted resource G42339
10/18/10 19:55:12 - granted resource master0_hcart3_5
10/18/10 19:55:12 - granted resource master0
10/18/10 19:55:12 - estimated 315167798 kbytes needed
10/18/10 19:55:12 - started process bpbrm (6287)
10/18/10 19:55:13 - end writing
10/18/10 20:00:13 - Error bpbrm(pid=7853) bpcd on sad-client3.acme.com exited with status 48: client hostname could not be found
0
0
7853
master0
master0
1287424813
0000000006
1287424819
48
client hostname could not be found
12
10/18/10 20:00:13 - requesting resource master0
10/18/10 20:00:13 - requesting resource master0.NBU_CLIENT.MAXJOBS.sad-client3.acme.com
10/18/10 20:00:13 - requesting resource master0.NBU_POLICY.MAXJOBS.happy-policy
10/18/10 20:00:13 - granted resource master0.NBU_CLIENT.MAXJOBS.sad-client3.acme.com
10/18/10 20:00:13 - granted resource master0.NBU_POLICY.MAXJOBS.happy-policy
10/18/10 20:00:13 - granted resource G42339
10/18/10 20:00:13 - granted resource master0_hcart3_4
10/18/10 20:00:13 - granted resource master0
10/18/10 20:00:13 - estimated 315167798 kbytes needed
10/18/10 20:00:13 - started process bpbrm (7853)
10/18/10 20:00:14 - end writing
10/18/10 20:05:15 - Error bpbrm(pid=9862) bpcd on sad-client3.acme.com exited with status 48: client hostname could not be found
0
0
9862
master0
master0
1287425114
0000000005
1287425119
48
client hostname could not be found
11
10/18/10 20:05:14 - requesting resource master0
10/18/10 20:05:14 - requesting resource master0.NBU_CLIENT.MAXJOBS.sad-client3.acme.com
10/18/10 20:05:14 - requesting resource master0.NBU_POLICY.MAXJOBS.happy-policy
10/18/10 20:05:14 - granted resource master0.NBU_CLIENT.MAXJOBS.sad-client3.acme.com
10/18/10 20:05:14 - granted resource master0.NBU_POLICY.MAXJOBS.happy-policy
10/18/10 20:05:14 - granted resource G42339
10/18/10 20:05:14 - granted resource master0_hcart3_1
10/18/10 20:05:14 - granted resource master0
10/18/10 20:05:14 - estimated 315167798 kbytes needed
10/18/10 20:05:14 - started process bpbrm (9862)
10/18/10 20:05:15 - end writing
0
0
140162
0
1
0
0
sad-client3.acme.com_1287425114
0
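
To make the mismatch concrete, here is a minimal Python sketch (my own illustration, not a NetBackup tool) that checks the three attempts shown above. It rests on assumptions I have not confirmed against the documentation: that the third field of each attempt block is the bpbrm pid, that the two epoch values are the attempt's start and end times, and that the first log line of a block marks the attempt's start. The names ATTEMPTS and misplaced_lines are hypothetical, and only the log lines relevant to the check are transcribed. A line is flagged when it names a different bpbrm pid than its attempt, or when it is stamped later than the attempt lasted.

```python
import re
from datetime import datetime

# Matches "MM/DD/YY HH:MM:SS - message" as printed in the output above.
LOG_RE = re.compile(r"^(\d{2}/\d{2}/\d{2} \d{2}:\d{2}:\d{2}) - (.*)$")
# Matches both "bpbrm (6287)" and "bpbrm(pid=7853)".
PID_RE = re.compile(r"bpbrm\s*\((?:pid=)?(\d+)\)")

ERR = "bpcd on sad-client3.acme.com exited with status 48: client hostname could not be found"

# (assumed pid, assumed start epoch, assumed end epoch, selected log lines)
ATTEMPTS = [
    (6287, 1287424512, 1287424519, [
        "10/18/10 19:55:12 - started process bpbrm (6287)",
        "10/18/10 19:55:13 - end writing",
        "10/18/10 20:00:13 - Error bpbrm(pid=7853) " + ERR,
    ]),
    (7853, 1287424813, 1287424819, [
        "10/18/10 20:00:13 - started process bpbrm (7853)",
        "10/18/10 20:00:14 - end writing",
        "10/18/10 20:05:15 - Error bpbrm(pid=9862) " + ERR,
    ]),
    (9862, 1287425114, 1287425119, [
        "10/18/10 20:05:14 - started process bpbrm (9862)",
        "10/18/10 20:05:15 - end writing",
    ]),
]


def misplaced_lines(pid, start_epoch, end_epoch, lines):
    """Return log lines that look like they belong to a different attempt."""
    duration = end_epoch - start_epoch  # seconds the attempt lasted
    first_stamp = None
    flagged = []
    for line in lines:
        match = LOG_RE.match(line)
        if match is None:
            continue
        stamp = datetime.strptime(match.group(1), "%m/%d/%y %H:%M:%S")
        if first_stamp is None:
            first_stamp = stamp  # assumed to coincide with the attempt start
        # Offsets are relative to the first line, so no time zone is needed.
        offset = (stamp - first_stamp).total_seconds()
        pid_match = PID_RE.search(match.group(2))
        wrong_pid = pid_match is not None and int(pid_match.group(1)) != pid
        too_late = offset > duration
        if wrong_pid or too_late:
            flagged.append(line)
    return flagged


for pid, start, end, lines in ATTEMPTS:
    for line in misplaced_lines(pid, start, end, lines):
        print(f"listed under pid {pid}: {line}")
```

If those assumptions hold, the Error line for pid 7853 is flagged under the attempt for pid 6287, the Error line for pid 9862 is flagged under the attempt for pid 7853, and the last attempt (pid 9862) carries no Error line of its own.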