Re: Error code 1

sahil_shine
Level 4
Hi All,

We are using NetBackup 6.5 on Solaris 10. We back up mostly databases, including the RMAN backups taken of our Oracle databases, plus a few database file systems. The problem is that I get error code 1 for each of my parent jobs, even though the child jobs complete successfully. Is there an issue, or is the backup successful?
I am confused about whether my backups are working properly. If they are not, how should I resolve this, given that each and every job ends with error code 1? What should I check to fix this problem?

Thanks
--------

sahil_shine
Level 4
I am getting the following in my bpbkar log file:
00:25:57.418 [17387] <4> bpbkar: INF - setenv FINISHED=1
00:25:59.930 [17392] <4> bpbkar main: real locales <C>
00:25:59.930 [17392] <4> bpbkar main: standardized locales - lc_messages <C> lc_ctype <C> lc_time <C> lc_collate <C> lc_numeric <C>
00:25:59.931 [17392] <2> logparams: bpbkar -IEL -nfsok -dt 0 -kl 0 -ru root -nocont -no_security /usr/openv/volmgr/database
00:25:59.936 [17392] <4> bpbkar: INF - setenv KEYWORD=NONE
00:25:59.936 [17392] <4> bpbkar: INF - setenv STREAM_PID=17392
00:25:59.936 [17392] <4> bpbkar: INF - setenv STREAM_NUMBER=0
00:25:59.936 [17392] <4> bpbkar: INF - setenv STREAM_COUNT=0
00:25:59.937 [17392] <4> bpbkar: INF - setenv STREAMS=0
00:25:59.937 [17392] <4> bpbkar: INF - setenv BPSTART_TIMEOUT=0
00:25:59.937 [17392] <4> bpbkar: INF - setenv BPEND_TIMEOUT=0
00:25:59.937 [17392] <4> bpbkar: INF - setenv RESTARTED=0
00:25:59.937 [17392] <4> bpbkar: INF - Estimate:-1 -1
00:25:59.938 [17392] <2> bpbkar add_to_filelist: starting sizeof(filelistrec) <68>
00:25:59.938 [17392] <4> bpbkar: INF - Processing /usr/openv/volmgr/database
00:25:59.957 [17392] <4> bpbkar: INF - Client completed sending data for backup

00:25:59.958 [17392] <4> bpbkar: INF - bpbkar exit normal
00:25:59.958 [17392] <4> bpbkar: INF - EXIT STATUS 0: the requested operation was successfully completed
00:25:59.958 [17392] <4> bpbkar: INF - setenv FINISHED=1
00:26:03.756 [17397] <4> bpbkar main: real locales <C>
00:26:03.756 [17397] <4> bpbkar main: standardized locales - lc_messages <C> lc_ctype <C> lc_time <C> lc_collate <C> lc_numeric <C>
00:26:03.756 [17397] <2> logparams: bpbkar -IEL -nfsok -dt 0 -kl 0 -ru root -nocont -no_security /usr/openv/var/global
00:26:03.762 [17397] <4> bpbkar: INF - setenv KEYWORD=NONE
00:26:03.762 [17397] <4> bpbkar: INF - setenv STREAM_PID=17397
00:26:03.762 [17397] <4> bpbkar: INF - setenv STREAM_NUMBER=0
00:26:03.762 [17397] <4> bpbkar: INF - setenv STREAM_COUNT=0
00:26:03.762 [17397] <4> bpbkar: INF - setenv STREAMS=0
00:26:03.762 [17397] <4> bpbkar: INF - setenv BPSTART_TIMEOUT=0
00:26:03.762 [17397] <4> bpbkar: INF - setenv BPEND_TIMEOUT=0
00:26:03.762 [17397] <4> bpbkar: INF - setenv RESTARTED=0
00:26:03.762 [17397] <4> bpbkar: INF - Estimate:-1 -1
00:26:03.763 [17397] <2> bpbkar add_to_filelist: starting sizeof(filelistrec) <68>
00:26:03.763 [17397] <4> bpbkar: INF - Processing /usr/openv/var/global
00:26:03.793 [17397] <4> bpbkar: INF - Client completed sending data for backup

00:26:03.797 [17397] <4> bpbkar: INF - bpbkar exit normal
00:26:03.797 [17397] <4> bpbkar: INF - EXIT STATUS 0: the requested operation was successfully completed
00:26:03.797 [17397] <4> bpbkar: INF - setenv FINISHED=1
00:26:07.532 [17406] <4> bpbkar main: real locales <C>
00:26:07.532 [17406] <4> bpbkar main: standardized locales - lc_messages <C> lc_ctype <C> lc_time <C> lc_collate <C> lc_numeric <C>
00:26:07.533 [17406] <2> logparams: bpbkar -IEL -nfsok -dt 0 -kl 0 -ru root -nocont -no_security /usr/openv/var/auth
00:26:07.538 [17406] <4> bpbkar: INF - setenv KEYWORD=NONE
00:26:07.539 [17406] <4> bpbkar: INF - setenv STREAM_PID=17406
00:26:07.539 [17406] <4> bpbkar: INF - setenv STREAM_NUMBER=0
00:26:07.539 [17406] <4> bpbkar: INF - setenv STREAM_COUNT=0
00:26:07.539 [17406] <4> bpbkar: INF - setenv STREAMS=0
00:26:07.539 [17406] <4> bpbkar: INF - setenv BPSTART_TIMEOUT=0
00:26:07.539 [17406] <4> bpbkar: INF - setenv BPEND_TIMEOUT=0
00:26:07.539 [17406] <4> bpbkar: INF - setenv RESTARTED=0
00:26:07.539 [17406] <4> bpbkar: INF - Estimate:-1 -1
00:26:07.540 [17406] <2> bpbkar add_to_filelist: starting sizeof(filelistrec) <68>
00:26:07.540 [17406] <4> bpbkar: INF - Processing /usr/openv/var/auth
00:26:07.570 [17406] <4> bpbkar: INF - Client completed sending data for backup

00:26:07.570 [17406] <4> bpbkar: INF - bpbkar exit normal
00:26:07.570 [17406] <4> bpbkar: INF - EXIT STATUS 0: the requested operation was successfully completed
00:26:07.570 [17406] <4> bpbkar: INF - setenv FINISHED=1
00:26:11.207 [17414] <4> bpbkar main: real locales <C>
00:26:11.207 [17414] <4> bpbkar main: standardized locales - lc_messages <C> lc_ctype <C> lc_time <C> lc_collate <C> lc_numeric <C>
00:26:11.207 [17414] <2> logparams: bpbkar -IEL -nfsok -dt 0 -kl 0 -ru root -nocont -no_security /usr/openv/var/vxss


Please advise, or let me know which log output I should post.

Thanks
--------

sahil_shine
Level 4
00:18:07.685 [16987] <2> bpcd peer_hostname: Connection from host pbh04backupsrv-B (192.168.252.143) port 62747
00:18:07.685 [16987] <2> bpcd valid_server: comparing pbh04backupsrv-B and pbh04backupsrv-B
00:18:07.686 [16987] <4> bpcd valid_server: hostname comparison succeeded
00:18:07.686 [16987] <2> bpcd main: output socket port number = 1
00:18:07.701 [16987] <2> bpcd main: Duplicated vnetd socket on stderr
00:18:07.702 [16987] <2> bpcd main: <---- NetBackup 6.5 0 ------------initiated
00:18:07.702 [16987] <2> bpcd main: VERBOSE = 0
00:18:07.702 [16987] <2> bpcd main: Not using VxSS authentication with pbh04backupsrv-B
00:18:07.712 [16987] <2> bpcd main: BPCD_GET_PROCESS_STATUS_RQST
00:18:07.712 [16987] <2> find_processes: total malloc-ed = 4096
00:18:07.742 [16987] <2> is_it_a_keeper: realloc UP from 4191 to 8287
00:18:07.764 [16987] <2> get_unxwre_processes: 173 files considered, 171 files opened/read
00:18:07.764 [16987] <2> find_processes: progs_list_size = 8287 == total calculated size = 8287 == (total_space_used = 8161 + whats_left = 126)
00:18:07.764 [16987] <2> find_processes: realloc down to 8161
00:18:07.764 [16987] <16> bpcd main: strlen(pProcList) = 8160
00:18:07.764 [16987] <16> bpcd main: char_count = 8161, .line_count = 67
00:18:07.766 [16987] <2> bpcd main: BPCD_DISCONNECT_RQST
00:18:07.766 [16987] <2> bpcd exit_bpcd: exit status 0 ----------->exiting
00:19:07.966 [17021] <2> bpcd main: offset to GMT -19800
00:19:07.966 [17021] <2> logconnections: BPCD ACCEPT FROM 192.168.252.143.62768 TO 192.168.252.143.13724
00:19:07.967 [17021] <2> bpcd main: setup_sockopts complete
00:19:07.970 [17021] <2> bpcd peer_hostname: Connection from host pbh04backupsrv-B (192.168.252.143) port 62768
00:19:07.970 [17021] <2> bpcd valid_server: comparing pbh04backupsrv-B and pbh04backupsrv-B
00:19:07.971 [17021] <4> bpcd valid_server: hostname comparison succeeded
00:19:07.971 [17021] <2> bpcd main: output socket port number = 1
00:19:07.989 [17021] <2> bpcd main: Duplicated vnetd socket on stderr
00:19:07.989 [17021] <2> bpcd main: <---- NetBackup 6.5 0 ------------initiated
00:19:07.989 [17021] <2> bpcd main: VERBOSE = 0
00:19:07.989 [17021] <2> bpcd main: Not using VxSS authentication with pbh04backupsrv-B
00:19:07.990 [17021] <2> bpcd main: BPCD_GET_PROCESS_STATUS_RQST
00:19:07.990 [17021] <2> find_processes: total malloc-ed = 4096
00:19:08.020 [17021] <2> is_it_a_keeper: realloc UP from 4191 to 8287
00:19:08.039 [17021] <8> get_unxwre_processes: ignoring open error of file /proc/17022/psinfo, (2):No such file or directory
00:19:08.043 [17021] <2> get_unxwre_processes: 174 files considered, 171 files opened/read
00:19:08.043 [17021] <2> find_processes: progs_list_size = 8287 == total calculated size = 8287 == (total_space_used = 8161 + whats_left = 126)
00:19:08.043 [17021] <2> find_processes: realloc down to 8161
00:19:08.055 [17021] <16> bpcd main: strlen(pProcList) = 8160


Does this have anything to do with the /proc file that cannot be opened? I am confused about where to look for this; every job of mine ends with error code 1.

CY
Level 6
Certified
/proc is a memory image of each process. It does not contain real files on disk anywhere. It is used by programs such as ps and the tools in /usr/proc/bin to examine process state.

So it is very safe to exclude /proc from backups. Just add /proc to the exclude list file on your NBU client:

/usr/openv/netbackup/exclude_list
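
If you want to confirm the entry is really there, a quick check could look like the sketch below. It is only an illustration: the helper name `is_excluded` is made up, and it does a plain exact-match comparison per line, so it ignores any wildcard handling NetBackup itself may apply to exclude list entries.

```python
from pathlib import Path


def is_excluded(exclude_text: str, path: str) -> bool:
    """Return True if `path` appears as an exact entry in the exclude list text.

    Hypothetical helper: compares whole lines only and ignores wildcards.
    """
    wanted = path.rstrip("/")
    entries = (line.strip() for line in exclude_text.splitlines())
    return any(entry.rstrip("/") == wanted for entry in entries if entry)


if __name__ == "__main__":
    # Path taken from the post above; adjust if your client is installed elsewhere.
    exclude_file = Path("/usr/openv/netbackup/exclude_list")
    if exclude_file.exists():
        print("/proc excluded:", is_excluded(exclude_file.read_text(), "/proc"))
    else:
        print(f"{exclude_file} not found on this host")
```

Run it on the client itself, since the exclude list lives on the client and not on the master server.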

sahil_shine
Level 4



00:19:08.039 [17021] <8> get_unxwre_processes: ignoring open error of file /proc/17022/psinfo, (2):No such file or directory


So is this the real problem? Is that why my backups are showing error code 1? But the Veritas manual says these files are excluded automatically during backups. Please advise.

J_H_Is_gone
Level 6
CY is correct; you can see the issue in your log.

Verify the exclude list on your client.
If /proc is not there, add it to make sure, then see whether the next backup of the server gets a 1 or a 0.

sahil_shine
Level 4
Hi All,

I am still getting error code 1. Please let me know what to do.

sahil_shine
Level 4
I have done the same, but with no result.

Yasuhisa_Ishika
Level 6
Partner Accredited Certified
What type is this policy?

If this policy is Oracle type, you should check your RMAN output and script. You will probably find some RMAN- or ORA- errors. Enabling the bphdb debug log on the client host is helpful.

If this policy is Standard type, first check the NetBackup logs under /usr/openv/netbackup/db/error (or use the bperror command) to find ERR and TRV tokens.
If the error really occurs in the parent job, the bpbrm debug log or the nbpem unified log will help.
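
For the legacy debug logs posted earlier in this thread (bpbkar, bpcd), it can help to filter on the number in angle brackets before reading everything. The sketch below assumes the line layout seen in those excerpts (time, PID in square brackets, level in angle brackets, then the message) and assumes that a higher number means a more severe entry; both are inferred from the pasted logs, not from product documentation. The function name and the default threshold of 8 are arbitrary choices for illustration.

```python
import re

# Matches lines shaped like:
#   00:19:08.039 [17021] <8> get_unxwre_processes: ignoring open error ...
LINE_RE = re.compile(r"^(\d{2}:\d{2}:\d{2}\.\d{3}) \[(\d+)\] <(\d+)> (.*)$")


def lines_at_or_above(log_text, min_level=8):
    """Yield (time, pid, level, message) for entries whose level >= min_level."""
    for line in log_text.splitlines():
        match = LINE_RE.match(line.strip())
        if match is None:
            continue  # blank lines or wrapped continuation lines
        time, pid, level, message = match.groups()
        if int(level) >= min_level:
            yield time, int(pid), int(level), message


# Three lines copied from the bpcd excerpt in this thread.
SAMPLE = """\
00:19:08.020 [17021] <2> is_it_a_keeper: realloc UP from 4191 to 8287
00:19:08.039 [17021] <8> get_unxwre_processes: ignoring open error of file /proc/17022/psinfo, (2):No such file or directory
00:19:08.055 [17021] <16> bpcd main: strlen(pProcList) = 8160
"""

for entry in lines_at_or_above(SAMPLE):
    print(entry)
```

Treat the level only as a first filter. In the bpcd excerpt the `<16>` lines just report string lengths, and the `<8>` line says the open error is being ignored.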

Cheers!

sahil_shine
Level 4

Hi,

This policy is Standard type. We back up a few database files, and our DBA team takes database backups through RMAN onto a file system, which we then back up. One more thing: when I look at the policy type options while configuring a policy, I can't see an Oracle policy type. I think we don't have a license. Is that what is causing the problem? I am new to this organization, and the person before me configured all the policies and then left :(

CY
Level 6
Certified
OK, here are a couple of things:

1. Yes, apparently you do not have the "NetBackup for Oracle" option licensed, so the "Oracle" policy type is not available.

2. The way you back up the Oracle DB is fine. Basically your DBA runs the RMAN backup to dump the DB into a file system. NetBackup then just backs up the files in that file system as flat files. Since NetBackup is not backing up the live DB, you don't need the Oracle policy type. The "Standard" policy type is perfect for this situation.

3. I'd like to know whether you specify the file system(s) you want to back up in the "backup selection list", or just use "ALL_LOCAL_DRIVES" as the selection.

4. Please run the NetBackup built-in error report for this client and post it here, so that we can see where the error comes from.

sahil_shine
Level 4
root@pbh04backupsrv # bperror -client pbh05asapdb-B
1241528403 1 4 4 pbh04backupsrv-B 1750 1750 0 pbh05asapdb-B nbjm started backup job for client pbh05asapdb-B, policy rmanbkp-asapdb, schedule Asapdb_bkp on storage unit pbh04backupsrv-B-hcart-robot-tld-0
1241528407 1 4 16 pbh04backupsrv-B 1750 1750 0 pbh05asapdb-B bpbrm BMRERR: failed to connect via vnetd to bmr master daemon: cannot connect on socket (25)
1241528407 1 4 4 pbh04backupsrv-B 1752 1750 0 pbh05asapdb-B nbjm started backup job for client pbh05asapdb-B, policy rmanbkp-asapdb, schedule Asapdb_bkp on storage unit pbh04backupsrv-B-hcart-robot-tld-0
1241528481 1 388 4 pbh04backupsrv-B 1752 1750 0 pbh05asapdb-B bptm begin writing backup id pbh05asapdb-B_1241528407, copy 1, fragment 1, to media id TIK016 on drive HP.ULTRIUM4-SCSI.002 (index 2)
1241528491 1 4 4 pbh04backupsrv-B 1752 1750 0 pbh05asapdb-B bptm successfully wrote backup id pbh05asapdb-B_1241528407, copy 1, fragment 1, 441696 Kbytes at 58033.898 Kbytes/sec
1241528507 1 68 4 pbh04backupsrv-B 1752 1750 0 pbh05asapdb-B nbpem CLIENT pbh05asapdb-B POLICY rmanbkp-asapdb SCHED Asapdb_bkp EXIT STATUS 0 (the requested operation was successfully completed)
1241528507 1 68 4 pbh04backupsrv-B 1750 1750 0 pbh05asapdb-B nbpem CLIENT pbh05asapdb-B POLICY rmanbkp-asapdb SCHED Asapdb_bkp EXIT STATUS 1 (the requested operation was partially successful)
1241528507 1 4 16 pbh04backupsrv-B 1750 1750 0 pbh05asapdb-B nbpem backup of client pbh05asapdb-B exited with status 1 (the requested operation was partially successful)


root@pbh04backupsrv # bperror -client pbh11aiadb01-B
1241535603 1 4 4 pbh04backupsrv-B 1756 1756 0 pbh11aiadb01-B nbjm started backup job for client pbh11aiadb01-B, policy rmanbkp_aiadb, schedule Rman-aiadb-diff-bkp on storage unit pbh04backupsrv-B-hcart-robot-tld-0
1241535607 1 4 16 pbh04backupsrv-B 1756 1756 0 pbh11aiadb01-B bpbrm BMRERR: failed to connect via vnetd to bmr master daemon: cannot connect on socket (25)
1241535607 1 4 4 pbh04backupsrv-B 1758 1756 0 pbh11aiadb01-B nbjm started backup job for client pbh11aiadb01-B, policy rmanbkp_aiadb, schedule Rman-aiadb-diff-bkp on storage unit pbh04backupsrv-B-hcart-robot-tld-0
1241535768 1 388 4 pbh04backupsrv-B 1758 1756 0 pbh11aiadb01-B bptm begin writing backup id pbh11aiadb01-B_1241535607, copy 1, fragment 1, to media id TIK017 on drive HP.ULTRIUM4-SCSI.001 (index 1)
1241536417 1 4 4 pbh04backupsrv-B 1758 1756 0 pbh11aiadb01-B bptm successfully wrote backup id pbh11aiadb01-B_1241535607, copy 1, fragment 1, 35353696 Kbytes at 54809.298 Kbytes/sec
1241536433 1 68 4 pbh04backupsrv-B 1758 1756 0 pbh11aiadb01-B nbpem CLIENT pbh11aiadb01-B POLICY rmanbkp_aiadb SCHED Rman-aiadb-diff-bkp EXIT STATUS 0 (the requested operation was successfully completed)
1241536433 1 68 4 pbh04backupsrv-B 1756 1756 0 pbh11aiadb01-B nbpem CLIENT pbh11aiadb01-B POLICY rmanbkp_aiadb SCHED Rman-aiadb-diff-bkp EXIT STATUS 1 (the requested operation was partially successful)
1241536433 1 4 16 pbh04backupsrv-B 1756 1756 0 pbh11aiadb01-B nbpem backup of client pbh11aiadb01-B exited with status 1 (the requested operation was partially successful)



Actually, I get error code 1 for each and every backup. When I check the logs, they show /proc/112/psinfo missing; everywhere I see psinfo missing. I am not taking a full OS backup, and I have also put it in the exclude_list. Still the same problem :(

CY
Level 6
Certified
OK, your bperror output shows that the errors are related to BMR (Bare Metal Restore), such as this one: "bpbrm BMRERR: failed to connect via vnetd to bmr master daemon: cannot connect on socket (25)"

Did you set up a BMR server?

If not, no wonder all your backups ended with status 1. Your real backup jobs were all successful, but the clients could not get their BMR configuration saved.

You should not enable the "Collect disaster recovery information for Bare Metal Restore" attribute in your backup policies if you have no BMR server.
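
To see the parent/child split at a glance, the bperror lines can be grouped by parent job. The sketch below is only an illustration and rests on an assumed field layout read off the output posted above: field 4 is the severity, field 6 the job ID, field 7 the parent (job group) ID, field 10 the reporting process, and the rest is the message. The function name `summarize` is made up.

```python
import re
from collections import defaultdict

EXIT_RE = re.compile(r"EXIT STATUS (\d+)")


def summarize(bperror_text):
    """Return {parent_job_id: {"exit": {job_id: status}, "errors": [text, ...]}}.

    Assumed layout (inferred from the thread, not from documentation):
    time version type severity server job_id parent_id unused client process message
    """
    jobs = defaultdict(lambda: {"exit": {}, "errors": []})
    for line in bperror_text.splitlines():
        parts = line.split(None, 10)
        if len(parts) < 11 or not parts[0].isdigit():
            continue  # shell prompts, blank lines, anything not shaped like a record
        severity, job_id, parent_id = int(parts[3]), int(parts[5]), int(parts[6])
        process, message = parts[9], parts[10]
        match = EXIT_RE.search(message)
        if match:
            jobs[parent_id]["exit"][job_id] = int(match.group(1))
        elif severity >= 16:
            jobs[parent_id]["errors"].append(f"{process} {message}")
    return dict(jobs)


# Three records copied from the bperror output for client pbh05asapdb-B.
SAMPLE = """\
1241528407 1 4 16 pbh04backupsrv-B 1750 1750 0 pbh05asapdb-B bpbrm BMRERR: failed to connect via vnetd to bmr master daemon: cannot connect on socket (25)
1241528507 1 68 4 pbh04backupsrv-B 1752 1750 0 pbh05asapdb-B nbpem CLIENT pbh05asapdb-B POLICY rmanbkp-asapdb SCHED Asapdb_bkp EXIT STATUS 0 (the requested operation was successfully completed)
1241528507 1 68 4 pbh04backupsrv-B 1750 1750 0 pbh05asapdb-B nbpem CLIENT pbh05asapdb-B POLICY rmanbkp-asapdb SCHED Asapdb_bkp EXIT STATUS 1 (the requested operation was partially successful)
"""

for parent, info in summarize(SAMPLE).items():
    print(parent, info["exit"], info["errors"])
```

With these records, the child job 1752 (the one that wrote the data) reports status 0, while the parent job 1750 reports status 1 and carries the BMRERR message.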

sahil_shine
Level 4
Yes, the errors are due to BMR. I unchecked it for one of the clients and checked the status; the status is now 0.

Ha ha, I was missing that error.

Thanks a lot, CY.