04-20-2016 06:36 AM
I have a problem where my master server stops responding after about 5 days. Note that it is NetBackup (NBU) that stops responding; the OS keeps running fine. Restarting the NBU services works, but I don't want to keep restarting services every week.
OS version: SuSE 12.
NBU Version 7.7.2
When I try to log on, it stalls at the stage "Connecting Netbackup service layer". When this happens, I see a lot of bpdbm processes (more than 20) running on the server. In normal operation there are only 3 or 4 bpdbm processes.
I am running backups to a Quantum DXi using OST and replicating to DR in this environment.
What do I need to check when NetBackup stops responding?
04-20-2016 06:00 PM
Hi Ngarart,
I would ask typical questions:
* Is this a new setup?
* If not, did anything change before it started happening? e.g. an upgrade from version x.x.x.x to 7.7.2, a newly added Oracle policy, etc.
* What's the output of /usr/openv/netbackup/bin/bpps -x? Any critical processes missing?
* Check what the bpdbm and nbsl logs say; they are in /usr/openv/netbackup/logs/bpdbm/ and /usr/openv/logs/nbsl/ respectively.
* Have you tuned the master server? https://www.veritas.com/support/en_US/article.TECH167095 is a good start.
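If it helps, here is a minimal sketch for keeping an eye on the bpdbm process count between hangs. The function name count_procs and the BPDBM_LIMIT variable are my own, not part of NetBackup, and the limit of 20 is taken only from the symptom described above, so adjust it to your environment.

```shell
#!/bin/sh
# Count running processes whose command name matches "$1" exactly.
# count_procs is a hypothetical helper name, not a NetBackup tool.
count_procs() {
    # "ps -e -o comm=" prints one bare command name per process.
    # grep -c exits non-zero when nothing matches, hence "|| true".
    ps -e -o comm= | grep -c -x -- "$1" || true
}

# Assumed threshold, based on the "more than 20" figure in this thread.
BPDBM_LIMIT=20

n=$(count_procs bpdbm)
if [ "$n" -gt "$BPDBM_LIMIT" ]; then
    echo "WARNING: $n bpdbm processes running (limit $BPDBM_LIMIT)"
else
    echo "OK: $n bpdbm processes running"
fi
```

Running something like this from cron every few minutes and keeping the output would show whether the count climbs gradually over the 5 days or jumps suddenly just before the hang.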
04-21-2016 03:31 AM
What does /var/log/messages say?
Have you run the SORT collector to check whether the OS has the right kernel settings:
https://sort.veritas.com/netbackup
See the "Custom Reports Using Data Collectors".
04-22-2016 01:09 AM
Hi Wiriadi,
This is a new setup, less than two months old. Nothing changed.
I have updated /etc/security/limits.conf values as recommended.
I also added fs.file-max = 2097152 to /etc/sysctl.conf. Without that I was getting a "Too many open files in system" error.
Now I will have to wait for about 5 days to see if this is going to happen again.
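While waiting, system-wide file-handle usage can be watched against the fs.file-max limit. This is only a rough sketch: fd_usage_pct is a made-up helper name, and it relies on the standard Linux /proc/sys/fs/file-nr file, whose three fields are allocated handles, allocated-but-unused handles, and the maximum.

```shell
#!/bin/sh
# Percentage of the system-wide file-handle limit in use.
# Args: allocated handles, maximum handles (fields 1 and 3 of file-nr).
# fd_usage_pct is a hypothetical helper name used for illustration.
fd_usage_pct() {
    echo $(( $1 * 100 / $2 ))
}

if [ -r /proc/sys/fs/file-nr ]; then
    # file-nr holds: allocated, allocated-but-unused, maximum
    read -r allocated unused maximum < /proc/sys/fs/file-nr
    echo "file handles: $allocated of $maximum ($(fd_usage_pct "$allocated" "$maximum")%)"
fi
```

If the percentage keeps rising day after day, that would suggest something is leaking handles rather than the limit simply having been too low.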
04-26-2016 11:19 PM
Hi Ngarart,
That's a good start. Let's see how it goes.
Cheers!