NB_dbsrv consuming 100% of CPU

Question

Hi all,
&nbsp;
I'm experiencying the same issue as&nbsp;https://www-secure.symantec.com/connect/forums/heavy-load-emm-db-high-cpu-utilization... with NBU 7.1.0.4 on hpux master server and having a lot of jobs starting but going to queue state showing "Waiting in NetBackup scheduler work queue on server ..." I noted that NB_dbsrv is consuming 100% of CPU... I do not have a big EMM_DATA.db file as you can see below:
ebrbsnp05 &gt;&gt; find /nbdb_catalog -name EMM_DATA.db
	/nbdb_catalog/data/EMM_DATA.db
	ebrbsnp05 &gt;&gt; ls -l /nbdb_catalog/data/EMM_DATA.db
	-rw-------&nbsp;&nbsp; 1 root&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; sys&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 51531776 Aug 15 12:26 /nbdb_catalog/data/EMM_DATA.db
	ebrbsnp05 &gt;&gt; du -k /nbdb_catalog/data/EMM_DATA.db
	50324&nbsp;&nbsp; /nbdb_catalog/data/EMM_DATA.db
	ebrbsnp05 &gt;&gt;
But I'm having 100 active jobs and 140 queued... all of the queued jobs are "Waiting in NetBackup scheduler work queue on server &lt;master server name&gt;"
I have opened a case with Symantec but it looks like they have not a clear understanding about the issue.
Have you finally got a solution for this case? How does the nbdb rebuild/reorganize that mph999 suggested above work? Did this help? Have you tried this?
Let me mention that I tried to perform this rebuild/reorganize yesterday, but I got an error while trying to take a backup of the nbdb vefore to do the first rebuild (I tried to take a backup of the nbdb just to be safe)... see below what I got...
I'm receiving the following error Segmentation fault (core dumped) when trying to perform a backup of the NDBD... could you please give us a hand?
Check below please…
ebrbsnp05 &gt;&gt; /usr/openv/netbackup/bin/nbdbms_start_stop start
ebrbsnp05 &gt;&gt; ../bpps -x
NB Processes
------------
&nbsp;&nbsp;&nbsp; root 11363&nbsp;&nbsp;&nbsp;&nbsp; 1&nbsp; 0 11:22:32 ?&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 0:00 /usr/openv/db//bin/NB_dbsrv @/usr/openv/var/global/server.conf @/usr/openv/var/global/databases.conf -hn 7
&nbsp;
&nbsp;
MM Processes
------------
&nbsp;
&nbsp;
Shared Symantec Processes
-------------------------
&nbsp;&nbsp;&nbsp; root 11324&nbsp;&nbsp;&nbsp;&nbsp; 1&nbsp; 0 11:22:27 ?&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 0:00 /opt/VRTSpbx/bin/pbx_exchange
ebrbsnp05 &gt;&gt; /usr/openv/db/bin/nbdb_ping
Database [NBDB] is alive and well on server [NB_ebrvmsnp01].
ebrbsnp05 &gt;&gt; mkdir /nbdb_catalog/backup
ebrbsnp05 &gt;&gt; /usr/openv/db/bin/nbdb_backup -dbn NBDB -online /nbdb_catalog/backup/backup1
Segmentation fault (core dumped)
ebrbsnp05 &gt;&gt;
&nbsp;
Symantec is looking into some logs now... could anybody please help? Thanks in advance...
Let me add that I have also rebooted the master server yesterday, but the issue is still there... the reboot did not help. :(
Seba.

jaime_vazquez · Accepted Answer

The crash of the nbdb_backup is a bit disturbing.&nbsp; A segmentation violation is due to some sort of memory allocation/deallocation or reference error.&nbsp; Check the system error logs to see what it is reporting for that.
The NB_dbsrv process performs SQL actions against all of the NBU databases (NBDB, EMMDB, BMRDB). If it is consuming 100% of the CPU then a query action is looping rather badly.&nbsp; This can cause lock-out conditions against other database connections.&nbsp; This may be a result of some data corruption internal to the DB&nbsp;&nbsp; Do a check of the server.log file to see if it displays any sort of DB access errors or problems. Always a good place to start.&nbsp; The NB_dvsrv process logs it's activity to the file noted in its 'server.conf' file.&nbsp; The location is specified by the "-o " option. Default location is "/usr/openv/db/log/server.log&nbsp;&nbsp;".&nbsp;
To make a copy of the DB files without the use of the nbdb_backup command:
1. Stop all NBU services:&nbsp; ../netbackup/bin/rc.kill_all
This will quiesce the databases and all connections to them.
2. Manually copy the contents of the directory to another location.&nbsp; Based on your initial input:
cd /nbdb_catalog/data
	cp * /nbdb_catalog/backup/backup1
Make any changes you want to the "server.conf" &nbsp;file at this time as well. If something goes awry you can revert back by stopping all sercies and copying back the files to their original location.
3. Restart services:&nbsp; ./netbackup/bin/rc.start_all
4. Perform the nbdb rebuild/reorganize noted previously.
5. The NB_dbsrv is multi-threaded and by default can have up to 20 open connections.&nbsp; The "-gn ##" modifies this.&nbsp; Larger values will help under specific loads.&nbsp; However, stay below 40.
6. The "-ch ###" sets the high water mark of cached shared memory for the process. I consider the value "-ch 3G" to be a bit too aggressive as it can tie up overall system resources.&nbsp;The information submitted does not indicate memory resources of the server. &nbsp;I try to limit this to "-ch 1G" under most circumstances.&nbsp; To see if NB_dbsrv is actively allocating more shared memory, look in the '/usr/openv/db/log/server.log' file.
Let's see how things go after making&nbsp;the changes.
&nbsp;

anonymous · Answer

@Seba
Before a year we dump an HPUX master server to a AIX system because of memory, CPU and many CORBA problems.
I suggest you to do the same. Go to an&nbsp;other platform, Linux is a good choice.&nbsp;

sebaquadri · Answer

Changing from HPUX to AIX or Linux is not an option for me as I work on HP :)
......
@mph999 I've already modifed some files, per symantec advise, and the issue has gone for almost 3 weeks... but last monday the issue was back and today I have 300 queued jobs and 150 active... I'm already using USE_HASH=1 as you mentioned...
The changes I did 3 weeks ago, when the issue was "fixed" (for 3 weeks) are these:
============begin===============
Install the following &nbsp;EEB and add USE_HASH=1 &nbsp; in the &nbsp;/usr/openv/var/global/emm.conf
=====================================
&nbsp;
&nbsp;NetBackup_7.1.0.4 &nbsp;2762882
&nbsp;
Problem Description
Due to the way Sybase processes the query involving backup-id field of EMM_ImageCopy and
EMM_Image tables, the query takes a long time to execute.
&nbsp;
Installed Files
&nbsp; &nbsp;/usr/openv/netbackup/bin/nbemm
&nbsp;
DOWNLOAD LINK:&nbsp;
ftp://iosupport:M3Q9r*SI0di7@ftp.entsupport.symantec.com/pub/support/outgoing/04757956/eebinstaller.2762882.1.hpia64
&nbsp;
=================================================================================
=================================================================================
&nbsp;
(1) &nbsp;NBU Config Tuning
=================================================================================
=================================================================================
&nbsp;
&nbsp;
1.A.)
Reduce Master / Media Server socket usage
&nbsp;
Move NBU internal VNETD socket connections on master servers to server loopback interface instead of using VNETD daemon &nbsp;--Add the following line to &nbsp;/usr/openv/netbackup/bp.conf
&nbsp; &nbsp; &nbsp; &nbsp;CONNECT_OPTIONS = localhost 1 0 2
&nbsp;
No restart needed
&nbsp;
=================================================================================
&nbsp;
1.B.)
Master Server
Add more connections to the EMM database if the environment has 10+ Media Servers &nbsp;and additional remote admin consoles / other increased backup activity.
&nbsp;
STOP NBU on the Master
&nbsp;
CREATE file--
&nbsp; &nbsp; &nbsp; UNIX: /usr/openv/var/global/emm.conf
&nbsp;&nbsp;
&nbsp; Add contents--
NUM_DB_BROWSE_CONNECTIONS=20
NUM_DB_CONNECTIONS=21
NUM_ORB_THREADS=35
USE_HASH=1 &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; (This line Not needed for 7.5, 7.1 needs EEB's)
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;&nbsp;
&nbsp;
REFERENCE: &nbsp; &nbsp;http://www.symantec.com/docs/TECH57277
----------------------------------------------------------------------------------------------
&nbsp;
Add more memory and threads to EMM DB due to large environment &nbsp; (35 media servers).
&nbsp;
&nbsp;
To change , edit file:
&nbsp; &nbsp; &nbsp; &nbsp;Unix: /usr/openv/var/global/server.conf&nbsp;
&nbsp; &nbsp; &nbsp; &nbsp;Make a backup copy of the file before modifying
&nbsp; &nbsp; &nbsp; &nbsp;
&nbsp;
&nbsp;
CURRENT file settings
--------------------------------
# cat -s /usr/openv/var/global/server.conf &nbsp;-n NB_ebrvmsnp01
&nbsp; &nbsp;-x tcpip(LocalOnly=YES;ServerPort=13785) &nbsp;-gp 4096 -gd DBA -gk DBA -gl DBA -ti 0 -c 25M -ch 500M -cl 25M -zl -os 1M &nbsp;-o /usr/openv/db//log/server.log &nbsp;-ud&nbsp;
&nbsp;
&nbsp;
MAKE CHANGES
---------------------------------
Update the following parameters in the file
&nbsp;
&nbsp; &nbsp;-ch 500M &nbsp; &nbsp; &nbsp;to new value &nbsp; &nbsp;-ch 3G
&nbsp; &nbsp;-gn 32 &nbsp; &nbsp; &nbsp; &nbsp; ( Add Missing parameter - &nbsp; Add '-gn 32' &nbsp;after &nbsp;'-gl DB ' &nbsp;Afor more DB threads
&nbsp; &nbsp;-m &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ( Add Missing parameter - &nbsp; &nbsp; &nbsp;Add &nbsp; &nbsp;' -m ' &nbsp; &nbsp; &nbsp; after ' -ud ' )&nbsp;
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; -m Helps to trim the NBDB.log file, this will be the default in NBU 7.5&nbsp;
&nbsp; &nbsp; &nbsp; &nbsp;&nbsp;
&nbsp;
AFTER CHANGES
---------------------------------
# cat -s /usr/openv/var/global/server.conf &nbsp;-n NB_ebrvmsnp01
&nbsp; &nbsp;-x tcpip(LocalOnly=YES;ServerPort=13785) &nbsp;-gp 4096 -gd DBA -gk DBA -gl DBA -gn 32 -ti 0 -c 25M -ch 3G -cl 25M -zl -os 1M &nbsp;-o /usr/openv/db//log/server.log &nbsp;-ud -m
&nbsp;
&nbsp;
Restart services on the master&nbsp;
&nbsp;
&nbsp;
REFERENCE:
http://www.symantec.com/docs/HOTO67149
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;
=================================================================================
&nbsp;
1.C.)
&nbsp;
&nbsp;
Verify minimum MASTER NBRB tuning file is in place
&nbsp;
CREATE file if missing--
&nbsp; &nbsp; &nbsp; UNIX: /usr/openv/var/global/nbrb.conf
&nbsp; &nbsp; &nbsp; Windows: &lt;install_path&gt;\Veritas\NetBackup\var\global
brb.conf
&nbsp;
Add contents--
SECONDS_FOR_EVAL_LOOP_RELEASE = 180
RESPECT_REQUEST_PRIORITY = 0
DO_INTERMITTENT_UNLOADS = 1
&nbsp;
&nbsp;
If file exists, do not adjust values to the ones noted above. As they may have been customized for your environment
&nbsp;REFERENCE: &nbsp; &nbsp;http://www.symantec.com/docs/TECH57942&nbsp;
&nbsp;
&nbsp; &nbsp;
&nbsp; &nbsp;
=================================================================================
=================================================================================
&nbsp;OS &nbsp;TUNING
=================================================================================
=================================================================================
&nbsp;
2)
OS TUNING FOR NETBACKUP RESOURCES &nbsp;- &nbsp;NBU 6.X &nbsp;/ &nbsp;7.X
-------------------------------------
&nbsp;
&nbsp;
2.A)
&nbsp;
Default SYSTEM File Descriptors too low for NetBackup Master
This is a very critical setting for all masters on all OS's
&nbsp;
&nbsp;
# /usr/bin/ulimit -a
time(seconds) &nbsp; &nbsp; &nbsp; &nbsp;unlimited
file(blocks) &nbsp; &nbsp; &nbsp; &nbsp; unlimited
data(kbytes) &nbsp; &nbsp; &nbsp; &nbsp; 1048576
stack(kbytes) &nbsp; &nbsp; &nbsp; &nbsp;8192
memory(kbytes) &nbsp; &nbsp; &nbsp; unlimited
coredump(blocks) &nbsp; &nbsp; 4194303
nofiles(descriptors) 2048 &nbsp; &nbsp; &nbsp; &nbsp; &lt;&lt;&lt;&lt;&lt;&lt;&lt;&lt;&lt;&lt;&lt;&lt;&lt; Set to 8192 minimum &nbsp; &nbsp; &nbsp;&nbsp;
&nbsp;
&nbsp;
&nbsp;
Reference
&nbsp;
&nbsp; &nbsp;Minimum O/S ulimit settings on UNIX platforms&nbsp;
&nbsp; &nbsp; http://www.symantec.com/docs/TECH75332&nbsp;
&nbsp;
&nbsp; &nbsp;Insufficient system file descriptors can cause the EMM_DATA.db file to grow very large.
&nbsp; &nbsp;http://www.symantec.com/docs/TECH168846
&nbsp;
=======end====================
Those settings helped, as I said before... but after 3 weeks the issue is back... :(
My friends, I'll continue on the new post --&gt;&nbsp;https://www-secure.symantec.com/connect/forums/nbd...
Please continue on this new post.
Thanks in advance!
Seba.

sebaquadri · Answer

the error I got when tried to backup the nbdb was due a bad command syntaxis :) I should not specify a file name, just the path to save the backup... for instance...
/usr/openv/db/bin/nbdb_backup -dbn NBDB -online /nbdb_catalog/backup/
this one worked perfect :)
I'm still having the issue... symantec suggested to make some more tunning but the issue is still there... I did the rebuild and reorganize of the NBDB and I changed the server.conf file and installed the following eeb patch (by symantec suggestion) but the issue has not gone :(
&nbsp;
1) removed the "-gn 32" from server.conf
2) changed -cl 25M and -c 250 for -cl250M and -c250M so now my server.conf file looks like:

&nbsp;-n NB_ebrvmsnp01
	&nbsp; &nbsp;-x tcpip(LocalOnly=YES;ServerPort=13785) &nbsp;-gp 4096 -gd DBA -gk DBA -gl DBA -ti 0 -c 250M -ch 3G -cl 250M -zl -os 1M &nbsp;-o /usr/ope
	nv/db//log/server.log &nbsp;-ud -m
	&nbsp;

3) Finally I installed the following eeb patch -&gt; eebinstaller.2925327.1.hpia64
&nbsp;
But the issue is still there, NB_dbsrv is taking one CPU to 100% and have 100 queued jobs (100 active jobs, so 200 total)
&nbsp;
Do you have any other suggestion? This master server have 4 CPUs and tons of memory... check below...
&nbsp;
I cannot believe this is a lack of resources on the master server...
&nbsp;

System: ebrbsnp05 &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; Fri Aug 16 18:25:19 2013
	Load averages: 0.34, 0.36, 0.40
	286 processes: 204 sleeping, 82 running
	Cpu states:
	CPU &nbsp; LOAD &nbsp; USER &nbsp; NICE &nbsp; &nbsp;SYS &nbsp; IDLE &nbsp;BLOCK &nbsp;SWAIT &nbsp; INTR &nbsp; SSYS
	&nbsp;0 &nbsp; &nbsp;0.20 &nbsp; 4.0% &nbsp; 0.0% &nbsp; 1.8% &nbsp;94.2% &nbsp; 0.0% &nbsp; 0.0% &nbsp; 0.0% &nbsp; 0.0%
	&nbsp;2 &nbsp; &nbsp;0.18 &nbsp;18.5% &nbsp; 0.0% &nbsp; 0.6% &nbsp;80.9% &nbsp; 0.0% &nbsp; 0.0% &nbsp; 0.0% &nbsp; 0.0%
	&nbsp;4 &nbsp; &nbsp;0.46 &nbsp;40.6% &nbsp; 0.0% &nbsp; 1.0% &nbsp;58.4% &nbsp; 0.0% &nbsp; 0.0% &nbsp; 0.0% &nbsp; 0.0%
	&nbsp;6 &nbsp; &nbsp;0.54 &nbsp;42.5% &nbsp; 0.0% &nbsp; 0.2% &nbsp;57.3% &nbsp; 0.0% &nbsp; 0.0% &nbsp; 0.0% &nbsp; 0.0%
	--- &nbsp; ---- &nbsp;----- &nbsp;----- &nbsp;----- &nbsp;----- &nbsp;----- &nbsp;----- &nbsp;----- &nbsp;-----
	avg &nbsp; 0.34 &nbsp;26.4% &nbsp; 0.0% &nbsp; 1.0% &nbsp;72.6% &nbsp; 0.0% &nbsp; 0.0% &nbsp; 0.0% &nbsp; 0.0%
	&nbsp;
	System Page Size: 4Kbytes
	Memory: 2342944K (2010624K) real, 7187024K (6423580K) virtual, 34561896K free &nbsp;Page# 1/13
	&nbsp;
	CPU TTY &nbsp; &nbsp;PID USERNAME PRI NI &nbsp; SIZE &nbsp; &nbsp;RES STATE &nbsp; &nbsp;TIME %WCPU &nbsp;%CPU COMMAND
	&nbsp;0 &nbsp; ? &nbsp; 22393 root &nbsp; &nbsp; 152 20 &nbsp;3330M &nbsp; 156M run &nbsp; &nbsp;334:55 101.45 101.28 NB_dbsrv
	&nbsp;0 &nbsp; ? &nbsp; 12766 root &nbsp; &nbsp; 128 20 80616K 40724K sleep &nbsp; &nbsp;2:49 &nbsp;2.55 &nbsp;2.54 bpbkar
	&nbsp;2 &nbsp; ? &nbsp; 22680 root &nbsp; &nbsp; 152 20 &nbsp; 224M 72764K run &nbsp; &nbsp; &nbsp;5:37 &nbsp;1.14 &nbsp;1.14 nbjm
	&nbsp;4 &nbsp; ? &nbsp; 22642 root &nbsp; &nbsp; 152 20 &nbsp; 804M &nbsp; 509M run &nbsp; &nbsp; 10:45 &nbsp;0.75 &nbsp;0.75 nbemm
	&nbsp;4 &nbsp; ? &nbsp; &nbsp;4023 root &nbsp; &nbsp; 168 20 15844K &nbsp;1332K sleep &nbsp; 14:01 &nbsp;0.52 &nbsp;0.52 utild
	&nbsp;2 pts/0 21332 root &nbsp; &nbsp; 178 20 11092K &nbsp;1892K run &nbsp; &nbsp; &nbsp;0:09 &nbsp;0.41 &nbsp;0.41 top
	&nbsp;2 &nbsp; ? &nbsp; 22677 root &nbsp; &nbsp; 154 20 &nbsp; 133M 10020K sleep &nbsp; &nbsp;2:39 &nbsp;0.39 &nbsp;0.39 bpdbm
	&nbsp;2 &nbsp; ? &nbsp; 22678 root &nbsp; &nbsp; 154 20 97084K 16664K sleep &nbsp; &nbsp;3:28 &nbsp;0.32 &nbsp;0.32 bpjobd
	&nbsp;4 &nbsp; ? &nbsp; &nbsp;3292 root &nbsp; &nbsp; 152 20 41924K 25228K run &nbsp; &nbsp; &nbsp;2:44 &nbsp;0.23 &nbsp;0.23 python
	&nbsp;2 &nbsp; ? &nbsp; &nbsp;1841 root &nbsp; &nbsp; 154 20 10992K &nbsp;1156K sleep &nbsp; &nbsp;5:47 &nbsp;0.22 &nbsp;0.21 sendmail:
	&nbsp;0 &nbsp; ? &nbsp; &nbsp;2812 sfmdb &nbsp; &nbsp;154 20 37652K &nbsp;1904K sleep &nbsp; &nbsp;6:43 &nbsp;0.21 &nbsp;0.21 postgres:
	&nbsp;6 &nbsp; ? &nbsp; &nbsp;3404 root &nbsp; &nbsp; 152 &nbsp;0 52968K 11676K run &nbsp; &nbsp; &nbsp;7:19 &nbsp;0.20 &nbsp;0.20 perfd
	&nbsp;4 &nbsp; ? &nbsp; 22645 root &nbsp; &nbsp; 152 20 &nbsp; 283M 87228K run &nbsp; &nbsp; &nbsp;6:59 &nbsp;0.16 &nbsp;0.16 nbrb
	&nbsp;6 &nbsp; ? &nbsp; &nbsp;2064 cimsrvr &nbsp;152 20 78608K 23296K run &nbsp; &nbsp; &nbsp;5:42 &nbsp;0.12 &nbsp;0.12 cimservermain
	&nbsp;6 &nbsp; ? &nbsp; &nbsp;2070 root &nbsp; &nbsp; 152 20 &nbsp; 204M 56384K run &nbsp; &nbsp; &nbsp;6:35 &nbsp;0.11 &nbsp;0.11 cimprovagt
	&nbsp;4 &nbsp; ? &nbsp; &nbsp; &nbsp;80 root &nbsp; &nbsp; 152 20 48960K 43520K run &nbsp; &nbsp; 22:43 &nbsp;0.10 &nbsp;0.10 vxfsd
	&nbsp;0 &nbsp; ? &nbsp; &nbsp;3348 root &nbsp; &nbsp; 127 20 62396K 18436K sleep &nbsp; &nbsp;2:23 &nbsp;0.09 &nbsp;0.09 scopeux
	&nbsp;4 &nbsp; ? &nbsp; 22183 root &nbsp; &nbsp; 154 20 30324K &nbsp;2272K sleep &nbsp; &nbsp;0:38 &nbsp;0.08 &nbsp;0.08 pbx_exchange
	&nbsp;6 &nbsp; ? &nbsp; &nbsp; &nbsp;82 root &nbsp; &nbsp; 152 20 &nbsp; 360K &nbsp; 320K run &nbsp; &nbsp; &nbsp;4:03 &nbsp;0.07 &nbsp;0.07 pm_schedcpu
	&nbsp;2 &nbsp; ? &nbsp; 19871 ed854663 152 20 25552K &nbsp;2812K run &nbsp; &nbsp; &nbsp;0:01 &nbsp;0.06 &nbsp;0.06 sshd:
	&nbsp;2 &nbsp; ? &nbsp; &nbsp;3356 root &nbsp; &nbsp; -16 20 38004K 13192K run &nbsp; &nbsp; &nbsp;4:30 &nbsp;0.05 &nbsp;0.05 midaemon
	&nbsp;
	&nbsp;
	Let me mention that this is a dedicated master server and NOT a media server...
	&nbsp;
	Looking forward to hearing from you...
	&nbsp;
	Seba.

&nbsp;

jaime_vazquez · Answer

Seba:
I do not think I said anything about lack of resources for this problem. I tried to explain that a NB_dbsrv connection thread process appears to be just spinning inside a database.&nbsp; Understand that NB_dbsrv handles connections from the NBU processes to all of the databases. That would be NBDB. EMMDB, and (if configured) BMRDB. NB_dbsrv is multi-threaded and as such will have multiple threads running associated within its process scope. &nbsp;The "spining" thread, if one exists as I suspect, &nbsp;will be operating strictly in memory, using shared memory space, so nothing will be stoping it from taking over a CPU time slice. For me that is the best guess.
Did you look at the file "/usr/openv/db/log/server.log &nbsp;" to see what is being written to it?&nbsp; As I said before, the information may be valuable for this.
Also, when do you see the CPU hit the 100% load value? Is it that way after stopping and then starting NBU processes?
&nbsp;

sebaquadri · Answer

Hi Jaime,
&nbsp;
I do not see anything wrong on this log... I cut the lastest 100 lines (I have 103 queued jobs and 98 active currently)..
On the other hand we do not have BMRDB as we do not use Bare Metal Restore option... most of our backups are just "standard" "windows" or "sap" or "oracle"...
ebrbsnp05 &gt;&gt; tail -100 /usr/openv/db/log/server.log
I. 08/16 14:30:26. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 14:30
I. 08/16 14:30:26. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 14:30
I. 08/16 14:43:20. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 14:43
I. 08/16 14:43:20. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 14:43
I. 08/16 14:45:27. Starting checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 14:45
I. 08/16 14:45:27. Finished checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 14:45
I. 08/16 14:55:05. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 14:55
I. 08/16 14:55:06. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 14:55
I. 08/16 15:02:36. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 15:02
I. 08/16 15:02:36. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 15:02
I. 08/16 15:05:29. Starting checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 15:05
I. 08/16 15:05:29. Finished checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 15:05
I. 08/16 15:10:51. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 15:10
I. 08/16 15:10:51. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 15:10
I. 08/16 15:19:28. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 15:19
I. 08/16 15:19:28. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 15:19
I. 08/16 15:25:31. Starting checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 15:25
I. 08/16 15:25:31. Finished checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 15:25
I. 08/16 15:30:18. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 15:30
I. 08/16 15:30:18. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 15:30
I. 08/16 15:45:32. Starting checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 15:45
I. 08/16 15:45:32. Finished checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 15:45
I. 08/16 15:45:46. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 15:45
I. 08/16 15:45:47. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 15:45
I. 08/16 15:56:01. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 15:56
I. 08/16 15:56:01. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 15:56
I. 08/16 16:05:34. Starting checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 16:05
I. 08/16 16:05:34. Finished checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 16:05
I. 08/16 16:06:30. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 16:06
I. 08/16 16:06:31. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 16:06
I. 08/16 16:12:40. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 16:12
I. 08/16 16:12:40. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 16:12
I. 08/16 16:20:33. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 16:20
I. 08/16 16:20:33. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 16:20
I. 08/16 16:25:36. Starting checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 16:25
I. 08/16 16:25:36. Finished checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 16:25
I. 08/16 16:31:46. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 16:31
I. 08/16 16:31:47. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 16:31
I. 08/16 16:43:57. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 16:43
I. 08/16 16:43:57. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 16:43
I. 08/16 16:44:01. Starting checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 16:44
I. 08/16 16:44:01. Finished checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 16:44
I. 08/16 16:44:01. Starting checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 16:44
I. 08/16 16:44:01. Finished checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 16:44
I. 08/16 16:44:01. Starting checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 16:44
I. 08/16 16:44:01. Finished checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 16:44
I. 08/16 16:44:01. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 16:44
I. 08/16 16:44:01. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 16:44
I. 08/16 16:44:03. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 16:44
I. 08/16 16:44:03. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 16:44
I. 08/16 16:44:03. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 16:44
I. 08/16 16:44:03. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 16:44
I. 08/16 16:57:17. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 16:57
I. 08/16 16:57:17. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 16:57
I. 08/16 17:04:03. Starting checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 17:04
I. 08/16 17:04:03. Finished checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 17:04
I. 08/16 17:12:53. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 17:12
I. 08/16 17:12:54. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 17:12
I. 08/16 17:24:05. Starting checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 17:24
I. 08/16 17:24:05. Finished checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 17:24
I. 08/16 17:26:43. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 17:26
I. 08/16 17:26:43. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 17:26
I. 08/16 17:31:04. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 17:31
I. 08/16 17:31:05. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 17:31
I. 08/16 17:42:24. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 17:42
I. 08/16 17:42:24. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 17:42
I. 08/16 17:44:06. Starting checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 17:44
I. 08/16 17:44:06. Finished checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 17:44
I. 08/16 17:56:07. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 17:56
I. 08/16 17:56:07. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 17:56
I. 08/16 18:04:08. Starting checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 18:04
I. 08/16 18:04:08. Finished checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 18:04
I. 08/16 18:16:52. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 18:16
I. 08/16 18:16:53. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 18:16
I. 08/16 18:24:10. Starting checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 18:24
I. 08/16 18:24:10. Finished checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 18:24
I. 08/16 18:33:16. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 18:33
I. 08/16 18:33:16. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 18:33
I. 08/16 18:43:54. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 18:43
I. 08/16 18:43:54. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 18:43
I. 08/16 18:44:12. Starting checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 18:44
I. 08/16 18:44:12. Finished checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 18:44
I. 08/16 18:55:13. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 18:55
I. 08/16 18:55:13. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 18:55
I. 08/16 19:04:14. Starting checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 19:04
I. 08/16 19:04:14. Finished checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 19:04
I. 08/16 19:04:25. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 19:04
I. 08/16 19:04:25. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 19:04
I. 08/16 19:16:39. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 19:16
I. 08/16 19:16:39. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 19:16
I. 08/16 19:24:15. Starting checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 19:24
I. 08/16 19:24:15. Finished checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 19:24
I. 08/16 19:30:20. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 19:30
I. 08/16 19:30:20. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 19:30
I. 08/16 19:38:43. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 19:38
I. 08/16 19:38:43. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 19:38
I. 08/16 19:44:17. Starting checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 19:44
I. 08/16 19:44:17. Finished checkpoint of "NBAZDB" (NBAZDB.db) at Fri Aug 16 2013 19:44
I. 08/16 19:53:42. Starting checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 19:53
I. 08/16 19:53:42. Finished checkpoint of "NBDB" (NBDB.db) at Fri Aug 16 2013 19:53
ebrbsnp05 &gt;&gt;
&nbsp;
&nbsp;
Symantec suggested to increase the pfiles on the master and on the media servers...
changed pfiles to 16264 on the master &nbsp;(previous value was 8192) and to 8192 on all our media servers (old value was 2048)
&nbsp;
But the issue is still there :( no luck yet...
&nbsp;
Let me know which other log should I provide to get to the root cause? Thanks in advance...

Forum Discussion

NB_dbsrv consuming 100% of CPU

7 Replies

Related Content

backupexecmanagementservice.exe consumes all available RAM

Replication is consuming all bandwidth

Storage Crawler consuming high CPU

Re: Status Code 25: cannot connect on socket

system volume information consuming disk space

Recent Discussions

command: bperror

MS-SharePoint policy restore error (2804) .

How to restore a backup

How to configure RBAC

10 years old netbackup appliance database service down, ssl certification out date