05-06-2013 06:43 PM
Hi All,
Our Netbackup 7.5.0.4 running on RedHat Linux 6.1 has been crashed two times whithin 1 week, after stable for quite a long time. What should I check? Any experience on this? (I also log the case to Symantec but it is been a little slow.
Regards,
Iwan Tamimii
05-06-2013 07:42 PM
It is the NBU crashed or the OS itself crashed. If OS is crashing then there is nothing NBU or Symantec can do. If you are saying that it is NBU crashing then please tell us more about it. What is the error message it gives,what is the behaviour, your system logs, is anything recently changed which might have caused this. Normally on windows we check evient viewer if application crashes not sure on Linux but check the system logs which might tell what is crashing.
05-06-2013 09:33 PM
Given the new EEB support policy I would strongly suggest to reproduce this behavior using 7.5.0.5
Even if support find a bug in code, Symantec will provide EEB for the latest version only :(
05-06-2013 10:32 PM
I agree with sazz - we need to know what exactly is crashing.
Error messages, /var/log/messages, screenshots, processes running / not running will be helpful.
05-07-2013 02:12 AM
Define "crash"
System crash?
NetBackup hung?
Processes not running? If so, which ones?
As mentioned system messages is a good place to look. Also ensure core logging is enabled at OS level so if any processes are core dumping we can tell which process/es
05-13-2013 10:01 PM
Thank you for the responses, I am so sorry for the late reply. The crashed was only the Netbackup process the system/server (which is RedHat 6.1) was running fine. Actually I brought up to the Symantec we found 2 coredump files but according to them the coredumps didn't tell anything. So I wait for another crash for the coredump.
The second hung because it is in the cluster the service nbu was autorestart.
Anything to check?
Regards,
Iwan
05-13-2013 10:35 PM
As per my previous post:
Error messages, /var/log/messages, screenshots, processes running / not running will be helpful.
We need to know which NBU processes crashed so that we can tell you which NBU logs are needed.
Cluster restart is a result of NBU processes not running or hanging.
Seems your previous issue is also not solved then?
https://www-secure.symantec.com/connect/forums/netbackup-7504-rhel-61-keep-crashing
05-13-2013 11:20 PM
You need to capture.
The core file itself
A copy of the binary (eg, if bptm crashed, you want the bptm binary)
The system messages log
The log that matches the binary that crashed
Debug output from the core file by following this TN.
http://www.symantec.com/docs/TECH52289
This has to be run from your system, any other system will have different library files, and will give the incorrect results.
nbsu -c -t output
It is true, the core file alone is no use, most importantly we need the debug info as explained.
For whatever process it is, please turn the verbose level up, either to 5 if legacy log of Debug /Diag 6 if unified log. If process is nbrb, only turn Debug upto 4.
To set logs :
https://www-secure.symantec.com/connect/articles/quick-guide-setting-logs-netbackup
Thanks,
Martin