Showing results for 
Search instead for 
Did you mean: 

nbrmms keeps crashing and restarting after upgrade

Level 3

I upgraded NBU from 7.7.3 to 8.1.1, and all appeared to be fine, apart from nbrmms which crashes and restarts, around twice a minute, generating a 200Mb core file each time.

I see the following 2 errors: -

Apr 11 08:21:15 abrt[6613]: Saved core dump of pid 6612 (/usr/openv/netbackup/bin/admincmd/bpstsinfo) to /var/spool/abrt/ccpp-2018-04-11-08:21:15-6612 (23531520 bytes)
Apr 11 08:21:15  abrtd: Directory 'ccpp-2018-04-11-08:21:15-6612' creation detected
Apr 11 08:21:15  abrtd: Package 'VRTSnetbp' isn't signed with proper key
Apr 11 08:21:15  abrtd: 'post-create' on '/var/spool/abrt/ccpp-2018-04-11-08:21:15-6612' exited with 1
Apr 11 08:21:15  abrtd: Deleting problem directory '/var/spool/abrt/ccpp-2018-04-11-08:21:15-6612'
Apr 11 08:21:55 kernel: nbrmms[10758]: segfault at 272 ip 00007f8dbc666301 sp 00007f8db7fd5c50 error 4 in[7f8dbc5bc000+1c8000]



Detail: <fault xsi:type="ns1:STSException"><ns1:errorCode>2060029</ns1:errorCode></fault>

HANDLE_SOAP_FAULT: 2060029,46:Network_NTAP:brlvsnapmp01.bluecrest.local:6775,1

0,51216,395,222,20325,1523316448496,63513,140237748373248,0:,132:libsts opensvh() 18/04/10 00:27:28: v11_open_server failed in plugin /usr/openv/lib/ost-plugins/ err 2060029,46:Network_NTAP:<Snap Storage Server>:6775,1

0,51216,395,222,20326,1523316448496,63513,140237748373248,0:,100:[6775] failed to open a server handle for Network_NTAP:brlvsnapmp01.bluecrest.local returned 2060029,23:STSEventSupplier::run(),1

0,51216,221,222,20887,1523316458309,63513,140239406364416,0:,107:: load_avg_short=0.00125, load_avg_long=0.0025, avg_freemem_short=2.98328e+06, avg_freemem_long=2.93784e+06,43:MediaPerformanceTimer::calculateStateChange,1

0,51216,221,222,20888,1523316458310,63513,140239406364416,0:,39:short_state=3, long_state=3, m_state= 3,43:MediaPerformanceTimer::calculateStateChange,1

0,51216,220,222,13675,1523316502697,63513,140239364339456,0:,60:7f8c057eb700:  workInfo::doWork: can't connect to server, -1,16:workInfo::doWork,1

0,51216,220,222,13676,1523316502697,63513,140239364339456,0:,62:7f8c057eb700:  workInfo::doWork: error'ing all pending volumes,16:workInfo::doWork,1

0,51216,395,222,20327,1523316508393,63513,140237748373248,0:,184:SOAP 1.1 fault: SOAP-ENV:Client[no subcode]

"authorization failure".

In the highlighted bit, that is our DFM Windows Server, which maintains local snapshots for user based restores from NetApp.

 All licenses are installed. However, I noticed that the upgrade has changed the name of the master from an alias that we used to use, to just it's hostname (guess to do with security certificate generation). 2 licenses are under the alias, 1 is under the hostname.

I do have a call open with Veritas, but nothing has been determined yet.

Cannot find any reference to this on any forum, or search.

Any ideas ?.