06-25-2014 03:45 PM
Hi
My NetBackup lab environment is running 7.1.0.2 in a VCS cluster. Somehow /usr/openv was deleted from NodeA. I can still bring NetBackup up on NodeB, but the cluster configuration shows an error state.
hastatus -sum
-- SYSTEM STATE
-- System          State      Frozen
A  NodeA           RUNNING    0
A  NodeB           RUNNING    0
-- GROUP STATE
-- Group           System     Probed   AutoDisabled   State
B  ClusterService  NodeA      Y        N              OFFLINE
B  ClusterService  NodeB      Y        N              ONLINE
B  nbu_group       NodeA      N        N              OFFLINE
B  nbu_group       NodeB      Y        N              ONLINE
-- RESOURCES NOT PROBED
-- Group           Type       Resource     System
E  nbu_group       NetBackup  nbu_server   NodeA
-- AGENTS FAILED
-- Type            System
J  NetBackup       NodeA
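For a longer summary, the problem rows can be filtered out by their line prefix. This is only a minimal sketch: it assumes the prefix letters match the output above (E for resources not probed, J for failed agents), which may differ between VCS versions, and the function name is hypothetical.

```shell
# Hypothetical helper: reads `hastatus -sum` output on stdin and prints
# only the "resources not probed" (E) and "agents failed" (J) rows.
# Assumption: the prefix letters are the ones seen in the output above.
vcs_problem_rows() {
    awk '$1 == "E" || $1 == "J" { print }'
}

# Usage on a cluster node (not run here):
#   hastatus -sum | vcs_problem_rows
```

Header rows start with "--" and are skipped, so only the affected group, resource and system names remain.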
After taking nbu_group offline, I tried to reinstall NetBackup on NodeA, but the installer does not recognize the cluster and installs a standalone NetBackup server. For now I have removed the SYMC* packages from NodeA.
06-25-2014 04:34 PM
Hello,
If /usr/openv was damaged or deleted, follow the "How to recover a NetBackup Veritas Cluster Server (VCS) node after system failure" guide.
When you installed NetBackup, did you stop the node using /opt/VRTSvcs/bin/hastop -sys <system_name> -force? Also, are you using the virtual name when prompted to enter the name of the server?
06-25-2014 08:54 PM
Hi Terry,
I tried the steps in the link you provided, but they didn't work. To start from scratch, I removed nbu_group from the cluster, removed all the SYMC* packages and deleted everything from /usr/openv. I tried to reinstall, but the installer does not prompt for the cluster and installs as standalone.
NodeA and NodeB are made-up names. The actual node names are usa0300lv963 and usa0300lv964.
Cluster configuration
-- SYSTEM STATE
-- System          State          Frozen
A  usa0300lv963    RUNNING        0
A  usa0300lv964    RUNNING        0
-- GROUP STATE
-- Group           System         Probed   AutoDisabled   State
B  ClusterService  usa0300lv963   Y        N              ONLINE
B  ClusterService  usa0300lv964   Y        N              OFFLINE
B  aptareSG        usa0300lv963   Y        N              OFFLINE
B  aptareSG        usa0300lv964   Y        N              OFFLINE
Main.cf
include "OracleASMTypes.cf"
include "types.cf"
include "Db2udbTypes.cf"
include "OracleTypes.cf"
include "SybaseTypes.cf"
cluster ClusterService (
    UserNames = { admin = bIJbIDiFJeJJhRJdIG }
    ClusterAddress = "13.120.207.103"
    Administrators = { admin }
    UseFence = SCSI3
    )

system usa0300lv963 (
    )

system usa0300lv964 (
    )

group ClusterService (
    SystemList = { usa0300lv963 = 0, usa0300lv964 = 1 }
    AutoStartList = { usa0300lv963, usa0300lv964 }
    OnlineRetryLimit = 3
    OnlineRetryInterval = 120
    )

    IP webip (
        Device = eth0
        Address = "13.120.207.103"
        NetMask = "255.255.255.0"
        )

    NIC csgnic (
        Device = eth0
        )

    webip requires csgnic

    // resource dependency tree
    //
    //     group ClusterService
    //     {
    //     IP webip
    //         {
    //         NIC csgnic
    //         }
    //     }

group aptareSG (
    SystemList = { usa0300lv963 = 1, usa0300lv964 = 0 }
    AutoStartList = { usa0300lv963, usa0300lv964 }
    )

    Application aptareApp (
        StartProgram = "/opt/aptare/mbs/bin/aptare_agent start"
        StopProgram = "/opt/aptare/mbs/bin/aptare_agent stop"
        PidFiles = { "/opt/aptare/mbs/logs/watchdog.pid",
             "/opt/aptare/mbs/logs/metadatacollector.pid" }
        )

    // resource dependency tree
    //
    //     group aptareSG
    //     {
    //     Application aptareApp
    //     }
07-04-2014 02:12 AM
You should not have removed nbu_group from the cluster config. At most, NodeA should have been removed from the SG.
In the past, I have been able to remove a cluster node as follows:
# cd /usr/openv/netbackup/bin/cluster/util
# ./cluster_drop_node -s nbu -n <drop_node_name>
This removes the node from the VCS config as well as from EMM.
To reinstall and add NodeA, NBU should be up and running on NodeB with nbu_group online.
gabconfig and hastatus must show VCS up and running on both nodes.
Ensure rsh is enabled between the two nodes, and that the virtual IP and hostname are in /etc/hosts on both nodes.
Install NBU on NodeA.
When the virtual hostname is entered as the master server name, the installation script will detect the current cluster config and add NodeA to the cluster and EMM.
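The prerequisites above can be checked on each node before running the installer. The following is only a rough sketch: hosts_has_name is a hypothetical helper, and nbu-virtual is a placeholder for the real virtual hostname, which is not given in this thread.

```shell
# Hypothetical helper: succeeds if the given hostname appears as a name
# field (any field after the address) on a non-comment line of a
# hosts-format file. Defaults to /etc/hosts.
hosts_has_name() {
    name=$1
    file=${2:-/etc/hosts}
    awk -v n="$name" '
        /^[[:space:]]*#/ { next }          # skip full-line comments
        {
            for (i = 2; i <= NF; i++) {
                if ($i ~ /^#/) break       # stop at a trailing comment
                if ($i == n) found = 1
            }
        }
        END { exit !found }
    ' "$file"
}

# Pre-checks to run on both nodes (not run here).
# "nbu-virtual" is a placeholder, not a name from this thread.
#   gabconfig -a      # GAB port membership should include both nodes
#   hastatus -sum     # both systems RUNNING, nbu_group ONLINE on NodeB
#   hosts_has_name nbu-virtual && echo "virtual name found in /etc/hosts"
```

The rsh check is left out here because how it is enabled depends on the OS; a simple remote command between the two nodes is usually enough to confirm it.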