Forum Discussion

twinkle_sapra1's avatar
11 years ago

Netbackup /usr/openv/ deleted from cluster node

Hi

My Netbackup LAB environment is running with 7.1.0.2  in VCS cluster. Somehow /usr/openv got delete from NodeA. I am able to up the Netbackup on NodeB. In cluster configuration it is showing with error state.

 

hastatus -sum

-- SYSTEM STATE
-- System               State                Frozen

A  NodeA         RUNNING              0
A  NodeB         RUNNING              0

-- GROUP STATE
-- Group           System               Probed     AutoDisabled    State

B  ClusterService  NodeA         Y          N               OFFLINE
B  ClusterService  NodeB         Y          N               ONLINE
B  nbu_group       NodeA         N          N               OFFLINE
B  nbu_group       NodeB         Y          N               ONLINE

-- RESOURCES NOT PROBED
-- Group           Type                 Resource             System

E  nbu_group       NetBackup            nbu_server           NodeA

-- AGENTS FAILED
-- Type            System

J  NetBackup       NodeA

After offline nbu_group , I tried to reinstall Netbackup on NodeA but it is not recognizing cluster and installing as standalone Netbackup Server. Currently I have removed SYMC* package from NodeA.

 

  • Hi Terry,

    I tried with provided link but it didn't work. To do it from scratch, I removed the nbu_group from cluster, removed all the SYMC* package and deleted everything from /usr/openv. I tried to reinstall but it is not prompting for cluster and installing as standalone.

    NODEA and NodeB are imaginary name. Actuall nodes name are usa0300lv963 and usa0300lv964

    Cluster configuration

     

    -- SYSTEM STATE
    -- System               State                Frozen

    A  usa0300lv963         RUNNING              0
    A  usa0300lv964         RUNNING              0

    -- GROUP STATE
    -- Group           System               Probed     AutoDisabled    State

    B  ClusterService  usa0300lv963         Y          N               ONLINE
    B  ClusterService  usa0300lv964         Y          N               OFFLINE
    B  aptareSG        usa0300lv963         Y          N               OFFLINE
    B  aptareSG        usa0300lv964         Y          N               OFFLINE

    Main.cf

    include "OracleASMTypes.cf"
    include "types.cf"
    include "Db2udbTypes.cf"
    include "OracleTypes.cf"
    include "SybaseTypes.cf"

    cluster ClusterService (
            UserNames = { admin = bIJbIDiFJeJJhRJdIG }
            ClusterAddress = "13.120.207.103"
            Administrators = { admin }
            UseFence = SCSI3
            )

    system usa0300lv963 (
            )

    system usa0300lv964 (
            )

    group ClusterService (
            SystemList = { usa0300lv963 = 0, usa0300lv964 = 1 }
            AutoStartList = { usa0300lv963, usa0300lv964 }
            OnlineRetryLimit = 3
            OnlineRetryInterval = 120
            )

            IP webip (
                    Device = eth0
                    Address = "13.120.207.103"
                    NetMask = "255.255.255.0"
                    )

            NIC csgnic (
                    Device = eth0
                    )

            webip requires csgnic


            // resource dependency tree
            //
            //      group ClusterService
            //      {
            //      IP webip
            //          {
            //          NIC csgnic
            //          }
            //      }


    group aptareSG (
            SystemList = { usa0300lv963 = 1, usa0300lv964 = 0 }
            AutoStartList = { usa0300lv963, usa0300lv964 }
            )

            Application aptareApp (
                    StartProgram = "/opt/aptare/mbs/bin/aptare_agent start"
                    StopProgram = "/opt/aptare/mbs/bin/aptare_agent stop"
                    PidFiles = { "/opt/aptare/mbs/logs/watchdog.pid",
                             "/opt/aptare/mbs/logs/metadatacollector.pid" }
                    )

     

            // resource dependency tree
            //
            //      group aptareSG
            //      {
            //      Application aptareApp
            //      }

     

  • You should not have removed nbu_group from the cluster config. Just maybe NodeA removed from the SG.

    In the past, I have been able to remove a cluster node as follows:
    cd /usr/openv/netbackup/bin/cluster/util
    # ./cluster_drop_node -s nbu -n <drop_node_name>

    This removes the node from VCS config as well as from EMM.

    To reinstall and add NodeA, NBU should be up and running on NodeB with nbu_group online.

    gabconfig and hastatus must show VCS up and running on both nodes.

    Ensure rsh is enabled between the 2 nodes, Virtual IP and hostname in /etc/hosts on both nodes.
    Install NBU on NodeA.
    When virtual hostname is entered as Master name, the installation script will detect current cluster config and add NodeA to the cluster and EMM.