Query regarding one vcs node down
Hi Team, Please suggest, if I have 4 nodes and due to some network issue suddenly all 4 nodes down. 3 nodes came up but 1 node is not coming up. So what would happen to vcs services.Does that run automatically or need to run it manually. Please share the step by step activity. Thanks. AllaboutunixSolved1.3KViews0likes7CommentsJeopardy state Query
Hi Team, I have a query,If I have a 2 node cluster set up( S1 -S2)and system S2 went into the state of Jeopardywhat would be the impact on the service group if a) SG has no critical resources setup b) SG has critical resource set up What would be the steps to rectify the this Jeopardy state. Please guide. Thanks..Solved1.7KViews1like2CommentsResource faults issue
Hi Team, Many times we face resource faults alerts on our AIX server,but when we check it shows all are running fine. Also, when checked logs it shows not (initiated by VCS) alerts. Our main concern is to troubleshoot, why we get these alerts and if we get these alerts then why cluster resource is not showing faulty. Below are the logs, -- SYSTEM STATE -- System State Frozen A xxxibm012 RUNNING 0 A xxxibm014 RUNNING 0 -- GROUP STATE -- Group System Probed AutoDisabled State B ClusterService xxxibm012 Y N ONLINE B ClusterService xxxibm014 Y N OFFLINE B DB_INSIGHT_STAGE xxxibm012 Y N ONLINE B DB_INSIGHT_STAGE xxxibm014 Y N OFFLINE ============================================================= 2015/04/21 10:14:53 VCS INFO V-16-1-53504 VCS Engine Alive message!! 2015/04/21 12:57:32 VCS WARNING V-16-10011-5611 (clnibm014) NIC:csgnic:monitor:Second PingTest failed for Virtual Interface en4. Resource is OFFLINE 2015/04/21 12:57:32 VCS INFO V-16-1-50135 User root fired command: hares -modify csgnic ConfidenceMsg from localhost 2015/04/21 12:57:33 VCS ERROR V-16-1-54031 Resource csgnic (Owner: Unspecified, Group: ClusterService) is FAULTED on sys clnibm014 2015/04/21 12:57:33 VCS INFO V-16-6-0 (clnibm014) resfault:(resfault) Invoked with arg0=clnibm014, arg1=csgnic, arg2=ONLINE 2015/04/21 12:57:49 VCS INFO V-16-6-15002 (clnibm014) hatrigger:hatrigger executed /opt/VRTSvcs/bin/triggers/resfault clnibm014 csgnic ONLINE successfully 2015/04/21 12:58:18 VCS ERROR V-16-1-54031 Resource proxy_DB_INSPRD (Owner: Unspecified, Group: DB_INSIGHT_STAGE) is FAULTED on sys clnibm014 2015/04/21 12:58:18 VCS INFO V-16-6-0 (clnibm014) resfault:(resfault) Invoked with arg0=clnibm014, arg1=proxy_DB_INSPRD, arg2=ONLINE 2015/04/21 12:58:29 VCS INFO V-16-6-15002 (clnibm014) hatrigger:hatrigger executed /opt/VRTSvcs/bin/triggers/resfault clnibm014 proxy_DB_INSPRD ONLINE successfully 2015/04/21 12:58:33 VCS INFO V-16-1-50135 User root fired command: hares -modify csgnic ConfidenceMsg Primary test to confirm Online status succeeded. from localhost 2015/04/21 12:58:34 VCS INFO V-16-1-10299 Resource csgnic (Owner: Unspecified, Group: ClusterService) is online on clnibm014 (Not initiated by VCS) 2015/04/21 12:58:34 VCS NOTICE V-16-1-10233 Clearing Restart attribute for group ClusterService on all nodes 2015/04/21 12:58:34 VCS NOTICE V-16-1-51034 Failover group ClusterService is already active. Ignoring Restart 2015/04/21 12:59:18 VCS INFO V-16-1-10299 Resource proxy_DB_INSPRD (Owner: Unspecified, Group: DB_INSIGHT_STAGE) is online on clnibm014 (Not initiated by VCS) 2015/04/21 12:59:18 VCS NOTICE V-16-1-10233 Clearing Restart attribute for group DB_INSIGHT_STAGE on all nodes 2015/04/21 12:59:18 VCS NOTICE V-16-1-51034 Failover group DB_INSIGHT_STAGE is already active. Ignoring Restart 2015/04/21 12:59:34 VCS INFO V-16-1-50135 User root fired command: hares -modify csgnic ConfidenceMsg Primary test to confirm Online status succeeded. from localhost 2015/04/21 13:18:53 VCS INFO V-16-1-50135 User root fired command: hares -modify csgnic ConfidenceMsg Relying on secondary test to confirm Online status. from localhost 2015/04/21 13:19:34 VCS INFO V-16-1-50135 User root fired command: hares -modify csgnic ConfidenceMsg Primary test to confirm Online status succeeded. from localhost 2015/04/21 13:44:49 VCS INFO V-16-1-50135 User root fired command: hares -modify csgnic ConfidenceMsg Relying on secondary test to confirm Online status. from localhost 2015/04/21 13:45:34 VCS INFO V-16-1-50135 User root fired command: hares -modify csgnic ConfidenceMsg Primary test to confirm Online status succeeded. from localhost 2015/04/21 14:14:54 VCS INFO V-16-1-53504 VCS Engine Alive message!! 2015/04/21 16:48:59 VCS INFO V-16-1-50135 User root fired command: hares -modify csgnic ConfidenceMsg Relying on secondary test to confirm Online status. from localhost 2015/04/21 16:49:34 VCS INFO V-16-1-50135 User root fired command: hares -modify csgnic ConfidenceMsg Primary test to confirm Online status succeeded.Solved4.8KViews1like9CommentsDIMM replacement on VCS active node
Hi Team, We have to replace DIMM on the activenode in which VCS services is currently running. Kindly guide step by step procedure for this activity,also please suggest the prerequisities before starting this activity. This is very very crucial activity as Class A application is running on the active node. Do wemove the SG to passive node(Jupiter) and then perform theactivity so that apps will be online.What will be its steps do to it. Currently, SG is running in Venusserver.in whichWe have to perform activity. -- SYSTEM STATE -- System State Frozen A Venus RUNNING 0 A Jupiter RUNNING 0 -- GROUP STATE -- Group System Probed AutoDisabled State B ClusterService Venus Y N ONLINE B ClusterService Jupiter Y N OFFLINE B ORA_SG_Group Venus Y N ONLINE B ORA_SG_Group Jupiter Y N OFFLINE Kindly suggest as soon as possible.Solved1.5KViews0likes3CommentsPlan for DIMM replacement (Both cluster system would be down which is mandatory)
Hi Team, We have to replacement DIMM on the passive node in which VCS services is currently not running.The cross over cables LLT is hanged in such a way that we need to move both boxes (Active and passive nodes).It requires downtime because we will also have to shutdown active node. We are now asking apps team for downtime for it. Could you please suggest,the plan to proceed for this activity when we have to shutdown both nodes for DIMM replacement. Currently, SG is running in Polar server. -- SYSTEM STATE -- System State Frozen A polar RUNNING 0 A summer RUNNING 0 -- GROUP STATE -- Group System Probed AutoDisabled State B ClusterService polar Y N ONLINE B ClusterService summer Y N OFFLINE B ORA_SG_Group polar Y N ONLINE B ORA_SG_Group summer Y N OFFLINE This is bit urgent so require your suggestions as soon as possible.Solved957Views0likes3CommentsRegarding resource online operations
Hi, Suppose, a resouce got faulted in a SG and I need to make it online. Shall I have to do in this way? hagrp -clear <group> [-sys] <host> <sys> and then online the service group hagrp -online <group> -sys <sys> OR by this way? 1.Flush the SG, hagrp -flush <group> -sys <system> 2.Clear the faulted resource hares -clear <resource> [-sys] 3.Online it, hares -online <resource> [-sys] Do flushing of the SG required? Kindly assist how should i have to proceed for these cases.Solved4.8KViews2likes3CommentsMulti nic concept in vcs
Team, I have gone through the doc to understand the functionalities of Multinic in vcs,however unable to get it completely. Could you please explain step by step about the Multinic and how to configure its resources? I have to give training in multinic concept theoritically and to implement it. Thanks.. AllaboutunixSolved1.6KViews1like2CommentsSG is not switching to next node.
Hi All, I am new to VCS but good in HACMP. In our environment we are using VCS-6.0, I one server we found that the SG is not moving from one node to another node when we tried manual failover using the bellow command. hagrp -switch <SGnamg> -to <sysname> We able to see that the SG is offline in the currnent node but it's not coming online in the secondary node. There is no error locked in engine_A.log except the bellow entry cpus load more than 60% <Secondary node name> Can anyone help me to find the solution for this. I will provide the output of any commands if you need more info to help me out to get this trouble shooted :) Thanks,Solved1.8KViews1like8Comments