Cluster Failover: Clearing VCS for Automatic Failover
Steps:
1. Fail over the cluster from, say, App1 to App2 by shutting down App1.
2. Bring the App1 server back up, then fail over from App2 back to App1.
When this scenario is repeated several times, I observe that at some point after a failover all the services are offline and the system doesn't come up.
How can I check the state of the other box once it has been brought back up, and clear the error so that automatic failover can occur again?
-Matt-
When you bring a system down, the service group fails over and then gets autodisabled. When the downed system comes back, VCS is the very last thing to start (i.e. its rc script is S99). When it starts, it probes all the resources, which can take a minute, and it autoenables the service group once all resources have been probed successfully.
So your issue is probably that you are downing the box too soon after it comes up, before the probes have finished. Check with "hastatus -sum" that probing is complete before you down the box again.
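If you want to script that check rather than eyeball it, something like the following rough Python sketch could work. It assumes the group rows in the "hastatus -sum" output start with "B" and carry the columns Group, System, Probed, AutoDisabled and State in that order, so verify that against the output on your own version first. The function names (parse_group_state, safe_to_fail_over, wait_until_safe) and the timeout values are made up for illustration and are not part of VCS.

```python
import subprocess
import time


def parse_group_state(summary):
    """Return one dict per group-state row found in `hastatus -sum` output."""
    rows = []
    for line in summary.splitlines():
        fields = line.split()
        # Assumed row layout (check against your VCS version):
        # B  <group>  <system>  <probed Y/N>  <autodisabled Y/N>  <state>
        if len(fields) >= 6 and fields[0] == "B":
            rows.append({
                "group": fields[1],
                "system": fields[2],
                "probed": fields[3] == "Y",
                "autodisabled": fields[4] == "Y",
                "state": fields[5],
            })
    return rows


def safe_to_fail_over(summary):
    """True only if every group row is probed and not autodisabled."""
    rows = parse_group_state(summary)
    # No group rows at all (e.g. VCS not started yet) counts as "not safe".
    return bool(rows) and all(
        r["probed"] and not r["autodisabled"] for r in rows
    )


def wait_until_safe(timeout=300, interval=10):
    """Poll `hastatus -sum` until probing is done or the timeout expires."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        result = subprocess.run(
            ["hastatus", "-sum"], capture_output=True, text=True
        )
        if safe_to_fail_over(result.stdout):
            return True
        time.sleep(interval)
    return False
```

Calling wait_until_safe() on a cluster node before shutting the other box down should return True once every group shows Probed = Y and AutoDisabled = N. If a group stays autodisabled long after the probes should have finished, "hagrp -autoenable <group> -sys <system>" is the usual way to clear the flag by hand, and "hagrp -clear <group> -sys <system>" clears a faulted group, but check the syntax in the documentation for your release.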
Mike