khlow
17 years ago
Level 2
SFW HA 5.1 cluster doesn't work when the LLT heartbeats are disconnected
Hi All,
I have SFW HA 5.1 installed and almost fully configured, and we have now reached the UAT stage. One of our UAT tests simulates a failure of the LLT heartbeat interface and the public interface. When we disconnected the LLT heartbeat or the public interface, the cluster initially failed the service group over to the other node. However, while the service group was being brought online on that node, the VMDg and MountV resources both failed to come online, with the error "Agent is calling clean for resource because the resource is not up even after online completed." The partially online service group then hung, and a while later its status changed to faulted on the passive node.
I suspect this happens because the VMDg is still locked by the active node when the cable is disconnected. As a result, the service group cannot come online on the passive node, because the agent is unable to clean the VMDg and MountV resources held by the original node. Please advise. I appreciate it.