sparmar
15 years agoLevel 3
VCS nodes keep rebooting
Hi
I wonder if you kind people can help me again.
I have a 3 node cluster on Sun x4240 servers, which I have installed VCS v5.0.
There are only about 9 service groups created on them which just have mounts and volumes, so no load on them.
The issue I am seeing is randomly one server in the cluster drops off the network and then I can't access it via the console as root.
This seems to happen for about 15 minutes then it fixes itself, then the other server does the same.
I have noticed that the heart beat connections go first.
My Cluster set up is:
Redhat 5.4 x86
VCS v5.0 RP3
Heartbeats on = eth1 and eth3 (100mb full duplex)
All the servers are built exactly the same with no variation.
has anyone come across this before?
Thanks
Sparmar
Just to let you all know, the issue was a faulty network card for one of the heart beats which has now been replaced.
Also there was an issue with a PCI card which connects via a fibre cable as a media server for Netbackup which seemed to hang the servers on reboot. (Keeps scanning down the lpfc)
So, it looks as if it was hardware related.
Many thanks for all the input in helping me get to some resolution.
Sparmar