LLT connections mismatch
My cluster was configured some time ago and had two dedicated and one low priority LLT links.
About a week ago I found that one of the dedicated links is down. It turned out that ports on the switch were disabled.
So, I asked to enable them and OS on both nodes started to see interfaces running. But the cluster still shows that one link is down.
Please find below the lltstat output from both nodes and llttab files from both nodes:
As you can see each node sees its own links but only two from the other node. I thought that restart of the cluster and perhaps the node will help, but not sure. I also did not want to restart aplication if it is not necessary.
My question is will it help to stop cluster with hastop -all -force command and then restart or reconfigure LLT links or I really have to stop everything, fix LLT links and then start the app. Only one node of the cluster is active the other does not have any application services running.
node-ora01:root ~ # lltstat -n
LLT node information:
Node State Links
* 0 node-ora01 OPEN 3
1 node-ora02 OPEN 2
node-ora01:root ~ # cat /etc/llttab
set-node node-ora01
set-cluster 3415
link eth1 eth-9c:8e:99:fa:21:0a - ether - -
link eth3 eth-9c:8e:99:fa:21:0e - ether - -
link-lowpri bond0 bond0 - ether - -
node-ora02:root ~ # lltstat -n
LLT node information:
Node State Links
0 node-ora01 OPEN 2
* 1 node-ora02 OPEN 3
node-ora02:root ~ # cat /etc/llttab
set-node node-ora02
set-cluster 3415
link eth1 eth-9c:8e:99:f9:ec:bc - ether - -
link eth3 eth-9c:8e:99:f9:ec:c0 - ether - -
link-lowpri bond0 bond0 - ether - -
I did get down time for the cluster but it turned out that something wrong with ports on the switch or cables.
So far only node itself sees NIC is up and running, other node does not.
I also found how to enable or disable LLT link without restarting the whole stack or the cluster:
lltconfig -u eth3
lltconfig -t eth3 -d eth3
Thank you all.
Leonid