Forum Discussion

khemmerl's avatar
khemmerl
Level 5
14 years ago

How to recover from failed attempt to switch to a different node in cluster

Hello everyone.  I have a two node cluster to serve as an Oracle database server.  I have the Oracle binaries installed on disks local to each of the nodes (so they are outside the control of the Cluster Manager).  I have a diskgroup which is three 1TB LUNs from my SAN, six volumes on the diskgroup (u02 through u07), six mount points (/u02 through /u07), a database listener and the actual Oracle database.  I was able to successfully manually bring up these individual components and confirmed that the database was up an running.

I then tried a "Switch To" operation to see if everything would mode to the other node of the cluster.  It turns out this was a bad idea.  Within the Cluster Manager gui, the diskgroup has a state of Online, Istate of "Waiting to go offline propogate" and Flag of "Unable to offline".  The volumes show as "Offline on all systems" but the mounts still show as online with "Status Unknown".  When I try to take the mount points offline, I get the message "VCS ERROR V-16-1-10277  The Service Group i1025prd to which Resource Mnt_scratch belongs has failed or switch, online, and offline operations are prohibited."

Can anyone tell me how I can fix this?

Ken

  • Ken,

    Sorry I forgot "-sys" of hagrp -flush command, but the error did tell you what was wrong and the correct syntax:

     

    VCS WARNING V-16-1-10691 Must specify system name
    VCS INFO V-16-1-10601 Usage:
    hagrp -flush <group> [-force] -sys <system> [-clus <cluster> | -localclus]

     

     

     

    Stuff in square brackets is optional so mandatory args are:

    hagrp -flush <group> -sys <system>

     

    You can see from main.cf which objects are groups which are ClusterService and i1025prd, so you cannot use Ora_DiskGroup_Data because this, as you said in your email, is a resource, not a group.  Flush is stopping VCS taking further action so that stuff that is waiting to offline will not continue once diskgroup resource is deleted

    Mike