Cluster service group failed to failover

IT_guys
Level 4

Dear All,

I have encountered a problem: Windows Server 2003 failed to fail over the service group from the active node to the passive node when the active node stopped responding.

After node 1 failed over to node 2, I logged on to node 2 and opened the cluster administration console, but I was not able to see the service group.
My colleagues rebooted node 1, and after that I could see all the service groups on node 2 and the services could start.

Please advise on the root cause, or on which logs to check for this issue.

 Thanks 
 

2 ACCEPTED SOLUTIONS

Marianne
Moderator
Partner    VIP    Accredited Certified

So, you don't have a VCS issue.

You have an issue with insufficient physical resources on the servers.

You need to bring this to your management's attention.

Marianne
Moderator
Partner    VIP    Accredited Certified

You can start with the free online training over here:  https://www.veritas.com/elibrary/en.html

Manuals can be found here: 
https://sort.veritas.com/documents/doc_details/sfha/5.0/Windows/ProductGuides/

Unfortunately your version ran out of support some time ago.
Version 5.0 is the oldest version for which documentation is still available.

 

10 REPLIES

CraigV
Moderator
Partner    VIP    Accredited
...what product is this for? A product that Veritas makes...?

Hi CraigV

The VCS installation currently in use is Storage Foundation Cluster File System Enterprise, v4.1.

Marianne
Moderator
Partner    VIP    Accredited Certified

@IT_guys I have moved your post to the Cluster forum.

In order for us to try and assist you, you need to give us something to work with.

Background... error message... all relevant info, please.

Extract all text from engine_A log related to failover and save in engine.txt.

Please upload the .txt as attachment.
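
One way to do the extraction is sketched below, as a rough illustration only. It assumes the engine log is a plain-text file and that failover-related lines can be found by simple keyword matching. The keyword list and the function names are hypothetical, so adjust them to what your log actually contains.

```python
from pathlib import Path

# Hypothetical keywords (an assumption, not taken from any VCS documentation);
# adjust them to whatever your engine log actually prints.
KEYWORDS = ("failover", "fault", "offline", "online")


def extract_failover_lines(lines, keywords=KEYWORDS):
    """Return the lines that mention any keyword, case-insensitively."""
    lowered = tuple(k.lower() for k in keywords)
    return [line for line in lines if any(k in line.lower() for k in lowered)]


def save_extract(log_path, out_path="engine.txt"):
    """Filter the log at log_path and write the matching lines to out_path."""
    text = Path(log_path).read_text(errors="replace")
    matches = extract_failover_lines(text.splitlines())
    Path(out_path).write_text("\n".join(matches) + ("\n" if matches else ""))
    return len(matches)
```

Call save_extract with the path to the engine_A log on the node in question, then attach the resulting engine.txt. If keyword matching drops too much context, copying the whole time window around the failover by hand may be the safer option.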

Hi Marianne,

 

Thank you for your reply !

We found that the endpoint protection software was using a lot of resources, which triggered the failover, and Backup Exec was also unable to kill the process. After the failover to node 2, the Backup Exec and SFTP service groups failed to mount on Windows node 2. After node 1 was rebooted, all the service groups resumed on node 2 and the services started successfully.

However, I will gather more detailed information and post it to the cluster forum. Thank you again for your support.

Cheers,

Alfred

 

 

Marianne
Moderator
Partner    VIP    Accredited Certified

You really need to get us correct info.

I have no idea how 'end point protection' is able to trigger a failover and how Backup Exec is related.

What exactly are you clustering? Backup Exec?

Have you checked system logs and VCS engine_A log?

If your servers are running out of resources, you probably have an issue with outdated, overloaded servers that probably are out of maintenance as well, right? 

What exactly are you clustering? Backup Exec? 

The cluster is Active/Passive. According to the event log, the active node failed over to the passive node because the active node ran out of non-paged memory.

One of the service groups is Backup Exec.

Have you checked system logs and VCS engine_A log?

I will check it ASAP.

If your servers are running out of resources, you probably have an issue with outdated, overloaded servers that probably are out of maintenance as well, right? 

Yes, the active node ran out of resources, which triggered the failover. My colleague identified the cause of the failover from the memory usage problem recorded in the event log.

Thanks, 

 

 

Marianne
Moderator
Partner    VIP    Accredited Certified

So, you don't have a VCS issue.

You have an issue with insufficient physical resources on the servers.

You need to bring this to your management's attention.

Yes, I will inform them of this issue.

By the way, I would like to find some VCS documentation for beginners. Is there a website where I can download it? Thank you.

Marianne
Moderator
Partner    VIP    Accredited Certified

You can start with the free online training over here:  https://www.veritas.com/elibrary/en.html

Manuals can be found here: 
https://sort.veritas.com/documents/doc_details/sfha/5.0/Windows/ProductGuides/

Unfortunately your version ran out of support some time ago.
Version 5.0 is the oldest version for which documentation is still available.

 

Thank you for your help