VCS Switch is taking up to 10 min.
I have two servers running SFW HA 5.1 SP2 on a Windows Server 2008 R2 platform. I created the disk groups through VEA, then created my cluster and service group in VCS. When the servers were first set up, the switch took about 30 seconds. Currently it takes about 10 minutes. I have the same setup in a testbed that is having no problems; the only difference is that production has users and data running through it. Is there a way to see where the real hang-up is coming from? The logs just show me time gaps, with no real detail about what happened during them.
Jun 7, 2012 4:02:23 AM |
V-16-1-50135 User admin fired command: hagrp -switch SG_AvMail MHSWS001ANP-2 localclus from 192.168.249.203 |
Jun 7, 2012 4:02:23 AM |
V-16-1-10208 Initiating switch of group SG_AvMail from system MHSWS001ANP-1 to system MHSWS001ANP-2 |
Jun 7, 2012 4:02:23 AM |
V-16-1-10300 Initiating Offline of Resource AviNetConnector (Owner: unknown, Group: SG_AvMail) on System MHSWS001ANP-1 |
|
Jun 7, 2012 4:02:23 AM |
V-16-1-10300 Initiating Offline of Resource AvMailInterchange (Owner: unknown, Group: SG_AvMail) on System MHSWS001ANP-1 |
|
Jun 7, 2012 4:02:29 AM |
V-16-1-10305 Resource AviNetConnector (Owner: unknown, Group: SG_AvMail) is offline on MHSWS001ANP-1 (VCS initiated) |
|
Jun 7, 2012 4:02:34 AM |
V-16-2-13064 (MHSWS001ANP-1) Agent is calling clean for resource(AvMailInterchange) because the resource is up even after offline completed. |
|
Jun 7, 2012 4:02:35 AM |
V-16-2-13069 (MHSWS001ANP-1) Resource(AvMailInterchange) - clean failed. |
|
Jun 7, 2012 4:02:35 AM |
V-16-2-13069 (MHSWS001ANP-1) Resource(AvMailInterchange) - clean failed. |
|
Jun 7, 2012 4:03:34 AM |
V-16-1-10305 Resource AvMailInterchange (Owner: unknown, Group: SG_AvMail) is offline on MHSWS001ANP-1 (VCS initiated) |
|
Jun 7, 2012 4:03:34 AM |
V-16-1-10300 Initiating Offline of Resource AGN_IP (Owner: unknown, Group: SG_AvMail) on System MHSWS001ANP-1 |
|
Jun 7, 2012 4:03:34 AM |
V-16-1-10300 Initiating Offline of Resource Volume (Owner: unknown, Group: SG_AvMail) on System MHSWS001ANP-1 |
|
Jun 7, 2012 4:03:35 AM |
V-16-1-10305 Resource AGN_IP (Owner: unknown, Group: SG_AvMail) is offline on MHSWS001ANP-1 (VCS initiated) |
|
Jun 7, 2012 4:04:23 AM |
V-16-1-10305 Resource Volume (Owner: unknown, Group: SG_AvMail) is offline on MHSWS001ANP-1 (VCS initiated) |
|
Jun 7, 2012 4:04:23 AM |
V-16-1-10300 Initiating Offline of Resource DiskGroup (Owner: unknown, Group: SG_AvMail) on System MHSWS001ANP-1 |
|
Jun 7, 2012 4:09:37 AM |
V-16-1-50135 User admin fired command: MSG_RES_PROBE DiskGroup MHSWS001ANP-1 from 192.168.249.203 |
|
Jun 7, 2012 4:09:50 AM |
V-16-1-10305 Resource DiskGroup (Owner: unknown, Group: SG_AvMail) is offline on MHSWS001ANP-1 (VCS initiated) |
|
Jun 7, 2012 4:09:50 AM |
V-16-1-10446 Group SG_AvMail is offline on system MHSWS001ANP-1 |
|
Jun 7, 2012 4:09:50 AM |
V-16-1-10301 Initiating Online of Resource AGN_IP (Owner: unknown, Group: SG_AvMail) on System MHSWS001ANP-2 |
|
Jun 7, 2012 4:09:50 AM |
V-16-1-10301 Initiating Online of Resource DiskGroup (Owner: unknown, Group: SG_AvMail) on System MHSWS001ANP-2 |
Jun 7, 2012 4:09:51 AM |
V-16-1-10306 Resource DiskGroup (Owner: unknown, Group: SG_AvMail) is offline on MHSWS001ANP-1 (Previous State = OFFLINE) |
Jun 7, 2012 4:09:51 AM |
V-16-1-10298 Resource AGN_IP (Owner: unknown, Group: SG_AvMail) is online on MHSWS001ANP-2 (VCS initiated) |
Jun 7, 2012 4:09:52 AM |
V-16-6-15004 (MHSWS001ANP-1) hatrigger:Failed to send trigger for postoffline; script doesn't exist |
Jun 7, 2012 4:11:50 AM |
V-16-1-10298 Resource DiskGroup (Owner: unknown, Group: SG_AvMail) is online on MHSWS001ANP-2 (VCS initiated) |
Jun 7, 2012 4:11:50 AM |
V-16-1-10301 Initiating Online of Resource Volume (Owner: unknown, Group: SG_AvMail) on System MHSWS001ANP-2 |
Jun 7, 2012 4:12:01 AM |
V-16-1-10298 Resource Volume (Owner: unknown, Group: SG_AvMail) is online on MHSWS001ANP-2 (VCS initiated) |
Jun 7, 2012 4:12:01 AM |
V-16-1-10301 Initiating Online of Resource AvMailInterchange (Owner: unknown, Group: SG_AvMail) on System MHSWS001ANP-2 |
Jun 7, 2012 4:12:01 AM |
V-16-1-10301 Initiating Online of Resource AviNetConnector (Owner: unknown, Group: SG_AvMail) on System MHSWS001ANP-2 |
Jun 7, 2012 4:12:03 AM |
V-16-1-10298 Resource AvMailInterchange (Owner: unknown, Group: SG_AvMail) is online on MHSWS001ANP-2 (VCS initiated) |
Jun 7, 2012 4:12:05 AM |
V-16-1-10298 Resource AviNetConnector (Owner: unknown, Group: SG_AvMail) is online on MHSWS001ANP-2 (VCS initiated) |
Jun 7, 2012 4:12:05 AM |
V-16-1-10447 Group SG_AvMail is online on system MHSWS001ANP-2 |
|
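One way to turn those time gaps into something readable is to pair each "Initiating Offline/Online" entry with the matching "is offline/online ... (VCS initiated)" entry and print how long each resource took. The following is only a minimal Python sketch: the function name `resource_durations` is made up for illustration, and the regular expressions assume the exact timestamp and message layout pasted above (a timestamp line followed by a message line), which may differ in other exports of the engine log.

```python
import re
from datetime import datetime

# Timestamp lines look like "Jun 7, 2012 4:02:23 AM |" in the paste above.
TS_RE = re.compile(r"^([A-Z][a-z]{2} \d{1,2}, \d{4} \d{1,2}:\d{2}:\d{2} [AP]M)")
# "Initiating Offline of Resource X (...) on System Y"
START_RE = re.compile(r"Initiating (Offline|Online) of Resource (\S+) .* on System (\S+)")
# "Resource X (...) is offline on Y (VCS initiated)"
DONE_RE = re.compile(r"Resource (\S+) \(.*\) is (offline|online) on (\S+) \(VCS initiated\)")


def resource_durations(log_text):
    """Return (resource, action, system, seconds) tuples, slowest first."""
    pending = {}      # (resource, action, system) -> start time
    results = []
    current_ts = None
    for raw in log_text.splitlines():
        # Drop the trailing "|" separators left over from the paste.
        line = raw.strip().rstrip("|").strip()
        if not line:
            continue
        m = TS_RE.match(line)
        if m:
            current_ts = datetime.strptime(m.group(1), "%b %d, %Y %I:%M:%S %p")
            continue
        if current_ts is None:
            continue  # text before the first timestamp
        m = START_RE.search(line)
        if m:
            pending[(m.group(2), m.group(1).lower(), m.group(3))] = current_ts
            continue
        m = DONE_RE.search(line)
        if m:
            key = (m.group(1), m.group(2), m.group(3))
            started = pending.pop(key, None)
            if started is not None:
                seconds = int((current_ts - started).total_seconds())
                results.append(key + (seconds,))
    return sorted(results, key=lambda r: r[3], reverse=True)
```

Calling `resource_durations(open("engine_log.txt").read())` on a saved copy of the log (the file name is a placeholder) returns the slowest steps first. Applied to the entries above, the DiskGroup offline on MHSWS001ANP-1 (4:04:23 to 4:09:50, about 327 seconds) and the DiskGroup online on MHSWS001ANP-2 (4:09:50 to 4:11:50, about 120 seconds) account for most of the switch time.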
Hi mhab11,
You mentioned that you are running 5.1 SP2, but you did not mention whether you have any CPs installed on top of that. Several of the early CPs had DG import/deport performance fixes that might be helpful in your situation. I would recommend going to one of the latest CPs, such as CP10 or higher, and seeing if your issue still happens.
There have also been major improvements in the 6.0 product with the FastFailover option in SFW HA clusters. If you can upgrade to 6.0 and your environment meets all the requirements for FastFailover (SCSI-3 is the big one), then you should see major improvements in disk group imports and deports.
Thank you,
Wally