05-30-2011 01:55 AM
Hi,
On one of the thread it has been mentioned that 'MonitorTimeout' value need to be increased in order to solve a specific issue (online of a service delay, more than 60 sec). I am wondering if any of you aware where is the 'monitorTimeout' value in VCS windows ?
Regards
Ramesh
Solved! Go to Solution.
06-01-2011 01:20 AM
There could be a problem with monitor routine, so I what I would try, is to start service under Windows and then probe resource (or wait 5 minutes) and see if VCS marks the resource as online.
Looking at the timing, VCS MAY be doing something like:
2011/05/28 11:10:37 VCS NOTICE V-16-1-10301 Initiating Online of Resource xxxx-xxxx-xxxx (Owner: unknown, Group: xxx) on System xxxxxx
11:12:37: 2 mins later agent times out, but waits 10 seconds before running monitor
11:12:47: Monitor routine runs and takes 4 seconds to determine resource is not online
2011/05/28 11:12:51 VCS ERROR V-16-2-13066 (xxxxx) Agent is calling clean for resource(xxxx-xxxx-xxxx) because the resource is not up even after online completed.
To see if this is the case you could try increasing delayafterOnline and OnlineTimeout to see what the effect is.
Mike
05-30-2011 03:09 AM
You can view and set MonitorTimeout and OnlineTimeout using "hatype -display/modify" from Command Prompt, however, I believe the reason you don't see them in Windows Java Gui, is these entry points are not supported in Windows - see extract from the agent dev guide which says:
I raised this discrepency with Symantec Support in March this year and this was their response:
Entrypoint timeouts are not the same as resource timeouts. All the timeouts you mention do take effect on Windows. I find it a bit hard to understand why anyone would think they didn’t.
Entrypoint timeout is a timeout internal to the instruction sent and only of interest to those developing agents for use on Windows (which is why it’s only in the dev guide). It’s just a case of what processes the timeout. The entrypoint itself will not return a code stating e.g. “I have timed out” to the agent, but the agent has a timeout internally after which point it can declare the online/offline/monitor etc. to have timed out. The effect is the same, the mechanism is different.
If you need to set a timeout, please just set it.
I didn't understand this response from Support and so responded with the following below, but never got any more replies from Support:
05-30-2011 07:29 AM
Mike, thanks a lot for detailed explanation. Here is the problem, hope you could advise an option to resolve the issue..
I have a generic service resource that is taking time to come online. I want to increase the agent that is associated with this resource type so that the service resource has sufficient time to start. Let’s say the service takes 2 minutes (120 seconds) to come up so I need to change the monitor cycle to 120 seconds
Regards
Ramesh
05-30-2011 07:54 AM
If the service is taking a long time to come online, then it is the OnlineTimeout you need to set, not the MonitorInterval or MonitorTimeout. You can try changing this, but as I said in the last post, I don't know if the attribute is used. To set use:
haconf -makerw hatype -modify GenericService OnlineTimeout 120 haconf -dump -makero
However, your problem might be that you need to increase the DelayAfterOnline resource attribute (this is by default 10 seconds). You can set this to a higher value in the GUI, or from the command line use:
haconf -makerw hares -modify resource_name DelayAfterOnline 20 haconf -dump -makero
Mike
05-30-2011 10:24 AM
Have another look at this extract that Mike posted above:
05-30-2011 08:32 PM
Onlinetimeout value is 300 which is set on service group and I guess it will override whatever settings on individual resource, if yes then onlinetimeout value may not be the solution.
We monitored service startup delay which is not more than 2 mins and its with in the limit of Onlinetimeout. Thanks for suggestion,
'DelayAfterOnline' has been set to 10sec. Do you think 'delayafterOnline' parameter has any affect since resource has been called for clean after 21sec.
2011/05/28 11:10:37 VCS NOTICE V-16-1-10301 Initiating Online of Resource xxxx-xxxx-xxxx (Owner: unknown, Group: xxx) on System xxxxxx
2011/05/28 11:12:51 VCS ERROR V-16-2-13066 (xxxxx) Agent is calling clean for resource(xxxx-xxxx-xxxx) because the resource is not up even after online completed.
2011/05/28 11:12:51 VCS ERROR V-16-2-13069 (xxxxx) Resource(xxxx-xxxx-xxxx) - clean failed.
06-01-2011 01:20 AM
There could be a problem with monitor routine, so I what I would try, is to start service under Windows and then probe resource (or wait 5 minutes) and see if VCS marks the resource as online.
Looking at the timing, VCS MAY be doing something like:
2011/05/28 11:10:37 VCS NOTICE V-16-1-10301 Initiating Online of Resource xxxx-xxxx-xxxx (Owner: unknown, Group: xxx) on System xxxxxx
11:12:37: 2 mins later agent times out, but waits 10 seconds before running monitor
11:12:47: Monitor routine runs and takes 4 seconds to determine resource is not online
2011/05/28 11:12:51 VCS ERROR V-16-2-13066 (xxxxx) Agent is calling clean for resource(xxxx-xxxx-xxxx) because the resource is not up even after online completed.
To see if this is the case you could try increasing delayafterOnline and OnlineTimeout to see what the effect is.
Mike
06-01-2011 03:11 PM
Thanks a lot, will try out those options..
Regards
Ramesh