Monitoring Cluster Resources
Hi,
Working on a new project using Veritas Storage Foundation 5.0x on Linux. Environment consists of 2 machines or nodes within the cluster each with redundant software installed for failover purposes.
Plan to use the Veritas HA agent for WebSphere to monitor the WebSphere process we need.
I have some other components or software processes that will require monitoring as well. In order to monitor these processes, I imagine that I will have to develop or write scripts to monitor these software processes. Should a process fail, it will have to have the ability fail over to the other node. Have been reading the Veritas Cluster Server Agents Developers Guide as a start. I plan to use scripts to write the custom agents.
From what I gather thus far, each process which needs monitoring requires basic entry points:
online
offline
monitor
clean
Right now we are in the planning phase of the project still so we have not installed the veritas cluster software yet. For the additional components or software processes I need to monitor, they need to have the ability to failover to the other node just as the WebSphere veritas agent will do for us.
Not quite sure how the monitoring entry point works. For example, say I start process A with the command: startProcessA.sh I can then grep for process A to see that it's running.
What code is used in the monitoring entry point to check to see if process A is running?
Are there any templates or example of agents for monitoring processes for failover purposes?
Thank you.
Most applications can be handled by the Application Agent. See the following example from the Bundled Agents Guide:
ftp://exftpp.symantec.com/pub/support/products/ClusterServer_UNIX/306957.pdf
Application samba_app (
User = "root"
StartProgram = "/usr/sbin/samba start"
StopProgram = "/usr/sbin/samba stop"
PidFiles = { "/var/lock/samba/smbd.pid" }
MonitorProcesses = { "nmbd" }
)TheAgent Builder is another option:
ftp://exftpp.symantec.com/pub/support/products/ClusterServer_UNIX/317352.pdf
I hope this helps.Regards
Manuel