Agent failed in VCS

Question

Agent are in failed status. Below are the messages in engineA.log file. Please let me knwo the cause of this issue
&nbsp;
	VCS WARNING V-16-1-53025 Agent Script has faulted; ipm connection was lost; restarting the agent
	VCS ERROR V-16-1-10015 Cannot start /opt/VRTSvcs/bin/Script/ScriptAgent please check file

VCS WARNING V-16-1-53025 Agent NIC has faulted; ipm connection was lost; restarting the agent
	&nbsp;VCS ERROR V-16-1-10008 Agent NIC has faulted 6 times since&nbsp;&nbsp;
VCS ERROR V-16-1-10015 Cannot start /opt/VRTSvcs/bin/NIC/NICAgent please check file
	&nbsp;VCS WARNING V-16-10001-4028 (unix) IP:Unix-G1-IP:monitor:Empty NetMask is supplied, default netmask will be used.

VCS WARNING V-16-1-10023 Agent DiskGroup not sending alive messages since&nbsp;
VCS WARNING V-16-1-53025 Agent DiskGroup has faulted; ipm connection was lost; restarting the agent
	VCS ERROR V-16-1-10015 Cannot start /opt/VRTSvcs/bin/DiskGroup/DiskGroupAgent please check file

gaurav_s · Accepted Answer

Above log shows that all the agents are having issue which is giving a different indication ..
1. Either HAD process is hung or unresponsive OR
2 System itself is unresponsive which means that HAD process is not getting enough resources to communicate to the agents &amp; hence all agents are complaining..
I would suggest to run OS utilities like "Sar" "prstat" or "top" to find what is happening with system performance ..
&nbsp;
G

stinsong · Answer

There is an interprocess communication between the VCS agents and the had daemon. If this communication is disrupted (due to system load), the agent will fault, and had will restart the agent.

Pls also reference to http://www.symantec.com/docs/TECH155691

anand_raj · Answer

Do you have this issue with some agents or all agents? Does this issue happen on one node in the cluster or all nodes? Also, does this issue happen during certain times of the day - Like when a backup is running etc? These details would help in troubleshooting this issue.&nbsp;

mokkan · Answer

As Gaurva said, it looks like performance issue on the system and none of the agents are communicating with HAD.

Have you tried to stop and start the agent? If you want, you can freeze&nbsp; the SGs, you can manually stop and start the sevice and see how it works.

hagent -force -stop AgentName -sys Name

hagent -start AGENT-sys NodeName

&nbsp;

I had similar issue for NIC Agent and stoped and start worked fine.

&nbsp;

Forum Discussion

Agent failed in VCS

4 Replies

Related Content

VCS agent start

VCS agent

Agent exiting in vcs 6.1 on rhel6.6

VCS SAP HANA agent

Oracle agent

Recent Discussions

Configure two Mount type resources of nfs FStype attribute using the same share

order

key registration and reservation

Verifying that primary and dr clusters replication is synced

vcs can create logical nic