Troubleshooting when an application hangs on an node within VCS
Hi, If an application hang on an node within vcs,i like to verify how I troubleshoot this issue. Should I use the stop script in main.cf to cleanly stop this application?Or kill -9 the application processes?Then start the group hagrp -online? thanks so much.
<<title edited by admin to add further descriptiveness>>
If VCS detects the application is hung, then VCS will call a clean which will forcabily stop the application and then take further action depending on how you have configured VCS - so for instance if you have set RestartLimit on the resource type then VCS will restart the application, else in other configurations will failover the group to another system.
If VCS does NOT detect the application is hung, then if you have set RestartLimit on the resource type then you could kill -9 the application processes and then VCS will restart, but if RestartLimit is not set and you don't application to failover to another system, then you could offline using VCS (hares -offline or use GUI) an this will try to gracefully stop the application and of this doesn't work, VCS will call a clean. Alternatively, you freeze service group (hagrp -freeze or use GUI) and kill -9 the application processes (freezing group means VCS will not take action when it sees process dies) and then restart application manually or using VCS.
Mike