Forum Discussion

Zahid_Haseeb
Moderator
11 years ago

System unresponsive for a while when LLT/GAB error prints in /var/log/messages

Environment: Linux RHEL 6.2, SFHA/DR 6.0.2. Query: I installed SFHA 6.0.2 and configured it. There is only one node in the cluster at this time. I installed LLT and GAB as well, so in future I can...
  • Gaurav_S
    11 years ago

    Hi,

    The highlighted messages are Linux messages, not VCS messages. I searched around and found the explanation below.

    The term "canary" as used here comes from coal mining originally. Coal miners used canaries to detect dangerous gases (if the canary they carried with them died, they knew they had to get out of the shaft/mine ASAP). As a result the term "canary" is now often used for anything that you use to get an (early) warning about a dangerous situation.

    In this case it seems like 'rtkit' starts a "normal" thread to test if the threads that get "real time" priorities are "starving" other threads (& processes), where "starving" means that they get too little processor time. This is a safety measure to make sure that processes/threads that have access to real time priorities don't use up so much CPU time that other tasks get none anymore.

    So apparently some thread(s) that got real-time priorities from rtkit is/are misbehaving, and trying to monopolize the CPU, rtkit detects this with its "canary thread", and thus rtkit takes away the real-time priorities.
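To make the canary idea concrete, here is a minimal sketch in Python. It is not rtkit's actual code: the class name `Canary`, its parameters, and the thresholds are all hypothetical, chosen only to illustrate the mechanism described above. A normal-priority thread records a timestamp at a fixed interval, and a watchdog treats a stale timestamp as a sign that the thread is being starved of CPU time.

```python
import threading
import time


class Canary:
    """Illustrative heartbeat-plus-watchdog check (hypothetical, not rtkit's code)."""

    def __init__(self, interval=0.05, starvation_threshold=0.5, clock=time.monotonic):
        self.interval = interval                          # seconds between heartbeats
        self.starvation_threshold = starvation_threshold  # max tolerated silence
        self.clock = clock                                # injectable for testing
        self.last_cheep = clock()
        self._stop = threading.Event()
        self._thread = None

    def _run(self):
        # Runs at normal priority: if real-time threads hog the CPU,
        # this loop is not scheduled and last_cheep goes stale.
        while not self._stop.wait(self.interval):
            self.last_cheep = self.clock()

    def start(self):
        self._thread = threading.Thread(target=self._run, daemon=True)
        self._thread.start()

    def stop(self):
        self._stop.set()
        if self._thread is not None:
            self._thread.join()

    def is_starved(self):
        # The watchdog's check: has the canary been silent for too long?
        return self.clock() - self.last_cheep > self.starvation_threshold
```

In a real daemon the watchdog would react to `is_starved()` by taking the real-time priorities away from the threads it manages. Here the check only returns a boolean.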

    To me, it appears that the messages above are a result of the system being busy, not the cause of the system becoming unresponsive. All of the messages above are symptoms.

    All the LLT messages also indicate that the system is heavily loaded, so you need to troubleshoot from the OS side to find out what is happening.
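As one possible starting point for the OS-side check, the sketch below parses the Linux `/proc/loadavg` file and compares the 1-minute load average with the number of CPUs. The function names and the threshold of 1.0 runnable tasks per CPU are assumptions for illustration (a common rule of thumb, not a Veritas or Red Hat recommendation).

```python
import os


def parse_loadavg(text):
    """Parse the contents of /proc/loadavg, e.g. '0.20 0.18 0.12 1/80 11206'."""
    fields = text.split()
    one_min, five_min, fifteen_min = (float(x) for x in fields[:3])
    # Fourth field is "<currently runnable>/<total scheduling entities>".
    runnable, total = (int(x) for x in fields[3].split("/"))
    return {
        "one_min": one_min,
        "five_min": five_min,
        "fifteen_min": fifteen_min,
        "runnable": runnable,
        "total": total,
    }


def is_overloaded(one_min_load, cpu_count, per_cpu_threshold=1.0):
    """Assumed rule of thumb: load above per_cpu_threshold per CPU means contention."""
    return one_min_load / cpu_count > per_cpu_threshold


def read_loadavg(path="/proc/loadavg"):
    """Read and parse the live load average (Linux only)."""
    with open(path) as handle:
        return parse_loadavg(handle.read())


# Example use on a Linux host:
#   stats = read_loadavg()
#   print(stats, is_overloaded(stats["one_min"], os.cpu_count() or 1))
```

If the load stays high around the times the LLT messages appear, tools such as `top`, `vmstat`, or `sar` can help narrow down which processes are consuming CPU or waiting on I/O.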

     

    G