Posts

Showing posts from 2009

You know what? - DIAGWAIT

What does diagwait mean in a RAC environment?

Oracle Clusterware evicts a node from the cluster when:

1. The node is not pinging via the network heartbeat.
2. The node is not pinging the voting disk.
3. The node is hung or busy and is unable to perform either of the above tasks.

In most cases when a node is evicted, information is written to the logs that can be used to analyze the cause of the eviction. In certain cases, however, this information may be missing. The steps documented in this note are for those cases where there is too little information, or none at all, to diagnose the cause of the eviction.

CAUSE

When a node is evicted while it is extremely busy in terms of CPU (or starved of CPU), the OS may not have had time to flush the logs/traces to the file system. It may be useful to set the diagwait attribute to delay the node reboot, giving the OS additional time to write the traces. This setting will provide more time for diagnostic data to be collected safely and will NOT i