This Support Knowledgebase provides a valuable tool for SUSE customers and parties interested in our products and solutions to acquire information and ideas and to learn from one another. It is provided as a convenience to our customers who may run into the same or similar issues. NOTE: Much of the information in this document comes from 3rd parties and is not directly verified by SUSE.

NOTE: The case described here occurred with Oracle RAC (Real Application Cluster). Other evidence has shown that similar symptoms can happen with SUSE High Availability (HA) clustering. It is also possible that similar failures or timeouts could occur with any application which is sensitive to delays.

Two or more SLES 12 SP4 systems are running an Oracle RAC (Real Application Cluster). Occasionally, the Oracle software will fence a node because communication timeouts are detected. The communication timeouts center around the UDP heartbeat of the cluster. Often, a partial timeout is detected but communication recovers in time to avoid fencing the node. For example, the Oracle cluster software may log the following warnings:

22:01:46.116 CRS-1612: Network communication with node server1a (1) has been missing for 50% of the timeout interval. If this persists, removal of this node from cluster will occur in 14.840 seconds
22:01:51.118 CRS-1727: Network communication between this node 'server1b' (2) and node 'server1a' (1) re-established.

A system call trace (strace) of Oracle's cluster process (OCSSD) was performed, as well as packet analysis of the UDP heartbeat which OCSSD generates. These showed that every call by OCSSD to send a heartbeat packet resulted in a packet going onto the wire, and that those packets were successfully delivered from node to node. However, there were some periods of time when OCSSD was apparently unable to make system calls, and therefore the heartbeat was interrupted. Those periods were traced to times of excessive activity by FireEye's xagt.
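When many such warnings are logged, it can help to measure how long each heartbeat interruption lasted before lining it up against other activity on the node. The following is a minimal sketch, not an Oracle or SUSE tool. It assumes log lines shaped exactly like the two excerpts above (a time of day, then `CRS-<code>:`, then the message); real cluster logs may carry additional fields, so the pattern would need adjusting. The function name `heartbeat_gaps` and both regular expressions are hypothetical. Note that the value reported is the time from the 50% warning to recovery, not the full length of the interruption.

```python
import re
from datetime import datetime

# Assumed line shape, taken from the excerpts in this article:
#   "<HH:MM:SS.mmm> CRS-<code>: <message>"
LINE_RE = re.compile(
    r"^(?P<ts>\d{2}:\d{2}:\d{2}\.\d{3})\s+CRS-(?P<code>\d+):\s+(?P<msg>.*)$"
)
# Matches "node server1a (1)" as well as "node 'server1a' (1)".
NODE_RE = re.compile(r"node '?([\w.-]+)'? \((\d+)\)")


def heartbeat_gaps(lines):
    """Pair each CRS-1612 warning with the next CRS-1727 recovery
    for the same remote node.

    Returns a list of (remote_node, seconds_from_warning_to_recovery).
    """
    pending = {}  # remote node name -> time of first unanswered warning
    gaps = []
    for line in lines:
        m = LINE_RE.match(line.strip())
        if not m:
            continue
        nodes = NODE_RE.findall(m["msg"])
        if not nodes:
            continue
        when = datetime.strptime(m["ts"], "%H:%M:%S.%f")
        if m["code"] == "1612":
            # In the warning, the only node named is the remote one.
            pending.setdefault(nodes[0][0], when)
        elif m["code"] == "1727":
            # In the recovery message, the remote node is named last.
            remote = nodes[-1][0]
            start = pending.pop(remote, None)
            if start is not None:
                delta = (when - start).total_seconds()
                if delta < 0:
                    # No date in the excerpt; assume a single midnight rollover.
                    delta += 24 * 60 * 60
                gaps.append((remote, delta))
    return gaps
```

Applied to the two log lines quoted above, this reports one gap for `server1a` of about 5 seconds between the warning and the recovery. Comparing such windows with process activity recorded at the same times is one way to check whether another process, such as xagt in this case, was busy during them.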