|
AggreGate Server Failover |
Top Previous Next |
|
AggreGate Server failover cluster includes:
The failover servers are activated when the master server fails, e.g. due to:
Failover Server Modes Failover servers may work in Normal Mode or Read-only Mode. The difference between these modes is explained below. The mode of failover server is controlled by Failover Mode global cluster setting. Normal Failover Mode In Normal Failover Mode, failover server take full control control or AggreGate cluster upon Master Server failure. It controls and monitors the devices, services operators connections, etc. All configuration changes and events are stored in the database and will be available for the Master server once it becomes operative again. Read-Only Failover Mode In Read-Only Failover Mode, failover server does not perform any change to the underlying database. Its behaviour appears to be similar to the Normal Failover server at first glance: devices are being controlled and their configuration settings may be changed by operators, actions can be executed, and all system functions are available. However, no configuration changes and events are stored in the database. This causes several limitations:
Failover Scenarios This section describes several common Failover Cluster configurations. Note that database cluster is shown as a "cloud" on the below images. In practice, databases participating the database cluster will run on the same physical servers with AggreGate Server installations. Two Nodes The most common failover cluster configuration includes two servers: Master Server and Normal Failover Server. Once the Master fails, the Failover switches to Failover Master mode, taking over Master's operations.
Three Nodes The three nodes failover cluster helps to maintain system reliability even when the Master server has failed.
If the Master server fails, three nodes cluster will work similar to two nodes cluster. This allows to protect from Failover Master's failure and gives system administrators spare time to restore three nodes operation.
Failover Mode Operation If a Master node fails, it stops performing regular database updates called "heart beats". The absence of these updates is notified by the Failover nodes. If no Master heart beats occur for longer than a Node Failure Detection Time, the failover nodes are activated and start servicing normal system activities, such as device control operations and operator actions. The service interruption interval equals to the sum of Node Failure Detection Time and failover node activation time. This gap is typically less than a minute. Disconnection of Failover Nodes If a failover node is disconnected from the cluster, e.g. for an update, cluster operation continues without any change. However, the Master server constantly monitors the heart bean of Failover nodes. If no failover nodes seem to be alive for longer than the Node Failure Detection Time, Master server will fire a warning event in the Administration context. Failover Alert Once a Master Server of AggreGate failover cluster fails, the Failover node raises a Failover Alert. This helps to quickly notify system administrators of the situation. By default, an e-mail message is sent to the administrators. It is however recommended to configure an SMS message to be sent in case of Failover alert. See Alert SMS Notifications for details. Making Failover Server a Master In some rare cases the Master server may be completely lost in a severe accident, e.g. due to a major hardware failure. In this case it's necessary to make one of the Failover nodes the Master node. To switch a Failover node to the Master mode:
Configuring Client for Failover To prepare AggreGate Client to work in the clustered environment:
Configuring Web Desktop for Failover Once the Master node of the high availability cluster fails, the system operators won't be longer able to log in to the Web Desktop since the IP address an host name of Failover node differs from the address of failed Master node. There are two resolutions for this issue:
|