Configure HA for a cluster

Cluster high availability (HA) depends on shared storage and dynamic migration technologies to provide simple and efficient HA services for applications running on all cloud hosts in the cluster. It reduces service interruption caused by host hardware failure. Cluster HA is applicable to scenarios that require service continuity.

HA mechanism

CVM virtualizes a group of hosts into a cluster that uses a shared resource pool. After you enable HA for the cluster, CVM monitors running state of all hosts and cloud hosts in the cluster.

HA benefits

Restrictions and guidelines

Procedure

  1. On the top navigation bar, click Resources.

  1. From the left navigation pane, select Virtualization.

  1. Click the Expand icon  on the left of CloudOS.

  1. Click the name of a cluster.

  1. Click HA.

  1. Enable HA as needed.

If you enable HA for the cluster, select a default startup priority for cloud hosts in the cluster.

  1. Enable service network HA and HA access control as needed.

If you enable HA access control, specify the minimum number of nodes, select a failover host, or set the reserved CPU and memory percentages.

  1. Click OK.

Parameters

Parameter

Description

Startup Priority

Select a default startup priority for the cloud hosts in the cluster. You can set the startup priority for a cloud host when you add or edit the cloud host. After a host fails, the system migrates the cloud hosts on the host based on their startup priorities until all the cloud hosts are migrated or the cluster does not have any available resources.

Enable Service HA

Select whether to enable service network HA. After you enable this feature, a cloud host is migrated to another host if the service network of the cloud host fails or is disconnected. HA failure detection is not supported on a vSwitch that uses the management network, uses VXLAN forwarding mode, or is not bound to physical NICs.

Enable HA Access Control

Select whether to enable HA access control. If you enable this feature, configure the Min Nodes, Failover Host, or HA Resource Reservation parameter.

HA Access Control Settings

Min Nodes

Specify the minimum number of hosts for HA to take effect on the cluster. If the number of hosts that are operating correctly in the cluster is smaller than the specified minimum node number, HA cannot take effect on the cluster. To avoid migration failure caused by inaccurate resource calculation, make sure all hosts in the cluster have the same CPU quantity and memory size.

Failover Host

The failover hosts must use the same shared storage as the service hosts.

HA Resource Reservation

Set the reserved CPU and memory percentages. When the remaining resources in the cluster are less than the specified percentage of resources, you cannot start new cloud hosts, set the cloud hosts to running or suspended state, or migrate running cloud hosts to the cluster.

Action

Select the action to take on related cloud hosts when the shared storage fails. This parameter is available when the Shared Storage Fault Action parameter on the system parameter page is Do Not Restart Host and the HA state is changed from disabled to enabled.

  • Fault Migration—If all data of a cloud host is saved on the shared storage, the cloud host will be migrated to another host when the shared storage fails.

  • Freeze cloud hosts—If some or all data of a cloud host is saved on the shared storage, the cloud host will be frozen and enter suspended state when the shared storage fails. After the shared storage recovers, the cloud host will automatically enter running state.