SRM

About disaster recovery

Disaster recovery recovers data and resumes service operations for information systems from faults or breakdowns caused by natural disasters and infrastructure failures such as fires, floods, earthquakes, power outages, and network egress failures.

Typically, a disaster recovery system contains two or more identical service systems deployed at geographically dispersed sites. The disaster recovery system monitors the service systems and maintains data consistency among them. When one service system fails, its services fail over to another site to ensure service continuity.

Key metrics for disaster recovery performance measurement

The following metrics are used for measuring the performance of a disaster recovery system:

Figure-1 RPO

 

Figure-2 RTO

 

Types of disaster recovery

Based on the level of protection, the following types of disaster recovery are available:

Array-based replication

Disaster recovery relies on array-based replication to back up and protect data. Array-based replication copies all or some data sets of an application from a host disk or storage array to another storage medium to prevent misoperations or failures from causing data loss. Array-based replication can be performed synchronously or asynchronously.

Synchronous replication

Synchronous replication ensures data consistency between the storage volumes in a replication pair. In synchronous replication, each IO operation releases resources only after both the local and remote volumes in a replication pair return write operation completion. Synchronous replication provides the highest level of data integrity at the cost of decreased performance caused by data transmission latency, and it requires the round trip delay between source and destination arrays to be short. Typically, synchronous replication is used for short-distance replication (10 to 100 km, or 6.21 to 62.13 miles) in scenarios that require strict data consistency and near-zero data loss, such as internal systems of banks.

Figure-3 Synchronous replication

 

Asynchronous replication

Asynchronous replication is performed periodically and thus cannot ensure data consistency between volumes in a replication pair. In asynchronous replication, a local volume creates a snapshot after it finishes a write operation and copies the snapshot to a remote volume. Asynchronous replication offers high performance but does not guarantee zero data loss because source and destination volumes might have inconsistent data. Asynchronous replication does not require high bandwidth or short transmission distance, which makes it suitable for systems that require high performance, have light write loads, and does not require high array IOPS performance or short delay, such as databases and file systems.

Figure-4 Asynchronous replication

 

About UIS site recovery management

UIS site recovery management (SRM) provides disaster recovery services for applications based on array-based replication.

Application scenarios

UIS SRM is applicable to data centers in homogeneous clouds (with same H3C UIS version deployed). UIS SRM can decrease RPO and RTO to minutes and replicate data for storage arrays regardless of whether they support storage replication adapters (SRAs).

Mechanisms

UIS SRM ensures data consistency and service continuity as follows:

When the protected site fails, SRM restores protected VMs at the recovery site by using the data backed up at the storage and service levels based on a recovery plan.

For array-based replication to operate correctly, make sure the following conditions are met:

·          The protected and recovery sites use the same replication technology.

·          Storage replication and snapshot features have been licensed on storage arrays.

 

Figure-5 Storage replication and disaster recovery procedure

 

Features

Cost-effective disaster recovery plan

Automated disaster recovery

One-stop disaster recovery solution

Support for various disaster recovery scenarios