11-High Availability Configuration Guide

HomeSupportSwitchesH3C S7500E Switch SeriesConfigure & DeployConfiguration GuidesH3C S7500E Switch Series Configuration Guides-Release7178-6W10011-High Availability Configuration Guide
01-High Availability Overview
Title Size Download
01-High Availability Overview 48.46 KB

High availability overview

Because communication interruptions can seriously affect widely-deployed value-added services such as IPTV and video conference, basic network infrastructures must be able to provide high availability.

The following are the effective ways to improve availability:

·     Increasing fault tolerance.

·     Speeding up fault recovery.

·     Reducing impact of faults on services.

Availability requirements

Table 1 describes a typical availability model that divides availability requirements into different levels.

Table 1 Availability requirements

Level

Requirement

Solution

1

Decrease system software and hardware faults

·     HardwareSimplified circuit design, enhanced production techniques, and reliability tests.

·     SoftwareReliability design and test.

2

Protect system functions from being affected if faults occur

Device and link redundancy and switchover.

3

Enable the system to recover as fast as possible

Performing fault detection, diagnosis, isolation, and recovery technologies.

 

Consider level 1 availability requirements during the design and production processes of network devices.

Consider level 2 availability requirements during network design.

Consider level 3 availability requirements during network deployment, according to the network infrastructure and service characteristics.

Availability evaluation

Mean Time Between Failures (MTBF) and Mean Time to Repair (MTTR) are used to evaluate the availability of a network.

MTBF

MTBF is the predicted elapsed time between inherent failures of a system during operation. It is typically in the unit of hours. A higher MTBF means a high availability.

MTTR

MTTR is the average time required to repair a failed system. MTTR in a broad sense also involves spare parts management and customer services.

MTTR = fault detection time + hardware replacement time + system initialization time + link recovery time + routing time + forwarding recovery time. A smaller value of each item means a smaller MTTR and a higher availability.

High availability technologies

Increasing MTBF or decreasing MTTR can enhance the availability of a network. The high availability technologies described in this section meet the level 2 and level 3 high availability requirements in the aspect of decreasing MTTR.

High availability technologies can be classified as fault detection technologies or protection switchover technologies.

Fault detection technologies  

Fault detection technologies enable detection and diagnosis of network faults:

·     DLDP and Ethernet OAM are data link layer fault detection technologies.

·     BFD is a generic fault detection technology that can be used at any layer.

·     Track works along with other high availability technologies to detect faults through a collaboration mechanism.

Table 2 Fault detection technologies

Technology

Introduction

Reference

DLDP

DLDP deals with unidirectional links that might occur in a network. When detecting a unidirectional link, DLDP, as configured, can shut down the related port automatically or prompt users to take actions to avoid network problems.

"Configuring DLDP"

Ethernet OAM

As a tool monitoring Layer 2 link status, Ethernet OAM is mainly used to address common link-related issues on the last mile. You can monitor the status of the point-to-point link between two directly connected devices by enabling Ethernet OAM on the two devices.

"Configuring Ethernet OAM"

BFD

BFD provides a single mechanism to quickly detect and monitor the connectivity of links or IP forwarding in networks. To improve network performance, devices must quickly detect communication failures to restore communication through backup paths as soon as possible.

"Configuring BFD"

Monitor Link

Monitor Link associates the state of downlink interfaces with the state of uplink interfaces in a monitor link group. When Monitor Link shuts down the downlink interfaces because of an uplink failure, the downstream device changes connectivity to another link.

"Configuring Monitor Link"

Track

The Track module implements collaboration between different modules. The collaboration involves three sets of modules: application, Track, and detection. These modules collaborate with one another through collaboration entries. The detection modules trigger the application modules to perform certain operations through the Track module. The detection modules probe such items as link status and network performance, and inform the application modules of the detection result through the Track module. Once notified of network status changes, the application modules use the changes to avoid communication interruption and network performance degradation.

"Configuring Track"

 

Protection switchover technologies

Protection switchover technologies aim at recovering network faults. They back up hardware, link, routing, and service information for switchover in case of network faults to ensure continuity of network services.

A single availability technology cannot solve all problems. You should use a combination of availability technologies, chosen on the basis of detailed analysis of network environments and user requirements, to enhance network availability. For example, access-layer devices should be connected to distribution-layer devices over redundant links, and core-layer devices should be fully meshed. Network availability should be considered during the planning stage.

Table 3 Protection switchover technologies

Technology

Introduction

Reference

Active and standby switchover

When a device has two MPUs installed, one MPU is the active MPU, and the other is the standby MPU. Only the active MPU forwards packets and processes services. Typically the system selects the MPU with a smaller slot number as the active MPU. The standby MPU keeps consistent configurations with those on the active MPU through the synchronization function. When the active MPU fails or is removed, the standby MPU automatically becomes the active MPU to ensure non-stop operating of the devices.

N/A

Ethernet link aggregation

Ethernet link aggregation, or link aggregation, aggregates multiple physical Ethernet links into one logical link to increase link bandwidth beyond the limits of any one single link. This logical link is an aggregate link. It allows for link redundancy because the member physical links can dynamically back up one another.

Layer 2—LAN Switching Configuration Guide

Smart Link

Smart Link provides link redundancy and fast convergence in a dual uplink network, allowing the backup link to take over quickly when the primary link fails.

"Configuring Smart Link"

MSTP

As a Layer 2 management protocol, MSTP eliminates Layer 2 loops by selectively blocking redundant links in a network, and in the meantime, allows for link redundancy.

Layer 2—LAN Switching Configuration Guide

RRPP

RRPP is a link layer protocol designed for Ethernet rings. RRPP can prevent broadcast storms caused by data loops when an Ethernet ring is healthy, and rapidly restore the communication paths between the nodes in the event that a link is disconnected on the ring.

"Configuring RRPP"

GR

GR prevents forwarding discontinuity caused by a protocol restart (including a BGP, IS-IS, OSPF, or LDP) or an active/standby switchover. This feature requires the peer devices to implement routing information backup and recovery.

Related chapters in Layer 3—IP Routing Configuration Guide and MPLS Configuration Guide

VRRP

VRRP is an error-tolerant protocol, which provides highly reliable default links on multicast and broadcast LANs such as Ethernet, avoiding network interruption due to failure of a single link.

"Configuring VRRP"

 

 

  • Cloud & AI
  • InterConnect
  • Intelligent Computing
  • Security
  • SMB Products
  • Intelligent Terminal Products
  • Product Support Services
  • Technical Service Solutions
All Services
  • Resource Center
  • Policy
  • Online Help
All Support
  • Become a Partner
  • Partner Resources
  • Partner Business Management
All Partners
  • Profile
  • News & Events
  • Online Exhibition Center
  • Contact Us
All About Us
新华三官网