03-System Management Configuration Guide

HomeSupportRoutersSR6602-I[IE] SeriesConfigure & DeployConfiguration GuidesH3C SR6602-I[IE] AI-Powered ICT Converged Gateways Configuration Guides(V9)-R9141-6W10003-System Management Configuration Guide
03-Hardware resource management configuration
Title Size Download
03-Hardware resource management configuration 173.61 KB

Managing hardware resources

This chapter describes routine device hardware resource management tasks, including monitoring CPU and memory usage, displaying power supply and fan operating status, and configuring basic device operating parameters.

Hardware resource management tasks at a glance

All hardware resource management tasks are optional. You can perform any of the tasks in any order.

·     Monitoring CPU usage

¡     Configuring CPU usage alarm

¡     Configuring CPU usage tracking

¡     Configuring CPU diagnostic troubleshooting

¡     Displaying CPU usage monitoring information

·     Configuring CPU cores

¡     Monitoring CPU core usage

·     Monitoring memory usage

¡     Setting memory alarm thresholds

¡     Configuring memory diagnostic troubleshooting

¡     Displaying memory usage monitoring information

·     Monitoring disk usage

·     Monitoring resource usage

¡     Configuring resource monitoring

¡     Displaying resource monitoring information

·     Monitoring device temperature

¡     Displaying device temperature information

·     Monitoring fans

¡     Setting the fan operating mode

¡     Displaying fan information

·     Setting the port status detection timer

·     Setting the operating mode for an interface card

·     Verifying and diagnosing transceiver modules

¡     Verifying transceiver modules

¡     Diagnosing transceiver modules

·     Specifying an ITU channel number for a transceiver module

·     Displaying device hardware information

·     Displaying device hardware electronic label information

·     Displaying device hardware alarm information

Monitoring CPU usage

Configuring CPU usage alarm

About this task

The device samples CPU usage at 1-minute intervals, and compares the samples with CPU usage thresholds to identify the CPU usage status and send alarms or notifications accordingly.

The device supports the following CPU usage thresholds:

·     Minor threshold—If the CPU usage increases to or above the minor threshold but is less than the severe threshold, the CPU usage enters minor alarm state. The device sends minor alarms periodically until the CPU usage increases above the severe threshold or the minor alarm is removed.

·     Severe threshold—If the CPU usage increases above the severe threshold, the CPU usage enters severe alarm state. The device sends severe alarms periodically until the severe alarm is removed.

·     Recovery threshold—If the CPU usage decreases below the recovery threshold, the CPU usage enters recovered state. The device sends a recovery notification.

CPU usage alarms and notifications can be sent to NETCONF, SNMP, and the information center to be encapsulated as NETCONF events, SNMP traps and informs, and log messages. For more information about NETCONF and SNMP, see Network Management and Monitoring Configuration Guide. For more information about information center, see "Configuring the information center."

Figure 1 CPU alarms and alarm-removed notifications

Procedure

1.     Enter system view.

system-view

2.     Set the CPU usage alarm thresholds.

monitor cpu-usage threshold severe-threshold [ minor-threshold minor-threshold recovery-threshold recovery-threshold ] [ slot slot-number [ cpu cpu-number ] ]

By default:

¡     Severe CPU usage alarm threshold—99%.

¡     Minor CPU usage alarm threshold—98%.

¡     CPU usage recovery threshold—79%.

 

CAUTION

CAUTION:

If you set the severe CPU usage alarm threshold to a too low value, the device will reach the threshold easily. Normal service processing will be affected.

3.     Set the CPU usage alarm resending intervals.

monitor resend cpu-usage { minor-interval minor-interval | severe-interval severe-interval } * [ slot slot-number [ cpu cpu-number ] ]

By default, the minor CPU usage alarm resending interval and severe CPU usage alarm resending interval are 300 seconds and 60 seconds, respectively.

Configuring CPU usage tracking

About this task

After you enable CPU usage tracking, the system samples CPU usage at intervals and saves the samples to a buffer. You can use the display cpu-usage history command to view the recent CPU usage.

Procedure

1.     Enter system view.

system-view

2.     Set the sampling interval for CPU usage tracking.

monitor cpu-usage interval interval [ slot slot-number [ cpu cpu-number ] ]

By defau

lt, the sampling interval for CPU usage tracking is 1 minute.

3.     Enable CPU usage tracking.

monitor cpu-usage enable [ slot slot-number [ cpu cpu-number ] ]

By default, CPU usage tracking is enabled.

Configuring CPU diagnostic troubleshooting

CPU diagnostic troubleshooting detects whether CPU usage is normal. If CPU usage exceeds alarm thresholds or suddenly increases, this feature helps you to locate CPU usage issues. For more information about CPU diagnostic troubleshooting, see diagnostic troubleshooting in Intelligent O&M Configuration Guide.

Displaying CPU usage monitoring information

Perform display tasks in any view.

·     Display CPU usage monitoring settings.

display cpu-usage configuration [ slot slot-number [ cpu cpu-number ] ]

·     Display the current CPU usage statistics.

display cpu-usage [data-plane ] [ summary ] [ slot slot-number [ cpu cpu-number ] ]

·     Display the historical CPU usage statistics in a coordinate system.

display cpu-usage history [ job job-id ] [ slot slot-number [ cpu cpu-number ] ]

Monitoring CPU core usage

About this task

The device samples CPU core usage at 5-second intervals and calculates the average value during each CPU core usage statistics interval. If the value during an interval is greater than a CPU core usage threshold, the device issues an alarm and logs the event.

Restrictions and guidelines

As a best practice, set the CPU core usage statistics interval to an integer multiple of 5. If you set the interval to a non-integer multiple of 5, the effective statistics interval is the integer multiple of 5 that is close to and smaller than the value you set. For example, if you set this argument to 18 seconds, the effective statistics interval is 15 seconds.

Procedure

1.     Enter system view.

system-view

2.     Set CPU core usage statistics intervals.

monitor cpu-usage statistics-interval interval slot slot-number cpu cpu-number core core-id-list

By default, the CPU core usage statistics interval is 60 seconds.

3.     Set CPU core usage alarm thresholds.

monitor cpu-usage threshold severe-threshold [ minor-threshold minor-threshold recovery-threshold recovery-threshold ] slot slot-number cpu cpu-number core core-id-list

By default:

¡     Severe alarm threshold—99%.

¡     Minor alarm threshold—98%.

¡     CPU usage recovery threshold—79%.

4.     Set CPU core alarm resending intervals.

monitor resend cpu-usage core-interval { minor-interval minor-interval | severe-interval severe-interval } * [ slot slot-number [ cpu cpu-number ] ]

By default, the minor CPU core alarm resending interval is 300 seconds and the severe CPU core alarm resending interval is 60 seconds.

Configuring CPU cores

Specifying the number of CPU cores as control cores

About this task

Control cores are used for the system services on the control plane. Typically, all services are running in one CPU core. If services need a large amount of computing, such as for a VPN with device networking, one CPU core might affect computing speed. In this case, you can use this command to increase the number of control cores. Data cores are used for packet forwarding and VM cores are used on VMs.

You can use display core allocation to obtain information about CPU core allocation.

Restrictions and guidelines

The total number of CPU cores that are allocated must be equal to or smaller than the total number of CPU cores on the device.

For the configuration to take effect, restart the slot where CPU cores reside.

Procedure

1.     Enter system view.

system-view

2.     Specify the number of CPU cores allocated as control cores.

allocate core slot slot-number control-core core-count

By defualt, the number of CPU cores allocated as control cores is 1.

Displaying information about CPU core allocation

Procedure

To display information about CPU core allocation, execute the following command in any view:

display core allocation slot slot-number

 

 

Monitoring memory usage

Setting memory alarm thresholds

About this task

To ensure correct operation and improve memory efficiency, the system monitors the amount of free memory space in real time. If the amount of free memory space reaches the minor, severe, or critical alarm threshold, the system issues an alarm to affected service modules and processes.

The system can also issue an early warning to warn you of an approaching insufficient-memory condition.

(On devices that do not support low memory.) You can use the display memory command to display memory usage information.

(On devices that support low memory.) The system monitors only the amount of free low-memory space. You can use the display memory command to display memory usage information.

(On devices with slots that support low memory.) For slots that support low memory, the system monitors only the amount of free low-memory space. You can use the display memory command to display memory usage information. If the LowMem field is displayed for a slot, the slot supports low memory.

As shown in the following table and figure, the system supports these free-memory thresholds:

·     Sufficient-memory threshold.

·     Early-warning threshold.

·     Normal state threshold.

·     Minor alarm threshold.

·     Severe alarm threshold.

·     Critical alarm threshold.

Table 1 Memory alarm notifications and memory alarm-removed notifications

Notification

Triggering condition

Remarks

Early-warning notification

The amount of free memory space decreases to or below the early-warning threshold.

After generating and sending an early-warning notification, the system does not generate and send any additional early-warning notifications until the early warning is removed.

Minor alarm notification

The amount of free memory space decreases to or below the minor alarm threshold.

After generating and sending a minor alarm notification, the system does not generate and send any additional minor alarm notifications until the minor alarm is removed.

Severe alarm notification

The amount of free memory space decreases to or below the severe alarm threshold.

After generating and sending a severe alarm notification, the system does not generate and send any additional severe alarm notifications until the severe alarm is removed.

Critical alarm notification

The amount of free memory space decreases to or below the critical alarm threshold.

After generating and sending a critical alarm notification, the system does not generate and send any additional critical alarm notifications until the critical alarm is removed.

Critical alarm-removed notification

The amount of free memory space increases above the severe alarm threshold.

N/A

Severe alarm-removed notification

The amount of free memory space increases above the minor alarm threshold.

N/A

Minor alarm-removed notification

The amount of free memory space increases above the normal state threshold.

N/A

Early-warning-removed notification

The amount of free memory space increases above the sufficient-memory threshold.

N/A

Figure 2 Memory alarm notifications and alarm-removed notifications

The sending intervals for memory alarms are as follows:

·     If the amount of the free memory space has been over the early-warning alarm threshold for 1 hour, an early-warning alarm notification is sent.

·     If the amount of the free memory space has been over the minor alarm threshold for 12 hour, a minor alarm notification is sent.

·     If the amount of the free memory space has been over the severe alarm threshold for 3 hour, a severe alarm notification is sent.

·     If the amount of the free memory space has been over the critical alarm threshold for 1 hour, a critical alarm notification is sent.

Restrictions and guidelines

If a memory alarm occurs, delete unused configuration items or disable some features to increase the free memory space. Because the memory space is insufficient, some configuration items might not be able to be deleted.

The system will restart a card if one of the following conditions is met:

·     After a critical alarm occurs, the remaining free-memory value has been smaller than the critical alarm threshold for 30 seconds.

·     The interval between two consecutive critical alarms is shorter than 30 seconds.

·     The critical alarm has occurred three times within three minutes.

·     After a critical alarm occurs, the system will periodically sample free memory space and predict if the free memory space will be exhausted within 30 seconds. If the prediction result indicates that the free memory space will be exhausted within 30 seconds, the system will restart the card.

Once the free memory space reaches the early warning, minor, severe, or critical alarm threshold, the device will display the current memory usage information when you log in to the device through console or Telnet login, or execute every command.

Procedure

1.     Enter system view.

system-view

2.     Set the memory usage threshold.

memory-threshold [ slot slot-number [ cpu cpu-number ] ] usage memory-threshold

By default, the memory usage threshold is 100%.

3.     Set the free-memory thresholds.

memory-threshold [ slot slot-number [ cpu cpu-number ] ] [ ratio ] minor minor-value severe severe-value critical critical-value normal normal-value [ early-warning early-warning-value secure secure-value ]

The default settings vary by device model. To obtain the default free-memory thresholds, execute the undo memory-threshold and display memory-threshold commands in sequence.

Configuring memory diagnostic troubleshooting

Memory diagnostic troubleshooting detects whether free memory space is normal. If free memory space exceeds alarm thresholds or suddenly decreases, this feature helps you to locate memory space issues. For more information about memory diagnostic troubleshooting, see diagnostic troubleshooting in Intelligent O&M Configuration Guide.

Displaying memory usage monitoring information

Perform display tasks in any view.

·     Display memory usage information.

display memory [ summary ] [ slot slot-number [ cpu cpu-number ] ]

·     Display memory alarm thresholds and statistics.

display memory-threshold [ slot slot-number [ cpu cpu-number ] ]

Monitoring disk usage

About this task

This feature enables the device to periodically sample the usage of a disk and compare the usage with the threshold. If the disk usage exceeds the threshold, the device sends an alarm notification.

Disk usage alarm notifications are sent to NETCONF, SNMP, and the information center to be encapsulated as NETCONF events, SNMP traps and informs, and log messages. For more information about NETCONF, SNMP, and information center, see Network Management and Monitoring Configuration Guide.

Procedure

1.     Enter system view.

system-view

2.     Set the disk usage sampling interval.

monitor disk-usage interval interval

By default, the disk usage sampling interval is 300 seconds.

3.     Set the disk usage threshold.

monitor disk-usage [ slot slot-number ] disk disk-name threshold threshold-value

The default disk usage threshold varies by device model.

Monitoring resource usage

Configuring resource monitoring

About this task

The resource monitoring feature enables the device to monitor the available amounts of types of resources, for example, the space for ARP entries. The device samples the available amounts periodically and compares the samples with resource depletion thresholds to identify the resource depletion status.

The device supports a minor resource depletion threshold and a severe resource depletion threshold for each supported resource type.

·     If the available amount is equal to or less than the minor resource depletion threshold but greater than the severe resource depletion threshold, the resource type is in minor alarm state.

·     If the available amount is equal to or less than the severe resource depletion threshold, the resource type is in severe alarm state.

·     If the available amount increases above the minor resource depletion threshold, the resource type is in recovered state.

When a resource type enters severe alarm state, the device issues a severe alarm. If the resource type stays in severe alarm state, the device resends severe alarms periodically.

When a resource type enters minor alarm state, the device issues a minor alarm. If the resource type stays in minor alarm state or changes from severe alarm state to minor alarm state, the device identifies whether resending of minor resource depletion alarms is enabled. If the feature is disabled, the device does not issue additional minor alarms. If the feature is enabled, the device resends minor alarms periodically.

Resource depletion alarms can be sent to NETCONF, SNMP, and the information center to be encapsulated as NETCONF events, SNMP traps and informs, and log messages. For more information about NETCONF and SNMP, see Network Management and Monitoring Configuration Guide. For more information about information center, see "Configuring the information center."

Figure 3 Resource depletion alarms and alarm-removed notifications

Procedure

1.     Enter system view.

system-view

2.     Set resource depletion thresholds.

resource-monitor resource resource-name slot slot-number cpu cpu-number by-percent minor-threshold minor-threshold severe-threshold severe-threshold

The default settings vary by resource type. Use the display resource-monitor command to display the resource depletion thresholds.

3.     Specify destinations for resource depletion alarms.

resource-monitor output { netconf-event | snmp-notification | syslog } *

By default, resource depletion alarms are sent to NETCONF, SNMP, and the information center.

4.     Enable resending of minor resource depletion alarms.

resource-monitor minor resend enable

By default, resending of minor resource depletion alarms is enabled.

Displaying resource monitoring information

To display resource monitoring information, execute the following command in any view:

display resource-monitor [ resource resource-name ] [ slot slot-number [ cpu cpu-number ] ]

Monitoring device temperature

Displaying device temperature information

To display device temperature information, execute the following command in any view:

display environment [ slot slot-number ]

Monitoring fans

Setting the fan operating mode

About this task

Fans can operate in one of the following modes:

·     Balance mode—Operates at a speed higher than the silence mode and lower than the low-temperature mode to provide balanced noise control and cooling performance.

·     Low-temperature mode—Operates at a higher speed to provide better cooling service.

·     Silence mode—Operates at a lower speed to reduce the noise at the cost of lower cooling service quality. This mode applies to noise-sensitive environments.

Restrictions and guidelines

This feature is supported only on the default MDC.

Procedure

1.     Enter system view.

system-view

2.     Set the fan operating mode.

fan auto-control-mode { balance | low-temperature | silence }

Displaying fan information

Perform display tasks in any view.

·     Display fan tray operating status information.

display fan [ fan-id ] [ verbose ]

·     Display electronic label information for a fan tray.

display device manuinfo fan fan-id

Setting the port status detection timer

About this task

On a network enabled with loop detection or STP, the device starts a port status detection timer when a port is shut down by a protocol. If the port has been in down state before the timer expires, the device will set the port status to the port's physical status.

Procedure

1.     Enter system view.

system-view

2.     Set the port status detection timer.

shutdown-interval time

The default is 30 seconds.

Setting the operating mode for an interface card

About this task

Some interface cards can operate in multiple modes to provide different types of interfaces. Use this feature to set the operating mode for an interface card.

Restrictions and guidelines

For the new setting to take effect, you must perform one of the following tasks to activate the setting:

·     Restart the device.

·     Hot swap the interface card if the interface card supports hot swapping.

Procedure

1.     Enter system view.

system-view

2.     Set the operating mode for an interface card.

card-mode slot slot-number subslot subslot-number mode-name

Supported operating modes depend on the card model.

Verifying and diagnosing transceiver modules

Verifying transceiver modules

About this task

You can use one of the following methods to verify the genuineness of a transceiver module:

·     Display the key parameters of a transceiver module, including its transceiver type, connector type, central wavelength of the transmit laser, transfer distance, and vendor name.

·     Display its electronic label. The electronic label is a profile of the transceiver module and contains the permanent configuration, including the serial number, manufacturing date, and vendor name. The data was written to the transceiver module or the device's storage component during debugging or testing of the transceiver module or device.

The device regularly checks transceiver modules for their vendor names. If a transceiver module does not have a vendor name or the vendor name is not H3C, the device repeatedly outputs traps and log messages. For information about logging rules, see "Configuring the information center."

Procedure

To verify transceiver modules, execute the following commands in any view:

·     Display the key parameters of transceiver modules.

display transceiver interface [ interface-type interface-number ]

·     Display the electrical label information of transceiver modules.

display transceiver manuinfo interface [ interface-type interface-number ]

Diagnosing transceiver modules

About this task

The device provides the alarm and digital diagnosis functions for transceiver modules. When a transceiver module fails or is not operating correctly, you can perform the following tasks:

·     Check the alarms that exist on the transceiver module to identify the fault source.

·     Examine the key parameters monitored by the digital diagnosis function, including the temperature, voltage, laser bias current, TX power, and RX power.

Procedure

To diagnose transceiver modules, execute the following commands in any view:

·     Display transceiver alarms.

display transceiver alarm interface [ interface-type interface-number ]

·     Display the current values of the digital diagnosis parameters on transceiver modules.

display transceiver diagnosis interface [ interface-type interface-number ]

Specifying an ITU channel number for a transceiver module

About this task

ITU numbers and identifies fiber signals by wavelength and frequency. A transceiver module sends signals of a specific wavelength and frequency based on the specified ITU channel number.

This feature is required in dense wavelength division multiplexing scenarios.

Restrictions and guidelines

This feature is supported only on the HPE X130 10G SFP+ LC LH80 Tunable Transceiver (JL250A) module.

This feature is supported only on the SFP-XG-LH80-Tunable transceiver module.

The ITU channel number is saved in a register on the transceiver module. It is not saved to the configuration file.

Procedure

1.     Enter system view.

system-view

2.     Enter Ethernet interface view.

interface interface-type interface-number

3.     Specify an ITU channel number.

itu-channel channel-number

By default, the ITU channel number is 1.

Displaying ITU channel information

To display ITU channel information for transceiver modules, execute the following command in any view:

display transceiver itu-channel interface [ interface-type interface-number [ supported-channel ] ]

 

Displaying device hardware information

To display device hardware information, execute the following command in any view:

display device [ harddisk | usb ] [ slot slot-number [ subslot subslot-number ] | verbose ]

Displaying device hardware electronic label information

To display hardware electronic label information for the device, execute the following command in any view:

display device manuinfo [ slot slot-number [ subslot subslot-number ] ]

Displaying device hardware alarm information

To display device hardware alarm information, execute the following command in any view:

display alarm [ slot slot-number ]

 

  • Cloud & AI
  • InterConnect
  • Intelligent Computing
  • Intelligent Storage
  • Security
  • SMB Products
  • Intelligent Terminal Products
  • Product Support Services
  • Technical Service Solutions
All Services
  • Resource Center
  • Policy
  • Online Help
  • Technical Blogs
All Support
  • Become A Partner
  • Partner Policy & Program
  • Global Learning
  • Partner Sales Resources
  • Partner Business Management
  • Service Business
All Partners
  • Profile
  • News & Events
  • Online Exhibition Center
  • Contact Us
All About Us
新华三官网