Title | Size | Downloads |
---|---|---|
H3C HDM System Log Messages Reference-6W103-book.pdf | 750.24 KB |
- Table of Contents
- Related Documents
-
|
H3C HDM |
System Log Messages Reference |
|
|
Copyright © 2022 New H3C Technologies Co., Ltd. All rights reserved.
No part of this manual may be reproduced or transmitted in any form or by any means without prior written consent of New H3C Technologies Co., Ltd.
Except for the trademarks of New H3C Technologies Co., Ltd., any trademarks that may be mentioned in this document are the property of their respective owners.
The information in this document is subject to change without notice.
Contents
Dropped below the lower minor threshold
Dropped below the lower major threshold
Dropped below the lower critical threshold
Exceeded the upper minor threshold
Exceeded the upper major threshold
Exceeded the upper critical threshold
Dropped below the lower major threshold
Exceeded the upper major threshold
Exceeded the upper minor threshold
Exceeded the upper major threshold
Exceeded the upper critical threshold
Non-redundant:Sufficient Resources from Redundant
Non-redundant:Insufficient Resources
Processor Automatically Throttled
triggered an uncorrectable error
Machine Check Error---CPU core errors
Correctable Machine Check Error
Correctable Machine Check Error---CPU UPI errors
Correctable Machine Check Error---IOH UPI errors
Correctable Machine Check Error---IOH core errors
Correctable Machine Check Error---Intel VT-d errors
Correctable Machine Check Error---CPU core errors
Correctable Machine Check Error---Cbo error
Configuration Error---System is operating in KTI Link Slow Speed Mode
Power Supply Predictive Failure---PSU Self Check Failed
Power Supply Predictive Failure
Power Supply input lost (AC/DC)
Power Supply input lost or out-of-range
Power Supply input out-of-range - but present
Configuration error ---Vendor mismatch
Configuration error---Power supply rating mismatch
Exceeded the upper minor threshold
Power Supply Inactive/standby state
Power limit is exceeded over correction time limit
Power limit is exceeded over correction time limit
Correctable ECC or other correctable memory error
CPU triggered a correctable error
Uncorrectable ECC or other uncorrectable memory error
triggered an uncorrectable error
Parity---Memory Training Faulty Part Tracking Uncorrectable Error
Parity---Memory Receive Enable Training Error
Parity---Memory Write Leveling Training Error
Parity---Memory Write DqDqs Training Error
Parity---Memory Sense Amp Training Error
Parity---Warning Command Clock Training Error
Parity---An uncorrectable error occurs during the memory test phase
Parity---Memory Training Error
Parity---The number of correctable memory errors reached the error logging threshold
Parity---An error occurred on the DIMM slot
Parity---CMD eye width is too small
Parity---The command is not in the FNv table
Parity---CTL is not consistent with clock in timing, and the channel is isolated
Parity---Memory write flyby failed
Parity---Timing error occurred during signal line adjustment for memory write leveling training
Parity---Memory read DqDqs training failed
Parity---Memory receive enable training failed
Parity---Memory write leveling training failed
Parity---Memory write DqDqs training failed
Parity---An error occurs during memory test, and the rank is disabled
Parity---Failed to find the RxVref for data eye training
Parity---LRDIMM RCVEN training failed
Parity---RCVEN CYCLE training failed
Parity---Read delay training failed
Parity---Memory write leveling training failed
Parity---Coarse write leveling training failed
Parity---Write delay training failed
Parity---QxCA_CLK_NO_EYE training failed
Parity---mapped out because failed critical mask test at cold boot
Memory Device Disabled---the DIMM is disabled
Memory Device Disabled---the rank is disabled
Memory Device Disabled---Pmem Media disabled
Correctable ECC or other memory error limit reached
Configuration error---RDIMMs are installed on the server that supports only UDIMMs
Configuration error---UDIMMs are installed on the server that supports only RDIMMs
Configuration error---SODIMMs are installed on the server that supports only RDIMMs
Configuration error---The number of ranks per channel can be only 1, 2, or 4
Configuration error---The number of ranks in the channel exceeds 8
Configuration error---The CPU is not compatible with 3DS DIMMs
Configuration error---NVDIMMs with stepping lower than 0x10 are not supported
Configuration error---The CPU is not compatible with 16-GB single-rank DIMMs
Configuration error---The CPU is not compatible with the DIMMs
Configuration error---The frequency of the DIMM is not supported on the server
Configuration error---NVDIMMs are not compatible with the CPU
Configuration error---DCPMMs are not supported
Configuration error---Memory LockStep Disable Error
Configuration error---Memory Mirror Disable Error
Configuration error---Failed to enable the full mirror mode
Configuration error---Memory Rank Sparing Error
Configuration error---Failed to enable patrol scrubbing
Configuration error---The DDR-T memory module is installed in the white slot
Configuration error---2LM IMC memory Mismatch
Configuration error---ODT configuration errorThe channel is isolated
Configuration error---Failed to enable ADDDC
Configuration error---Failed to enable SDDC
Configuration error---DCPMM firmware version not supported
Configuration error---DCPMM firmware version not supported
Configuration error---NVMCTRL_MEDIA_NOTREADY
Configuration error---The DDR-T memory modules of the unexpected model are installed
Configuration error---Failed to set the VDD voltage of the DIMM
Configuration error---Too many RIR rules
Configuration error---The DIMMs for the CPU exceeded the limit
The disk triggered a media error
The disk triggered an uncorrectable error
System Firmware Error (POST Error)---CPU matching failure
System Firmware Error (POST Error)---Firmware (BIOS) ROM corruption detected
System Firmware Error (POST Error)---Load microcode failed
System Firmware Error (POST Error)---No system memory or invalid memory configuration
System Firmware Error (POST Error)---Memory Population Rule Error
System firmware error (POST error)---DIMM installation or compatibility error occurred
System firmware error (POST error)---No Memory Usable
System firmware error (POST error)---No DDR Memory Error
System firmware error (POST error)---DIMM Compatible Error(LRDIMM and RDIMM are installed)
System Firmware Error (POST Error)---No DIMMs present
System Firmware Error (POST Error)---No DDR memory in the system
System Firmware Error (POST Error)---No DIMM is available for memory-mapping operation
System Firmware Error (POST Error)---Different DIMM types detected
System Firmware Error (POST Error)---DIMM population error
System Firmware Error (POST Error)---A maximum of two quad-rank DIMMs can be populated per channel
System Firmware Error (POST Error)---DIMM voltage error
System Firmware Error (POST Error)---DDR3 and DDR4 DIMMs cannot be mixed
System Firmware Error (POST Error)---256-byte and 512-byte SPD devices cannot be mixed
System Firmware Error (POST Error)---3DS and non-3DS LRDIMMs cannot be mixed
System Firmware Error (POST Error)---DDR-T memory modules and UDIMMs cannot be mixed
System Firmware Error (POST Error)---Memory Unrecognized Initialization Error
System Firmware Hang---Unspecified
System firmware hang-----No DDR Memory Error
System firmware hang---DIMM Compatible Error(LRDIMM and RDIMM are installed)
System firmware hang---Memory Unrecognized Initialization Error
System Firmware Progress---Current Memory Ras Mode
System Firmware Error (POST Error)---No DIMMs installed for CPU
OS Watchdog NMI/Diagnostic Interrupt
OS Watchdog pre-timeout Interrupt-non-NMI
Timestamp Clock Synch---event is $1 of pair---SEL Timestamp Clock updated
Timestamp clock synch---BMC Time SYNC succeed
PCI: PCIE Hot Plug PCIe Pull Out
PCI: PCIE Hot Plug PCIe Insert
Power Button pressed---Physical button---Button pressed
Power Button pressed---Physical button---Button released
Power Button pressed---Virtual button---Power cycle command
Power Button pressed---Virtual button---Power off command
Power Button pressed---Virtual button---Power on command
Power Button pressed---Virtual button---Soft off command
Reset Button pressed---Virtual button---Reset command
FRU service request button---Physical button---Uid button pressed
Transition to Critical from less severe
Transition to Non-recoverable from less severe
Transition to Non-Recoverable from less severe
Monitor---Board found PSU output can't be enabled
Transition to Critical from less severe
Transition to Critical from less severe
Transition to Non-recoverable from less severe
System restart---due to fan error:power off
System Restart---due to fan error:power reset
System Restart---due to fan error:power cycle
Device disabled: PCIe module information not obtained
triggered an uncorrectable error
Slot/Connector Device installed/attached
Watchdog overflowAction:Timer expired
Watchdog overflowAction:Hard Reset
Watchdog overflowAction:Power Down
Watchdog overflowAction:Power Cycle
Watchdog overflowAction:Timer interrupt
Management controller off-line
Management controller off-line---BMC reset
Management controller off-line---HDM cold reboot
Management controller off-line---BMC WDT timeout event happened
Management controller off-line---BMC service restart
Management controller unavailable
Management controller unavailable---Adapter RAID-P460-B4 is in a fault condition
Sensor access degraded or unavailable--- Adapter RAID-P460-B4 has no response for 2 minutes
Sensor failure---Adapter RAID-P460-B4 has no response for 4 minutes
Battery low (predictive failure)
Management controller unavailable
System Source Monitor:Mem usage exceeds the threshold
System Source Monitor: Relieve resource alarm about Mem Usage
System Source Monitor:Cpu usage exceeds the threshold
System Source Monitor: Relieve resource alarm about Cpu Usage
Introduction
This document describes HDM log messages generated to notify the occurance and removal of system exceptions detected by sensors in the server. You can use this document to obtain message details and recommended actions for server maintenance.
Obtaining system log messages
You can obtain system log messages through the following methods:
· HDM Web interface—Access the HDM Web interface and click Remote O&M > Log > Log Download. On the Log Download tab, select to download the entire log or log entries for a period.
· Alert emails—Complete alert email settings to obtain log messages.
· Third-party platform—Complete SNMP settings to connect HDM to a third-party management platform, and obtain log messages from the platform.
· Redfish event subscription—If a remote subscription server is configured, Redfish uploads received log messages to the remote subscription server.
· IPMI commands—Use IPMItool commands to access the IPMI interface for HDM and enter commands to obtain event log messages.
System log severity level
Table 1 System log message severity levels
Severity |
Description |
Critical |
The target module might be powered off or the system might become unavailable. Actions must be taken immediately. |
Major |
The system or service modules, including computing, storage, communication, and data security, might fail to operate correctly and service interruption might occur. |
Minor |
Actions must be taken to prevent failure escalation, if necessary. |
Info |
Informational message. For example, a normal state change happened or an alarm is removed. No action is required. |
Using this document
This document explains messages in tables. Table 2 describes information provided in these tables.
Table 2 Message explanation table contents
Item |
Description |
Example |
Event code |
A hexadecimal code that uniquely represents a log message. The parity of the last character in the event code represents the alarm type: · Even—An alarm was generated. · Odd—An alarm was removed. |
0x02900002 |
Message text |
Presents the message description. The same message description might be reported by different types of sensors. |
Exceeded the upper major threshold.---Current reading:$1---Threshold reading:$2 |
Variable fields |
Briefly describes the variable fields in the order that they appear in the message text. The variable fields are numbered in the "$Number" form to help you identify their location in the message text. |
· $1: Current reading of the voltage sensor. · $2: Major overvoltage threshold of the voltage sensor. |
Severity level |
Provides the severity level of the message. |
Major |
Example |
Log example. |
Exceeded the upper major threshold.---Current reading:2.58---Threshold reading:2.56 |
Explanation |
Explains the message, including the event or error cause. |
The total input voltage exceeds the major overvoltage alarm threshold. To locate the alarm triggering component, see the sensor name on the Event Log page from the HDM Web interface. |
Recommended action |
Provides recommended actions. If the issue persists after the recommended actions have been taken, contact the technical support. |
1. Verify that the external power supply is operating correctly. 2. Access the HDM Web interface and verify that the power supply is operating correctly. 3. If the issue persists, contact Technical Support. |
Applicable products
This document is available for the following product models:
· H3C UniServer R4300 G5
· H3C UniServer R4330 G5
· H3C UniServer R4700 G5
· H3C UniServer R4900 G5
· H3C UniServer R4930 G5
· H3C UniServer R4950 G5
· H3C UniServer R5300 G5
· H3C UniServer R5500 G5
· H3C UniServer R6900 G5
· H3C UniServer B5700 G5
· H3C UniServer R2700 G3
· H3C UniServer R2900 G3
· H3C UniServer R4100 G3
· H3C UniServer R4300 G3
· H3C UniServer R4400 G3
· H3C UniServer R4700 G3
· H3C UniServer R4900 G3
· H3C UniServer R4950 G3
· H3C UniServer R5300 G3
· H3C UniServer R6700 G3
· H3C UniServer R6900 G3
· H3C UniServer R8900 G3
· H3C UniServer B5700 G3
· H3C UniServer B5800 G3
· H3C UniServer B7800 G3
· H3C UniServer E3200 G3
Event log messages
This section contains event log messages.
Temperature
Dropped below the lower minor threshold
Event code |
0x01000002 |
Message text |
Dropped below the lower minor threshold.---Current reading:$1---Threshold reading:$2 |
Variable fields |
$1: Current reading of the temperature sensor. $2: Value of the lower minor temperature alarm threshold. |
Severity level |
Minor |
Example |
Dropped below the lower minor threshold.---Current reading:2---Threshold reading:10 |
Explanation |
The current temperature detected by a temperature sensor dropped below the lower minor temperature alarm threshold. |
Recommended action |
1. Verify that the temperature of the equipment room is as required. 2. Log in to HDM, access the Fans page, and verify if the fan speed is too high. If yes, adjust the fan speed mode or fan speed level. 3. If the issue persists, contact Technical Support. |
Dropped below the lower major threshold
Event code |
0x01200002 |
Message text |
Dropped below the lower major threshold.---Current reading:$1---Threshold reading:$2 |
Variable fields |
$1: Current reading of the temperature sensor. $2: Value of the lower major temperature alarm threshold. |
Severity level |
Major |
Example |
Dropped below the lower major threshold.---Current reading:2---Threshold reading:5 |
Explanation |
The current temperature detected by a temperature sensor dropped below the lower major temperature alarm threshold. |
Recommended action |
1. Verify that the temperature of the equipment room is as required. 2. Log in to HDM, access the Fans page, and verify if the fan speed is too high. If yes, adjust the fan speed mode or fan speed level. 3. If the issue persists, contact Technical Support. |
Dropped below the lower critical threshold
Event code |
0x01400002 |
Message text |
Dropped below the lower critical threshold.---Current reading:$1---Threshold reading:$2 |
Variable fields |
$1: Current reading of the temperature sensor. $2: Value of the lower critical temperature alarm threshold. |
Severity level |
Critical |
Example |
Dropped below the lower critical threshold.---Current reading:2---Threshold reading:3 |
Explanation |
The current temperature detected by a temperature sensor dropped below the lower critical temperature alarm threshold. |
Recommended action |
1. Verify that the temperature of the equipment room is as required. 2. Log in to HDM, access the Fans page, and verify if the fan speed is too high. If yes, adjust the fan speed mode or fan speed level. 3. If the issue persists, contact Technical Support. |
Exceeded the upper minor threshold
Event code |
0x01700002 |
Message text |
Exceeded the upper minor threshold.---Current reading:$1---Threshold reading:$2 |
Variable fields |
$1: Current reading of the temperature sensor. $2: Value of the minor overtemperature alarm threshold. |
Severity level |
Minor |
Example |
Exceeded the upper minor threshold.---Current reading:100---Threshold reading:80 |
Explanation |
The current temperature detected by a temperature sensor exceeded the minor overtemperature alarm threshold. |
Recommended action |
1. Verify that the temperature of the equipment room is as required. 2. Verify that the server's air inlet and outlet are not blocked. 3. Log in to HDM, access the Fans page, and verify that all the fans are operating correctly, and the fan speed is not too low. If the fan speed is low, adjust the fan speed mode or fan speed level. 4. If the issue persists, contact Technical Support. |
Exceeded the upper major threshold
Event code |
0x01900002 |
Message text |
Exceeded the upper major threshold.---Current reading:$1---Threshold reading:$2 |
Variable fields |
$1: Current reading of the temperature sensor. $2: Value of the major overtemperature alarm threshold. |
Severity level |
Major |
Example |
Exceeded the upper major threshold.---Current reading:100---Threshold reading:85 |
Explanation |
The current temperature detected by a temperature sensor exceeded the major overtemperature alarm threshold. |
Recommended action |
1. Verify that the temperature of the equipment room is as required. 2. Verify that the server's air inlet and outlet are not blocked. 3. Log in to HDM, access the Fans page, and verify that all the fans are operating correctly, and the fan speed is not too low. If the fan speed is low, adjust the fan speed mode or fan speed level. 4. If the issue persists, contact Technical Support. |
Exceeded the upper critical threshold
Event code |
0x01b00002 |
Message text |
Exceeded the upper critical threshold.---Current reading:$1---Threshold reading:$2 |
Variable fields |
$1: Current reading of the temperature sensor. $2: Value of the critical overtemperature alarm threshold. |
Severity level |
Critical |
Example |
Exceeded the upper critical threshold.---Current reading:100---Threshold reading:90 |
Explanation |
The current temperature detected by a temperature sensor exceeded the critical overtemperature alarm threshold. |
Recommended action |
1. Verify that the temperature of the equipment room is as required. 2. Verify that the server's air inlet and outlet are not blocked. 3. Log in to HDM, access the Fans page, and verify that all the fans are operating correctly, and the fan speed is not too low. If the fan speed is low, adjust the fan speed mode or fan speed level. 4. If the issue persists, contact Technical Support. |
Voltage
State Asserted
Event code |
0x02100006 |
Message text |
State Asserted |
Variable fields |
N/A |
Severity level |
Major |
Example |
State Asserted |
Explanation |
Overvoltage was detected on the system board. To locate the alarm triggering component, see the sensor name on the Event Log page from the HDM Web interface. |
Recommended action |
1. Power off and then restart the server. 2. If the issue persists, contact Technical Support. |
Dropped below the lower major threshold
Event code |
0x02200002 |
Message text |
Dropped below the lower major threshold.---Current reading:$1---Threshold reading:$2 |
Variable fields |
$1: Current value of the total input voltage. $2: Lower major voltage alarm threshold. |
Severity level |
Major |
Example |
Dropped below the lower major threshold.---Current reading:2.58---Threshold reading:2.60 |
Explanation |
The total input voltage dropped below the lower major voltage alarm threshold. To locate the alarm triggering component, see the sensor name on the Event Log page from the HDM Web interface. |
Recommended action |
1. Verify that the external power supply is operating correctly. 2. Log in to HDM and verify that the power supply is operating correctly. 3. Power off and then restart the server. 4. If the issue persists, contact Technical Support. |
Exceeded the upper major threshold
Event code |
0x02900002 |
Message text |
Exceeded the upper major threshold.---Current reading:$1---Threshold reading:$2 |
Variable fields |
$1: Current value of the total input voltage. $2: Upper major voltage alarm threshold. |
Severity level |
Major |
Example |
Exceeded the upper major threshold.---Current reading:2.58---Threshold reading:2.56 |
Explanation |
The total input voltage exceeded the upper major voltage alarm threshold. To locate the alarm triggering component, see the sensor name on the Event Log page from the Web interface. |
Recommended action |
1. Verify that the external power supply is operating correctly. 2. Log in to HDM and verify that the power supply is operating correctly. 3. Power off and then restart the server. 4. If the issue persists, contact Technical Support. |
Current
State Asserted
Event code |
0x03100006 |
Message text |
State Asserted |
Variable fields |
N/A |
Severity level |
Critical |
Example |
State Asserted |
Explanation |
Overcurrent was detected for a component on the system board. |
Recommended action |
1. Log in to HDM, access the Logs page, and verify that no alarm is present for the power supply or system board. 2. Verify that power can be supplied to the server correctly and the voltage is within the normal range. 3. If the issue persists, contact Technical Support. |
Exceeded the upper minor threshold
Event code |
0x03700002 |
Message text |
Exceeded the upper minor threshold.---Current reading:$1---Threshold reading:$2 |
Variable fields |
$1: Real-time current value. $2: Value for the minor current alarm threshold. |
Severity level |
Minor |
Example |
Exceeded the upper minor threshold.---Current reading:20---Threshold reading:18 |
Explanation |
The current value for a component on the system board exceeded the minor current alarm threshold. |
Recommended action |
1. Verify that the threshold has a reasonable value. 2. Verify that the system is not overloaded according to the server rated power. 3. If the issue persists, contact Technical Support. |
Exceeded the upper major threshold
Event code |
0x03900002 |
Message text |
Exceeded the upper major threshold.---Current reading:$1---Threshold reading:$2 |
Variable fields |
$1: Real-time current value. $2: Value for the major current alarm threshold. |
Severity level |
Major |
Example |
Exceeded the upper major threshold.---Current reading:25---Threshold reading:22 |
Explanation |
The current value for a component on the system board exceeded the major current alarm threshold. |
Recommended action |
1. Verify that the threshold has a reasonable value. 2. Verify that the system is not overloaded according to the server rated power. 3. If the issue persists, contact Technical Support. |
Exceeded the upper critical threshold
Event code |
0x03b00002 |
Message text |
Exceeded the upper critical threshold.---Current reading:$1---Threshold reading:$2 |
Variable fields |
$1: Real-time current value. $2: Value for the critical current alarm threshold. |
Severity level |
Critical |
Example |
Exceeded the upper critical threshold.---Current reading:30---Threshold reading:25 |
Explanation |
The current value for a component on the system board exceeded the critical current alarm threshold. |
Recommended action |
1. Verify that the threshold has a reasonable value. 2. Verify that the system is not overloaded according to the server rated power. 3. If the issue persists, contact Technical Support. |
Fan
Transition to Running
Event code |
0x04000015 |
Message text |
Transition to Running. |
Variable fields |
N/A |
Severity level |
Info |
Example |
Transition to Running |
Explanation |
The fan presence state was removed, indicating the fan state changes from present to absent. |
Recommended action |
1. Verify that the fan is present. 2. Re-install the fan. 3. If the issue persists, contact Technical Support. |
Fully Redundant
Event code |
0x04000017 |
Message text |
Fully Redundant. |
Variable fields |
N/A |
Severity level |
Major |
Example |
Fully Redundant |
Explanation |
A fan redundancy error is present because a fan is absent, or a fan was removed or failed. |
Recommended action |
1. Re-install the removed fans. 2. Remove and re-install the fans, and make sure the fans are in good contact. 3. If a fan status sensor reports an error, replace the faulty fan. 4. If the issue persists, contact Technical Support. |
Non-redundant:Sufficient Resources from Redundant
Event code |
0x04300016 |
Message text |
Non-redundant:Sufficient Resources from Redundant |
Variable fields |
N/A |
Severity level |
Major |
Example |
Non-redundant:Sufficient Resources from Redundant |
Explanation |
The fans were faulty or absent and the operating fans can meet the heat dissipation requirements. |
Recommended action |
1. Re-install the removed fans. 2. Remove and re-install the fans, and make sure the fans are in good contact. 3. If a fan status sensor reports an error, replace the faulty fan. 4. If the issue persists, contact Technical Support. |
Transition to Off Line
Event code |
0x04400014 |
Message text |
Transition to Off Line. |
Variable fields |
N/A |
Severity level |
Info |
Example |
Transition to Off Line |
Explanation |
A fan became offline. |
Recommended action |
1. Re-install the removed fans. 2. Remove and re-install the fans, and make sure the fans are in good contact. 3. If a fan status sensor reports an error, replace the faulty fan. 4. If the issue persists, contact Technical Support. |
Non-redundant:Insufficient Resources
Event code |
0x04500016 |
Message text |
Non-redundant:Insufficient Resources |
Variable fields |
N/A |
Severity level |
Major |
Example |
Non-redundant:Insufficient Resources |
Explanation |
The fans were faulty or absent and the operating fans cannot meet the heat dissipation requirements. |
Recommended action |
1. Re-install the removed fans. 2. If a fan status sensor reports an error, replace the faulty fan. 3. Remove and re-install the fans, and make sure the fans are in good contact. 4. If the issue persists, contact Technical Support. |
Transition to Degraded
Event code |
0x04600014 |
Message text |
Transition to Degraded. |
Variable fields |
N/A |
Severity level |
Major |
Example |
Transition to Degraded |
Explanation |
The performance of a fan degraded. |
Recommended action |
1. Log in to HDM and view the fan speed. If the speed is low, a fan might have aged. If the speed is almost zero, a fan might be blocked or have failed. 2. Verify that the fans are not blocked. 3. If a fan status sensor reports an error, replace the faulty fan. 4. Replace the aged fans. 5. If the issue persists, contact Technical Support. |
Install Error
Event code |
0x04800014 |
Message text |
Install Error. |
Variable fields |
N/A |
Severity level |
Minor |
Example |
Install Error |
Explanation |
The fan was incorrectly installed. |
Recommended action |
1. Verify that the fans are installed as instructed. For more information about the installation principles, see the user guide for the server. 2. If the issue persists, contact Technical Support. |
Physical security
General Chassis Intrusion
Event code |
0x050000de |
Message text |
General Chassis Intrusion. |
Variable fields |
N/A |
Severity level |
Minor |
Example |
General Chassis Intrusion |
Explanation |
The chassis-open alarm module detected that the access panel was removed from the server. |
Recommended action |
1. Verify that the access panel was removed. 2. Verify that the access panel is installed correctly. 3. Verify that the chassis-open alarm module has a good contact with the chassis ear. 4. If the issue persists, contact Technical Support. |
LAN Leash Lost
Event code |
0x054000de |
Message text |
LAN Leash Lost. |
Variable fields |
N/A |
Severity level |
Info |
Example |
LAN Leash Lost |
Explanation |
The LAN cable was removed from a BMC's NCSI during BMC initialization, causing an Ethernet adapter (NIC) to lose connection. |
Recommended action |
1. Verify that an Ethernet adapter was disabled in the OS. 2. Verify that the message was reported on a power-on or power-off operation. 3. Verify that the Ethernet cable is connected correctly to the shared network port. 4. Disable the shared network port if the shared network port is not necessary. 5. If the issue persists, contact Technical Support. |
Processor
IERR
Event code |
0x070000de |
Message text |
Intel: $1 $2 err---Socket $3 AMD: GMI/xGMI err---Socket$1 Die$2 LinkID$3 |
Variable fields |
· Intel: ¡ $1: Signal type. Options include MSMI and CATERR. ¡ $2: Error type. Options include IERR and MCERR. ¡ $3: CPU number. · AMD: ¡ $1: CPU number. ¡ $2: Die number. ¡ $3: Link number. |
Severity level |
Critical |
Example |
Intel: CATERR IERR err---Socket 1 AMD: GMI/xGMI err---Socket1 Die1 LinkID1 |
Explanation |
A processor internal error, such as a Package Control Unit (PCU) uncorrectable error, occurred. |
Recommended action |
1. Upgrade the BIOS and HDM firmware to the latest version. 2. Review logs to troubleshoot the issue as instructed. 3. If the issue persists, contact Technical Support. |
State Asserted
Event code |
0x07100006 |
Message text |
State Asserted. |
Variable fields |
N/A |
Severity level |
Major |
Example |
State Asserted |
Explanation |
A processor was overheated. |
Recommended action |
1. Log in to HDM and verify that the fans are operating correctly. 2. If low-speed alarms are present, re-install or replace the faulty fans. 3. View resource summary to identify system service loads. If the system is overloaded, close uncritical services to reduce service loads. 4. Verify that the ambient temperature of the server is within the normal operation range. 5. Verify that the air inlets and outlets are not blocked. 6. Power off the server, and verify that the processor heatsink has a good contact. Smear the thermal grease onto the heatsink, install the heatsink, and power on the server. 7. If the issue persists, contact Technical Support. |
CPU Critical Temperature
Event code |
0x071000de |
Message text |
CPU Critical Temperature. |
Variable fields |
N/A |
Severity level |
Critical |
Example |
CPU Critical Temperature |
Explanation |
The temperature of a processor exceeded the critical overtemperature alarm threshold. |
Recommended action |
1. Log in to HDM and verify that the fans are operating correctly. 2. If a low-speed alarm is present, re-install or replace the faulty fan. 3. View resource summary to identify system service loads. If the system is overloaded, close uncritical services to reduce service loads. 4. Verify that the temperature in the equipment room is within the normal range. 5. Verify that the air inlets and outlets are not blocked. 6. Power off the server, and verify that the processor heatsink has a good contact. Smear the thermal grease onto the heatsink, install the heatsink, and power on the server. 7. If the issue persists, contact Technical Support. |
Thermal Trip
Event code |
0x071000de |
Message text |
Thermal Trip |
Variable fields |
N/A |
Severity level |
Critical |
Example |
Thermal Trip |
Explanation |
A processor was overheated, which might cause system power-off. If only Thermal Trip was reported, this alarm might be triggered by dramatic change of processor loads. If the ventilation policy cannot adapt itself in time to the load change, the processor temperature might increase. |
Recommended action |
1. Log in to HDM and verify that all fans are operating correctly. 2. If a low-speed alarm is present, re-install or replace the faulty fan. 3. View resource summary to identify system service loads. If the system is overloaded, close uncritical services to reduce service loads. 4. Verify that the temperature in the equipment room is within the normal range. 5. Verify the air inlets and outlets are not blocked. 6. Power off the server, and verify that the processor heatsink has a good contact. Smear the thermal grease onto the heatsink, install the heatsink, and power on the server. 7. If the issue persists, contact Technical Support.. |
FRB1/BIST failure
Event code |
0x072000de |
Message text |
FRB1/BIST failure |
Variable fields |
N/A |
Severity level |
Minor |
Example |
FRB1/BIST failure |
Explanation |
The processor core BIST failed. |
Recommended action |
1. Power off and power on the server to clear the alarm. 2. If the issue persists, replace the processors. 3. If the issue persists, contact Technical Support. |
Processor Presence detected
Event code |
0x077000df |
Message text |
Processor Presence detected. |
Variable fields |
N/A |
Severity level |
Critical |
Example |
Processor Presence detected |
Explanation |
The system detected the absence or misinstallation of the primary processor. |
Recommended action |
1. Verify that the primary processor is installed correctly. 2. Replace the faulty primary processor. 3. If the issue persists, contact Technical Support. |
Processor Automatically Throttled
Event code |
0x07a000de |
Message text |
Processor Automatically Throttled---due to fan error. |
Variable fields |
N/A |
Severity level |
Minor |
Example |
Processor Automatically Throttled---due to fan error |
Explanation |
The processor was underclocked because of an overtemperature condition. Overtemperature might occur if a fan fails. |
Recommended action |
1. Verify that the heat dissipation setting meets the requirements of the running services. 2. Verify that the temperature in the equipment room is within the normal range and the air inlet and outlet are not blocked. 3. Verify that the fans are not blocked and are operating correctly. 4. Replace the faulty fans. 5. If the issue persists, contact Technical Support. |
Machine Check Exception
Event code |
0x07b000de |
Message text |
Machine Check Exception---$1---$2---Location: Socket:$3 |
Variable fields |
$1: Error type. $2: Indicates whether the error occurred during this system boot. Options include: ¡ Current Boot Error. ¡ Last Boot Error. $3: CPU number. |
Severity level |
Critical |
Example |
Machine Check Exception---SMN---Last Boot Error---Location: Socket:1 |
Explanation |
A TWIX, WAFL, or SMU uncorrectable error occurred on the AMD server. |
Recommended action |
1. Upgrade the BIOS and HDM firmware to the latest version. 2. Review event logs to locate the failed processor or other components. 3. Reboot the device. 4. Verify that the processor and memories are operating correctly. 5. If the issue persists, contact Technical Support. |
triggered an uncorrectable error
Event code |
0x07b000de |
Message text |
CPU $1 triggered an uncorrectable error. |
Variable fields |
$1: CPU number. |
Severity level |
Critical |
Example |
CPU 1 triggered an uncorrectable error. |
Explanation |
An IERR or MCERR error triggered an uncorrectable error on the processor. |
Recommended action |
1. Upgrade the BIOS and HDM firmware to the latest version. 2. Review event logs to locate the failed memory, PCIe module, or processor. 3. Power off the server, and replace the failed component. 4. Replace the system board. 5. If the issue persists, contact Technical Support. |
Machine Check Error
Event code |
0x07b100de |
Message text |
Machine Check Error ---location: Processor:$1 ---IIO Stack number:$2 ---$3---$4 |
Variable fields |
$1: CPU number. $2: IIO Stack number (IIO port number). $3: Current boot or last boot. $4: Error type. |
Severity level |
Critical |
Example |
Machine Check Exception---Location: Processor:1 ---IIO Stack number:1 --Last Boot---ITC Error:ECC uncorrectable error in the ITC dat_dword RF |
Explanation |
Internal uncorrectable errors were detected on the processor, such as VT-d errors, ITC errors, OTC errors, DMA errors, IRP errors, and Ring errors. This error also triggers other alarms. |
Recommended action |
1. Review event logs. 2. If the issue persists, contact Technical Support. |
Machine Check Error---CPU core errors
Event code |
0x07b150de |
Message text |
Machine Check Error ---CPU core errors --- ErrorType:$1---Location: Processor:$2 core MCA bank: $(3) |
Variable fields |
$1: General error type. $2: CPU number. $3: Error type. |
Severity level |
Critical |
Example |
Machine Check Exception---CPU core errors--ErrorType:Unknow--Fatal Error--Last Boot---Location: Processor:1 core MCA bank: instruction fetch unit |
Explanation |
Internal uncorrectable errors were detected on the processor, such as CPU core errors. |
Recommended action |
1. Review event logs. 2. If the issue persists, contact Technical Support. |
triggered a correctable error
Event code |
0x07c000de |
Message text |
CPU $1 triggered a correctable error. |
Variable fields |
$1: CPU number. |
Severity level |
Minor |
Example |
CPU 1 triggered a correctable error. |
Explanation |
An IERR or MCERR error triggered a correctable error on the processor. |
Recommended action |
1. Upgrade the BIOS and HDM firmware to the latest version. 2. Review event logs to locate the failed CPU or other components. 3. Power off the server, and replace the failed component. 4. Replace the system board. 5. If the issue persists, contact Technical Support. |
Correctable Machine Check Error
Event code |
0x07c100de |
Message text |
Correctable Machine Check Error ---location: Processor:$1 ---IIO Stack number:$2 ---$3---$4 |
Variable fields |
$1: CPU number. $2: IIO Stack number. $3: Current boot or last boot. $4: Error type. |
Severity level |
Minor |
Example |
Correctable Machine Check Error---Location: Processor:1 ---IIO Stack number:1 --Last Boot---DMA Error:Descriptor Count Error |
Explanation |
Internal correctable errors were detected on the processor, such as VT-d errors, ITC errors, OTC errors, DMA errors, IRP errors, and Ring errors. |
Recommended action |
1. Review event logs and troubleshoot the present errors. 2. If the issue persists, contact Technical Support. |
Correctable Machine Check Error---CPU UPI errors
Event code |
0x07c110de |
Message text |
Correctable Machine Check Error ---CPU UPI errors ---Location: Processorr:$1 UPI port number:$2 |
Variable fields |
$1: CPU number. $2: UPI port. |
Severity level |
Minor |
Example |
Correctable Machine Check Error---CPU UPI errors---Location: Processor:2 UPI port number:0x1 |
Explanation |
Internal correctable errors were detected on the processor, such as CPU UPI errors. |
Recommended action |
1. Review event logs and troubleshoot the present errors. 2. If the issue persists, contact Technical Support. |
Correctable Machine Check Error---IOH UPI errors
Event code |
0x07c120de |
Message text |
Correctable Machine Check Error ---IOH UPI errors ---Location: Processor:$1 UPI port number:$2 ---Coherent interface (IRP) local group error code:$3 |
Variable fields |
$1: CPU number. $2: UPI port. $3: Error code. |
Severity level |
Minor |
Example |
Correctable Machine Check Error---IOH UPI errors---Location: Processor:1 UPI port number:0x1---Coherent interface (IRP) local group error code:0x6 |
Explanation |
Internal correctable errors were detected on the processor, such as IOH UPI errors. |
Recommended action |
1. Review event logs and troubleshoot the present errors. 2. If the issue persists, contact Technical Support. |
Correctable Machine Check Error---IOH core errors
Event code |
0x07c130de |
Message text |
Correctable Machine Check Error ---IOH core errors ---Location:Processor:$1 ---IIO core local group error code:$2 |
Variable fields |
$1: CPU number. $2: Error code. |
Severity level |
Minor |
Example |
Correctable Machine Check Error---IOH core errors---Location: Processor:2---IIO core local group error code:0x6 |
Explanation |
Internal correctable errors were detected on the processor, such as IOH core errors. |
Recommended action |
1. Review event logs and troubleshoot the present errors. 2. If the issue persists, contact Technical Support. |
Correctable Machine Check Error---Intel VT-d errors
Event code |
0x07c140de |
Message text |
Correctable Machine Check Error ---Intel VT-d errors ---Location: Processor:$1 ---Intel VT-d local group error code:$2 |
Variable fields |
$1: CPU number. $2: Error code. |
Severity level |
Minor |
Example |
Correctable Machine Check Error---VT-d errors---Location: Processor:2---VT-d local group error code:0x6 |
Explanation |
Internal correctable errors were detected on the processor, such as VT-d errors. |
Recommended action |
1. Review event logs and troubleshoot the present errors. 2. If the issue persists, contact Technical Support. |
Correctable Machine Check Error---CPU core errors
Event code |
0x07c150de |
Message text |
Correctable Machine Check Error ---CPU core errors ---ErrorType:$1 ---Location: Processor:$2 core MCA bank: $3 |
Variable fields |
$1: General error type. $2: CPU number. $3: Error type. |
Severity level |
Minor |
Example |
Correctable Machine Check Error---CPU core errors--ErrorType:Unknow--Current Boot---Location: Processor:2 core MCA bank: mid level cache |
Explanation |
Internal correctable errors were detected on the processor, such as CPU core errors. |
Recommended action |
1. Review event logs and troubleshoot the present errors. 2. If the issue persists, contact Technical Support. |
Correctable Machine Check Error---Cbo error
Event code |
0x07c160de |
Message text |
Correctable Machine Check Error ---Cbo error--location: CPU core ID:$1 thread ID:$2 caching agent MCA bank: Cbo$3 |
Variable fields |
$1: Core number. $2: Thread number. $3: Cbo number. |
Severity level |
Minor |
Example |
Correctable Machine Check Error---Cbo error---Location: CPU core ID:0x0 thread ID:0x0 caching agent MCA bank: Cbo0 |
Explanation |
Internal correctable errors were detected on the processor, such as Cbo error. |
Recommended action |
1. Review event logs and troubleshoot the present errors. 2. If the issue persists, contact Technical Support. |
Configuration Error---System is operating in KTI Link Slow Speed Mode
Event code |
0x075d7010 |
Message text |
Configuration Error---System is operating in KTI Link Slow Speed Mode- Location:CPU:$1 |
Variable fields |
$1: Core number. |
Severity level |
Minor |
Example |
Configuration Error---System is operating in KTI Link Slow Speed Mode- Location:CPU:1 |
Explanation |
The system is operating in Keizer Technology Interconnect (KTI) low speed mode. |
Recommended action |
1. Verify that the processors are installed correctly as instructed. For more information about the installation principles, see the user guide for the server. 2. If the issue persists, contact Technical Support. |
Power supply
Presence detected
Event code |
0x080000df |
Message text |
Presence detected. |
Variable fields |
N/A |
Severity level |
Info |
Example |
Presence detected |
Explanation |
The power supply was removed. |
Recommended action |
1. Verify that the power module is not removed. 2. Verify that the power module is installed correctly. 3. If the issue persists, contact Technical Support. |
Redundancy Lost
Event code |
0x08100016 |
Message text |
Redundancy Lost. |
Variable fields |
N/A |
Severity level |
Major |
Example |
Redundancy Lost |
Explanation |
The power supply redundancy was lost. |
Recommended action |
1. Verify that the environment is normal. 2. Verify that no power supply is removed. 3. Verify that the power supplies have good contacts with the power cords. 4. Verify that all power supplies are operating correctly. 5. If the issue persists, contact Technical Support. |
Power Supply Failure detected
Event code |
0x081000de |
Message text |
Power Supply Failure detected. |
Variable fields |
N/A |
Severity level |
Major |
Example |
Power Supply Failure detected |
Explanation |
Power supply detection failed. |
Recommended action |
1. Verify that the power supply fans are operating correctly. 2. Re-install the power supplies. 3. Verify that the input voltage of the power supply is normal. 4. Replace the faulty power supply. 5. If the issue persists, contact Technical Support. |
Power Supply Predictive Failure---PSU Self Check Failed
Event code |
0x082000de |
Message text |
Power Supply Predictive Failure---PSU Self Check Failed---Id: $1 |
Variable fields |
$1: Number of a power supply. |
Severity level |
Minor |
Example |
Power Supply Predictive Failure---PSU Self Check Failed---Id: 1 |
Explanation |
Power supply self-check failed. |
Recommended action |
1. Verify that the power supply LED is operating correctly. 2. Verify that the power supply fans are operating correctly. 3. Verify that the power supply is compatible with the server. 4. If the issue persists, contact Technical Support. |
Power Supply Predictive Failure
Event code |
0x082000de |
Message text |
Power Supply Predictive Failure. |
Variable fields |
N/A |
Severity level |
Minor |
Example |
Power Supply Predictive Failure |
Explanation |
A minor alarm occurred on the power supply. |
Recommended action |
1. Verify that the power supply LED is operating correctly. 2. Verify that the power supply fans are operating correctly. 3. Verify that the input voltage of the power supply is normal. 4. If the issue persists, contact Technical Support. |
Power Supply input lost (AC/DC)
Event code |
0x083000de |
Message text |
Power Supply input lost (AC/DC). |
Variable fields |
N/A |
Severity level |
Major |
Example |
Power Supply input lost (AC/DC) |
Explanation |
The AC or DC power input is abnormal. |
Recommended action |
1. Verify that all power cords are not damaged and are correctly connected. 2. Verify that all power supplies are correctly installed. 3. Verify that the power supply fans are operating correctly. 4. Verify that the power input is normal. 5. If the issue persists, contact Technical Support. |
Power Supply input lost or out-of-range
Event code |
0x084000de |
Message text |
Power Supply input lost or out-of-range. |
Variable fields |
N/A |
Severity level |
Major |
Example |
Power Supply input out-of-range |
Explanation |
No power supply was present or the power input was beyond the range. |
Recommended action |
1. Verify that the power supply has not been cut off manually. 2. Verify that the input voltage of the power supply is normal. 3. Verify that the power cords and power modules are installed correctly. 4. Re-install the power supplies. Make sure the power supplies have a good contact. 5. Verify that the power supply fans are operating correctly. 6. If the issue persists, contact Technical Support. |
Power Supply input out-of-range - but present
Event code |
0x085000de |
Message text |
Power Supply input out-of-range - but present. |
Variable fields |
N/A |
Severity level |
Major |
Example |
Power Supply input out-of-range - but present |
Explanation |
The input voltage was too low or too high. |
Recommended action |
1. Verify that the input voltage of the power supply is normal. 2. Verify that the power cords and power modules are installed correctly. 3. Re-install the power supplies. Make sure the power supplies have a good contact. 4. Verify that the power supply fans are operating correctly. 5. If the issue persists, contact Technical Support. |
Configuration error ---Vendor mismatch
Event code |
0x086000de |
Message text |
Configuration error ---Vendor mismatch. |
Variable fields |
N/A |
Severity level |
Minor |
Example |
Configuration error ---Vendor mismatch |
Explanation |
An incompatible power supply was installed. |
Recommended action |
1. Verify that all power supplies are compatible with the server. 2. If the issue persists, contact Technical Support. |
Configuration error---Power supply rating mismatch
Event code |
0x086000de |
Message text |
Configuration error---Power supply rating mismatch:PSU$1,POUT:$2 |
Variable fields |
$1: Power supply ID. $2: Output power of the power supply. |
Severity level |
Minor |
Example |
Configuration error---Power supply rating mismatch:PSU1,POUT:2000 |
Explanation |
The rated power of the installed power supplies was inconsistent. |
Recommended action |
1. If the rated power of the installed power supplies is consistent, remove and install the power supplies in sequence. 2. If the rated power of the installed power supplies is inconsistent, replace the power supplies to make sure they are of the same rated power. 3. If the issue persists, contact Technical Support. |
Exceeded the upper minor threshold
Event code |
0x08700002 |
Message text |
Exceeded the upper minor threshold. ---Current reading:$1---Threshold reading:$2 |
Variable fields |
$1: Current reading. $2: Total power alarm threshold. |
Severity level |
Minor |
Example |
Exceeded the upper minor threshold.---Current reading:2030---Threshold reading:493 |
Explanation |
The current total power input exceeded the upper minor power alarm threshold. |
Recommended action |
1. Log in to HDM and verify that the alarm threshold is appropriate. 2. Log in to HDM and verify that the total power is not too high. 3. Verify that the total power of the power supplies can meet service requirements. 4. If the issue persists, contact Technical Support. |
Power Supply Inactive/standby state
Event code |
0x087000df |
Message text |
Power Supply Inactive/standby state. |
Variable fields |
N/A |
Severity level |
Info |
Example |
Power Supply Inactive/standby state |
Explanation |
A power supply exited the cold standby state. If power redundancy is configured, a standby power supply automatically exits the cold standby state and supplies power to the server when the system power consumption is too high. |
Recommended action |
1. Log in to HDM and verify if the total power of the server is too high. 2. If the issue persists, contact Technical Support. |
Power limit is exceeded over correction time limit
Event code |
0x095000de |
Message text |
Power limit is exceeded over correction time limit---Current Power: $1W. |
Variable fields |
$1: Current power threshold value. |
Severity level |
Minor |
Example |
Power limit is exceeded over correction time limit---Current Power: 2000W. |
Explanation |
The time during which the power was higher than the threshold exceeded the correction time limit. |
Recommended action |
1. Adjust the power cap value or the server work load. 2. If the issue persists, contact Technical Support. |
Power limit is exceeded over correction time limit
Event code |
0x095010de |
Message text |
Power limit is exceeded over correction time limit---GPU Current Power: $1W. |
Variable fields |
$1: Configured power threshold value. |
Severity level |
Minor |
Example |
Power limit is exceeded over correction time limit---GPU Current Power: 2000W. |
Explanation |
The time during which the power is higher than the threshold exceeds the correction time limit. |
Recommended action |
1. Adjust the power cap value or the GPU work load. 2. If the issue persists, contact Technical Support. |
Memory
Correctable ECC or other correctable memory error
Event code |
0x0c0000de |
Message text |
Correctable ECC or other correctable memory error--$1-Location:CPU:$2 MEM CTRL:$3 CH:$4 DIMM:$5 $6 |
Variable fields |
$1: Indicates whether the error occurred during this system boot. Options include: · Current Boot Error. · Last Boot Error. $2: CPU number. $3: Memory controller number. $4: Channel number. $5: DIMM number. $6: DIMM mark. |
Severity level |
Minor |
Example |
Correctable ECC or other correctable memory error---Current Boot Error-Location:CPU:1 MEM CTRL:1 CH:1 DIMM:0 A1 |
Explanation |
A correctable memory error occurred but the system can still operate normally. |
Recommended action |
No action is required. |
CPU triggered a correctable error
Event code |
0x0c0000de |
Message text |
CPU $1 $2 triggered a correctable error |
Variable fields |
$1: CPU number. $2: DIMM number. |
Severity level |
Minor |
Example |
CPU 1 A0 triggered a correctable error |
Explanation |
An IERR or MCERR error was triggered. The error was identified as a memory correctable error. |
Recommended action |
No action is required. |
Uncorrectable ECC or other uncorrectable memory error
Event code |
0x0c1000de |
Message text |
Uncorrectable ECC or other uncorrectable memory error--$1-Location:CPU:$2 MEM CTRL:$3 CH:$4 DIMM:$5 $6 |
Variable fields |
$1: Indicates whether the error occurred during this system boot. Options include: · Current Boot Error. · Last Boot Error. $2: CPU number. $3: Memory controller number. $4: Channel number. $5: DIMM number. $6: DIMM mark. |
Severity level |
Major |
Example |
Uncorrectable ECC or other uncorrectable memory error---Current Boot Error-Location:CPU:1 MEM CTRL:1 CH:1 DIMM:0 A1 |
Explanation |
An uncorrectable error occurred. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
triggered an uncorrectable error
Event code |
0x0c1000de |
Message text |
CPU$1 $2 triggered an uncorrectable error |
Variable fields |
$1: CPU number. $2: DIMM number. |
Severity level |
Major |
Example |
CPU1 A0 triggered an uncorrectable error |
Explanation |
An IERR or MCERR error was triggered. The error was identified as a memory uncorrectable error. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the CPU socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity
Event code |
0x0c2000de |
Message text |
Parity---$1---Location: Location:CPU:$2 MEM CTRL:$3 CH:$4 DIMM:$5 $6 |
Variable fields |
$1: Indicates whether the error occurred during this system boot. Options include: · Current Boot Error. · Last Boot Error. $2: CPU number. $3: Memory controller number. $4: Channel number. $5: DIMM number. $6: DIMM mark. |
Severity level |
Minor |
Example |
Parity---Current Boot Error-Location:CPU:1 MEM CTRL:1 CH:1 DIMM:0 A0 |
Explanation |
A memory access error occurred because of a failed parity check on command or address lines. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---Memory Training Faulty Part Tracking Uncorrectable Error
Event code |
0x0c201310 |
Message text |
Parity---Memory Training Faulty Part Tracking Uncorrectable Error-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---Memory Training Faulty Part Tracking Uncorrectable Error-Location:CPU:2 CH:1 DIMM:B1 Rank:0 |
Explanation |
A Faulty Parts Tracking error occurred because of an uncorrectable error. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---Memory Receive Enable Training Error
Event code |
0x0c204140 |
Message text |
Parity---Memory Receive Enable Training Error-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---Memory Receive Enable Training Error-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
A Faulty Parts Tracking error occurred because memory receive enable training failed. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---Memory Write Leveling Training Error
Event code |
0x0c205150 |
Message text |
Parity---Memory Write Leveling Training Error-Location:CPU:&1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---Memory Write Leveling Training Error-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
A Faulty Parts Tracking error occurred because memory write leveling training failed. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---Memory Write DqDqs Training Error
Event code |
0x0c206160 |
Message text |
Parity---Memory Write DqDqs Training Error-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---Memory Write DqDqs Training Error-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
A Faulty Parts Tracking error occurred because memory write DqDqs training failed. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---Memory Sense Amp Training Error
Event code |
0x0c2072f0 |
Message text |
Parity---Memory Sense Amp Training Error-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---Memory Sense Amp Training Error-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
Sense Amp Training failed because a voltage input error occurred. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---Warning Command Clock Training Error
Event code |
0x0c208260 |
Message text |
Parity---Warning Command Clock Training Error-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---Warning Command Clock Training Error-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
An error occurred for command clock training. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---An uncorrectable error occurs during the memory test phase
Event code |
0x0c20b1c0 |
Message text |
Parity---An uncorrectable error occurs during the memory test phase-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---An uncorrectable error occurs during the memory test phase-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
An uncorrectable error occurred during the memory test phase. The corresponding rank needs to be isolated. |
Recommended action |
1. Replace the DIMM. 2. If the issue persists, contact Technical Support. |
Parity---Memory Training Error
Event code |
0x0c20c290 |
Message text |
Parity---Memory Training Error-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---Memory Training Error-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
A memory training error occurred on the DIMM during the POST phase. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---The number of correctable memory errors reached the error logging threshold
Event code |
0x0c21f010 |
Message text |
Parity---The number of correctable memory errors reached the error logging threshold-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---The number of correctable memory errors reached the error logging threshold-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
The number of correctable memory errors reached the error logging threshold. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---An error occurred on the DIMM slot
Event code |
0x0c21f020 |
Message text |
Parity---An error occurred on the DIMM slot-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---An error occurred on the DIMM slot-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
An error occurred on the DIMM slot. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---CMD eye width is too small
Event code |
0x0c226010 |
Message text |
Parity---CMD eye width is too small-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---CMD eye width is too small-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
CMD eye width was too small. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---The command is not in the FNv table
Event code |
0x0c228000 |
Message text |
Parity---The command is not in the FNv table-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---The command is not in the FNv table-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
The command was not in the FNv table. |
Recommended action |
1. Update BIOS and DCPMM controller firmware to the latest version. 2. If the issue persists, contact Technical Support. |
Parity---CTL is not consistent with clock in timing, and the channel is isolated
Event code |
0x0c229020 |
Message text |
Parity---CTL is not consistent with clock in timing, and the channel is isolated-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---CTL is not consistent with clock in timing, and the channel is isolated-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
CTL was inconsistent with Clock in timing and the channel was isolated. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---Memory write flyby failed
Event code |
0x0c231000 |
Message text |
Parity---Memory write flyby failed-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---Memory write flyby failed-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
Memory write flyby failed. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---Timing error occurred during signal line adjustment for memory write leveling training
Event code |
0x0c231010 |
Message text |
Parity---Timing error occurred during signal line adjustment for memory write leveling training-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---Timing error occurred during signal line adjustment for memory write leveling training-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
Timing error occurred during signal line adjustment for memory write leveling training. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---Memory read DqDqs training failed
Event code |
0x0c231130 |
Message text |
Parity---Memory read DqDqs training failed-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---Memory read DqDqs training failed-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
Memory read Dq and Dqs training failed. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---Memory receive enable training failed
Event code |
0x0c231140 |
Message text |
Parity---Memory receive enable training failed-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---Memory receive enable training failed-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
Memory receive enable training failed. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---Memory write leveling training failed
Event code |
0x0c231150 |
Message text |
Parity---Memory write leveling training failed-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---Memory write leveling training failed-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
Memory write leveling training failed. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---Memory write DqDqs training failed
Event code |
0x0c231160 |
Message text |
Parity---Memory write DqDqs training failed-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---Memory write DqDqs training failed-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
Memory write Dq and Dqs training failed. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---An error occurs during memory test, and the rank is disabled
Event code |
0x0c2311c0 |
Message text |
Parity---An error occurs during memory test, and the rank is disabled-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---An error occurs during memory test, and the rank is disabled-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
An error occurred during the memory test phase. The rank is disabled. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---Failed to find the RxVref for data eye training
Event code |
0x0c231250 |
Message text |
Parity---Failed to find the RxVref for data eye training-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---Failed to find the RxVref for data eye training-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
Failed to find the RxVref for data eye training. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---LRDIMM RCVEN training failed
Event code |
0x0c231260 |
Message text |
Parity---LRDIMM RCVEN training failed-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---LRDIMM RCVEN training failed-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
LRDIMM RCVEN training failed. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---RCVEN CYCLE training failed
Event code |
0x0c231270 |
Message text |
Parity---RCVEN CYCLE training failed-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---RCVEN CYCLE training failed-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
RCVEN CYCLE training failed. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---Read delay training failed
Event code |
0x0c231280 |
Message text |
Parity---Read delay training failed-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---Read delay training failed-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
Read delay training failed. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---Memory write leveling training failed
Event code |
0x0c231290 |
Message text |
Parity---Memory write leveling training failed-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---Memory write leveling training failed-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
Memory write leveling training failed. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---Coarse write leveling training failed
Event code |
0x0c2312a0 |
Message text |
Parity---Coarse write leveling training failed-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---Coarse write leveling training failed-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
Coarse write leveling training failed. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---Write delay training failed
Event code |
0x0c2312b0 |
Message text |
Parity---Write delay training failed-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---Write delay training failed-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
Write delay training failed. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---QxCA_CLK_NO_EYE training failed
Event code |
0x0c2312c0 |
Message text |
Parity---QxCA_CLK_NO_EYE training failed-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---QxCA_CLK_NO_EYE training failed-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
QxCA_CLK_NO_EYE training failed. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Parity---mapped out because failed critical mask test at cold boot
Event code |
0x0c28c020 |
Message text |
Parity---mapped out because failed critical mask test at cold boot-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---mapped out because failed critical mask test at cold boot-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
The critical mask test failed at cold boot and the DIMM was marked. |
Recommended action |
1. Replace the DIMM. 2. If the issue persists, contact Technical Support. |
Parity---Invalid SPD contents
Event code |
0x0c2ed090 |
Message text |
Parity---Invalid SPD contents-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Parity---Invalid SPD contents-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
Invalid SPD contents. |
Recommended action |
1. Verify that the ambient temperature and humidity are as required. 2. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects. 3. Verify that the pins in the processor socket are not bent. If any pins are bent, replace the system board. 4. Replace the DIMM. 5. If the issue persists, contact Technical Support. |
Memory Device Disabled
Event code |
0x0c4000de |
Message text |
Memory Device Disabled---Location:CPU:$1 Channel:$2 Dimm:$3 $4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: DIMM mark. |
Severity level |
Major |
Example |
Memory Device Disabled---Location:Socket:1 Channel:1 Dimm:1 A1 |
Explanation |
The DIMM is disabled. |
Recommended action |
1. Verify if the DIMM is disabled from the BIOS. If yes, enable the DIMM from the BIOS. 2. Verify that the DIMM channel is not faulty. 3. If the issue persists, contact Technical Support. |
Memory Device Disabled---the DIMM is disabled
Event code |
0x0c40a040 |
Message text |
Memory Device Disabled---The DIMM is disabled-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Major |
Example |
Memory Device Disabled---The rank is disabled-Location:CPU:2 CH:1 DIMM:B1 Rank:1 |
Explanation |
The DIMM is disabled. |
Recommended action |
1. Verify if the DIMM is disabled from the BIOS. If yes, enable the DIMM from the BIOS. 2. Verify that the DIMM channel is not faulty. 3. If the issue persists, contact Technical Support. |
Memory Device Disabled---the rank is disabled
Event code |
0x0c40a030 |
Message text |
Memory Device Disabled---The rank is disabled-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Major |
Example |
Memory Device Disabled---The rank is disabled-Location:CPU:2 CH:1 DIMM:B1 Rank:1 |
Explanation |
A rank was disabled. |
Recommended action |
1. Verify that the DIMM is disabled from the BIOS. If yes, enable the DIMM from the BIOS. 2. Verify that the DIMM channel is not faulty. 3. If the issue persists, contact Technical Support. |
Memory Device Disabled---Pmem Media disabled
Event code |
0x0c484030 |
Message text |
Memory Device Disabled---Pmem Media disabled-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Major |
Example |
Memory Device Disabled---Pmem Media disabled-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
An error was detected during PMem initialization, which disabled the PMem media. The PMem can be reached and managed in an in-band manner, but the PMem was non-functional and data in the PMem cannot be accessed. |
Recommended action |
1. Replace the faulty DIMM. 2. If the issue persists, contact Technical Support. |
Correctable ECC or other memory error limit reached
Event code |
0x0c5000de |
Message text |
Correctable ECC or other memory error limit reached--$1-Location:CPU:$2 MEM CTRL:$3 CH:$4 DIMM:$5 $6 |
Variable fields |
$1: Indicates whether the error occurred during this system boot. Options include: · Current Boot Error. · Last Boot Error. $2: CPU number. $3: Memory controller number. $4: Channel number. $5: DIMM number. $6: DIMM mark. |
Severity level |
Minor |
Example |
Correctable ECC or other memory error limit reached---Current Boot Error-Location:CPU:1 MEM CTRL:1 CH:1 DIMM:0 A1 |
Explanation |
The number of correctable memory errors reached the logging threshold. A correctable memory error might occur if a DIMM is installed incorrectly or an internal memory error occurs. If the memory RAS mode is set, the system performs the specified operation. In memory repair mode, the system still generates the message if the logging threshold is exceeded. |
Recommended action |
1. Re-install the target DIMM. Verify that the gold contacts on the DIMM are not contaminated and the DIMM slot does not contain any foreign objects 2. Verify that the ambient temperature and humidity are as required. 3. Access the BIOS setup utility, and verify that the correctable error threshold setting is proper. 4. If the issue persists, contact Technical Support. |
Presence detected
Event code |
0x0c6000df |
Message text |
Presence detected. |
Variable fields |
N/A |
Severity level |
Minor |
Example |
Presence detected |
Explanation |
The system detected the absence of a DIMM. |
Recommended action |
1. Access the BIOS setup utility, and verify if the server starts up with the minimum configuration. If yes, components that are not started are isolated by the BIOS and cannot be detected by HDM. 2. Install or re-install DIMMs. Make sure the gold contacts on the DIMMs are not contaminated, and DIMM slots do not contain any foreign objects. 3. If the issue persists, contact Technical Support. |
Configuration error---RDIMMs are installed on the server that supports only UDIMMs
Event code |
0x0c701010 |
Message text |
Configuration error---RDIMMs are installed on the server that supports only UDIMMs-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---RDIMMs are installed on the server that supports only UDIMMs-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
RDIMMs are installed for a processor platform that supports only UDIMMs. |
Recommended action |
1. Access the component compatibility query tool at http://www.h3c.com/cn/Service/Document_Software/Document_Center/Server/ and verify that the DIMMs are compatible with the server. 2. If the issue persists, contact Technical Support. |
Configuration error---UDIMMs are installed on the server that supports only RDIMMs
Event code |
0x0c702010 |
Message text |
Configuration error---UDIMMs are installed on the server that supports only RDIMMs-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---UDIMMs are installed on the server that supports only RDIMMs-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
UDIMMs are installed on a server that supports only RDIMMs. |
Recommended action |
1. Access the component compatibility query tool at http://www.h3c.com/cn/Service/Document_Software/Document_Center/Server/ and verify that the DIMMs are compatible with the server. 2. If the issue persists, contact Technical Support. |
Configuration error---SODIMMs are installed on the server that supports only RDIMMs
Event code |
0x0c703010 |
Message text |
Configuration error---SODIMMs are installed on the server that supports only RDIMMs-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---SODIMMs are installed on the server that supports only RDIMMs-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
SODIMMs are installed on a server that supports only RDIMMs. |
Recommended action |
1. Access the component compatibility query tool at http://www.h3c.com/cn/Service/Document_Software/Document_Center/Server/ and verify that the DIMMs are compatible with the server. 2. If the issue persists, contact Technical Support. |
Configuration error---The number of ranks per channel can be only 1, 2, or 4
Event code |
0x0c707020 |
Message text |
Configuration error---The number of ranks per channel can be only 1, 2, or 4-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---The number of ranks per channel can be only 1, 2, or 4-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
The number of ranks per channel does not meet the requirements of the processor platform. The processor platform supports only 1, 2, or 4 ranks. |
Recommended action |
1. Verify that the number of ranks is as required. If not, replace the DIMMs. 2. If the issue persists, contact Technical Support. |
Configuration error---Columns, rows, or banks of the DIMM cannot meet the JEDEC standards, and LRDIMMs are not supported
Event code |
0x0c707040 |
Message text |
Configuration error---Columns, rows, or banks of the DIMM cannot meet the JEDEC standards, and LRDIMMs are not supported-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---Columns, rows, or banks of the DIMM cannot meet the JEDEC standards, and LRDIMMs are not supported-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
Columns, rows, or banks of the DIMM cannot meet the JEDEC standards, or LRDIMMs are not supported. |
Recommended action |
1. Access the component compatibility query tool at http://www.h3c.com/cn/Service/Document_Software/Document_Center/Server/ and verify that the DIMMs are compatible with the server. 2. If the issue persists, contact Technical Support. |
Configuration error---The number of ranks in the channel exceeds 8
Event code |
0x0c707050 |
Message text |
Configuration error---The number of ranks in the channel exceeds 8-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---The number of ranks in the channel exceeds 8-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
The number of ranks in the channel exceeded 8 (maximum supported number). |
Recommended action |
1. Verify that the number of ranks in the channel does not exceed upper limit. 2. If the issue persists, contact Technical Support. |
Configuration error---Support for ECC on the DIMMs is not consistent with support for ECC on the server
Event code |
0x0c707090 |
Message text |
Configuration error---Support for ECC on the DIMMs is not consistent with support for ECC on the server-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---Support for ECC on the DIMMs is not consistent with support for ECC on the server-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
Support for ECC on the DIMMs is inconsistent with support for ECC on the server. |
Recommended action |
1. Identify the DIMM type. Log in to HDM and view ECC support details. If the inconsistency is confirmed, replace the DIMMs. 2. If the issue persists, contact Technical Support. |
Configuration error---The voltage for a DDR4 DIMM must be 12V, and the voltage for a DDR5 DIMM must be 11V
Event code |
0x0c7070a0 |
Message text |
Configuration error---The voltage for a DDR4 DIMM must be 12V, and the voltage for a DDR5 DIMM must be 11V-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---The voltage for a DDR4 DIMM must be 12V, and the voltage for a DDR5 DIMM must be 11V-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
The current voltage cannot meet the requirement of the present DIMMs. The voltage for a DDR4 DIMM must be 12V, and the voltage for a DDR5 DIMM must be 11V. |
Recommended action |
1. Replace with DIMMs compatible with the current voltage. 2. If the issue persists, contact Technical Support. |
Configuration error---The CPU is not compatible with 3DS DIMMs
Event code |
0x0c707100 |
Message text |
Configuration error---The CPU is not compatible with 3DS DIMMs-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---The CPU is not compatible with 3DS DIMMs-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
The processor is not compatible with 3DS DIMMs. |
Recommended action |
1. Replace the DIMMs. 2. If the issue persists, contact Technical Support. |
Configuration error---NVDIMMs with stepping lower than 0x10 are not supported
Event code |
0x0c707110 |
Message text |
Configuration error---NVDIMMs with stepping lower than 0x10 are not supported-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---NVDIMMs with stepping lower than 0x10 are not supported-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
NVDIMMs with stepping lower than 0x10 are not supported. |
Recommended action |
1. Access the BIOS setup utility and verify that the DIMMs are supported by the processor. If not, replace the DIMMs. 2. If the issue persists, contact Technical Support. |
Configuration error---The CPU is not compatible with 16-GB single-rank DIMMs
Event code |
0x0c707120 |
Message text |
Configuration error---The CPU is not compatible with 16-GB single-rank DIMMs-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---The CPU is not compatible with 16-GB single-rank DIMMs-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
The processor is not compatible with 16GB single-rank DIMMs. |
Recommended action |
1. Examine whether the DIMM is a 16GB single-rank DIMM. If yes, replace the DIMM. 2. If the issue persists, contact Technical Support. |
Configuration error---The CPU is not compatible with the DIMMs
Event code |
0x0c707140 |
Message text |
Configuration error---The CPU is not compatible with the DIMMs-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---The CPU is not compatible with the DIMMs-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
The processor is not compatible with the DIMMs. |
Recommended action |
1. Access the component compatibility query tool at http://www.h3c.com/cn/Service/Document_Software/Document_Center/Server/ and verify that the DIMMs are compatible with the server. 2. If the issue persists, contact Technical Support. |
Configuration error---The frequency of the DIMM is not supported on the server
Event code |
0x0c707150 |
Message text |
Configuration error---The frequency of the DIMM is not supported on the server-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---The frequency of the DIMM is not supported on the server-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
The frequency of the DIMM is not supported on the server. |
Recommended action |
1. Access the component compatibility query tool at http://www.h3c.com/cn/Service/Document_Software/Document_Center/Server/ and verify that the DIMMs are compatible with the server. 2. Access the BIOS setup utility and verify that Enforce POR is enabled. 3. If the issue persists, contact Technical Support. |
Configuration error---NVDIMMs are not compatible with the CPU
Event code |
0x0c7071a0 |
Message text |
Configuration error---NVDIMMs are not compatible with the CPU-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---NVDIMMs are not compatible with the CPU-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
DCPMMs are not compatible with the processor. |
Recommended action |
1. Access the component compatibility query tool at http://www.h3c.com/cn/Service/Document_Software/Document_Center/Server/ and verify that the DIMMs are compatible with the server. 2. If the issue persists, contact Technical Support. |
Configuration error---DCPMMs are not supported
Event code |
0x0c7071d0 |
Message text |
Configuration error---DCPMMs are not supported-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---DCPMMs are not supported-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
DCPMMs are not supported. |
Recommended action |
1. Access the component compatibility query tool at http://www.h3c.com/cn/Service/Document_Software/Document_Center/Server/ and verify that the DIMMs are compatible with the server. 2. If the issue persists, contact Technical Support. |
Configuration error---Memory LockStep Disable Error
Event code |
0x0c709090 |
Message text |
Configuration error---Memory LockStep Disable Error-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---Memory LockStep Disable Error-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
Failed to enable the LockStep mode. The mode was degraded to independent. |
Recommended action |
1. Verify that the installed DIMMs meet the requirements of the LockStep mode. For the DIMM installation requirements, see the user guide for the server. 2. If the issue persists, contact Technical Support. |
Configuration error---Memory Mirror Disable Error
Event code |
0x0c70a0c0 |
Message text |
Configuration error---Memory Mirror Disable Error-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---Memory Mirror Disable Error-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
A memory error that BIOS cannot identify occurred. The memory installation does not meet the requirements of the mirror mode. |
Recommended action |
1. Verify that the installed DIMMs meet the requirements of the Mirror mode. For the DIMM installation requirements, see the user guide for the server. 2. If the issue persists, contact Technical Support. |
Configuration error---Failed to enable the full mirror mode
Event code |
0x0c70c010 |
Message text |
Configuration error---Failed to enable the full mirror mode |
Variable fields |
N/A |
Severity level |
Minor |
Example |
Configuration error---Failed to enable the full mirror mode |
Explanation |
Failed to enable the Full Mirror RAS mode. The mirror configuration degraded. |
Recommended action |
1. Verify that the installed DIMMs meet the requirements of the LockStep mode. For the DIMM installation requirements, see the user guide for the server. 2. If the issue persists, contact Technical Support. |
Configuration error---The memory interleaving configuration cannot meet the requirements of the server
Event code |
0x0c70e030 |
Message text |
Configuration error---The memory interleaving configuration cannot meet the requirements of the server-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---The memory interleaving configuration cannot meet the requirements of the server-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
Memory configuration is incorrect. The memory interleaving configuration cannot meet the requirements of the server. |
Recommended action |
1. Access the BIOS setup utility, and verify that the memory interleaving configuration (such as NUMA and interleave) can meet the server requirements. 2. Upgrade the BIOS firmware to the latest version. 3. If the issue persists, contact Technical Support. |
Configuration error---The memory interleaving configuration cannot meet the requirements of the server
Event code |
0x0c70e080 |
Message text |
Configuration error---The memory interleaving configuration cannot meet the requirements of the server-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---The memory interleaving configuration cannot meet the requirements of the server-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
Memory configuration is incorrect. The memory interleaving configuration cannot meet the requirements of the server. |
Recommended action |
1. Access the BIOS setup utility, and verify that the memory interleaving configuration (such as NUMA and interleave) can meet the server requirements. 2. Upgrade the BIOS firmware to the latest version. 3. If the issue persists, contact Technical Support. |
Configuration error---Failed to enable the rank sparing mode The memory RAS mode has degraded to independent
Event code |
0x0c710010 |
Message text |
Configuration error---Failed to enable the rank sparing mode The memory RAS mode has degraded to independent-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---Failed to enable the rank sparing mode The memory RAS mode has degraded to independent-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
Configuration error occurred. Failed to enable the Rank Sparing mode. The memory RAS mode has degraded to independent mode. |
Recommended action |
1. Verify that the installed DIMMs meet the requirements of the Rank Sparing mode. For the DIMM installation requirements, see the user guide for the server. 2. If the issue persists, contact Technical Support. |
Configuration error---Memory Rank Sparing Error
Event code |
0x0c710100 |
Message text |
Configuration error---Memory Rank Sparing Error-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---Memory Rank Sparing Error-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
The memory rank sparing configuration does not take effect. |
Recommended action |
1. Access the BIOs setup utility and verify that Rank Sparing is enabled. 2. Verify that the installed DIMMs meet the requirements of the Rank Sparing mode. For the DIMM installation requirements, see the user guide for the server. 3. If the issue persists, contact Technical Support. |
Configuration error---Failed to enable patrol scrubbing
Event code |
0x0c711000 |
Message text |
Configuration error---Failed to enable patrol scrubbing-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---Failed to enable patrol scrubbing-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
Memory configuration is incorrect. Failed to enable Patrol Scrub. |
Recommended action |
1. Identify RAS features supported by the processor specifications as instructed in H3C G3 Servers RAS Technology White Paper. If Patrol Scrub is not supported, disable Patrol Scrub. 2. If the issue persists, contact Technical Support. |
Configuration error---The number of ranks in the black slot is greater than that in the white slot, or the DIMM is installed in the black slot with the white slot empty
Event code |
0x0c717010 |
Message text |
Configuration error---The number of ranks in the black slot is greater than that in the white slot, or the DIMM is installed in the black slot with the white slot empty-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---The number of ranks in the black slot is greater than that in the white slot, or the DIMM is installed in the black slot with the white slot empty-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
The memory installation is incorrect. Make sure memory installation follows these restrictions: · Populate the DIMM with more ranks in the white slot in each channel. · Populate DIMMs first in white slots. |
Recommended action |
1. Re-install DIMMs as required in the user guide for the server. 2. If the issue persists, contact Technical Support. |
Configuration error---DIMM population error Two DDR-T memory modules cannot be installed in a channel
Event code |
0x0c717030 |
Message text |
Configuration error---DIMM population error Two DDR-T memory modules cannot be installed in a channel-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---DIMM population error Two DDR-T memory modules cannot be installed in a channel-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
Two DCPMM memory modules cannot be installed in the same channel. |
Recommended action |
1. Re-install DIMMs as required in the user guide for the server. 2. If the issue persists, contact Technical Support. |
Configuration error---The DDR-T memory module is installed in the white slot
Event code |
0x0c717050 |
Message text |
Configuration error---The DDR-T memory module is installed in the white slot-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---The DDR-T memory module is installed in the white slot-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
Explanation |
The DCPMM memory module is installed in the white slot. |
Recommended action |
1. Re-install DIMMs as required in the user guide for the server. 2. If the issue persists, contact Technical Support. |
Configuration error---2LM IMC memory Mismatch
Event code |
0x0c7170c0 |
|
Message text |
Configuration error---2LM IMC memory Mismatch-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
|
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
|
Severity level |
Minor |
|
Example |
Configuration error---2LM IMC memory Mismatch-Location:CPU:1 CH:1 DIMM:A1 Rank:0 |
|
Explanation |
The memory installation did not meet the requirement for single Integrated Memory Controller (IMC) installation in 2LM mode. |
|
Recommended action |
1. Verify that DIMMs are installed as required in 2LM mode. Make sure each IMC contains a minimum of one DDR and one DCPMM whose available capacity is larger than 0. 2. If the issue persists, contact Technical Support. |
Configuration error---ODT configuration errorThe channel is isolated
Event code |
0x0c729030 |
Message text |
Configuration error---ODT configuration error The channel is isolated-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---ODT configuration errorThe channel is isolated-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
Memory ODT is configured incorrectly, and the channel is isolated. |
Recommended action |
1. Re-install the DIMM. Make sure the gold contacts on the DIMM and the DIMM slot are clean. 2. Replace the DIMM. 3. If the issue persists, contact Technical Support. |
Configuration error---Failed to enable ADDDC
Event code |
0x0c73a010 |
Message text |
Configuration error---Failed to enable ADDDC |
Variable fields |
N/A |
Severity level |
Minor |
Example |
Configuration error---Failed to enable ADDDC |
Explanation |
Memory configuration is incorrect. Failed to enable ADDDC. |
Recommended action |
1. Access the BIOS setup utility and verify that the memory configuration meets the ADDDC requirements. 2. If the issue persists, contact Technical Support. |
Configuration error---Failed to enable SDDC
Event code |
0x0c73b020 |
Message text |
Configuration error---Failed to enable SDDC |
Variable fields |
N/A |
Severity level |
Minor |
Example |
Configuration error---Failed to enable SDDC |
Explanation |
Memory configuration is incorrect. Failed to enable SDDC. |
Recommended action |
1. Access the BIOS setup utility and verify that the memory configuration meets the SDDC requirements. 2. If the issue persists, contact Technical Support. |
Configuration error---DCPMM firmware version not supported
Event code |
0x0c73c000 |
Message text |
Configuration error---DCPMM firmware version not supported-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---DCPMM firmware version not supported-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
Memory configuration is incorrect. The DCPMM firmware version is not supported. |
Recommended action |
1. Update the DCPMM firmware to the latest version. 2. If the issue persists, contact Technical Support. |
Configuration error---DCPMM firmware version not supported
Event code |
0x0c73c010 |
Message text |
Configuration error---DCPMM firmware version not supported-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---DCPMM firmware version not supported-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
Memory configuration is incorrect. The DCPMM firmware version is not supported. |
Recommended action |
1. Update the DCPMM firmware to the latest version. 2. If the issue persists, contact Technical Support. |
Configuration error---NVMCTRL_MEDIA_NOTREADY
Event code |
0x0c784020 |
Message text |
Configuration error---NVMCTRL_MEDIA_NOTREADY-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---NVMCTRL_MEDIA_NOTREADY-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
Memory configuration is incorrect. The DCPMM firmware medium is not ready. |
Recommended action |
1. Update the DCPMM firmware to the latest version. 2. Replace the DIMM. 3. If the issue persists, contact Technical Support. |
Configuration error---The DDR-T memory modules of the unexpected model are installed
Event code |
0x0c7ed0c0 |
Message text |
Configuration error---The DDR-T memory modules of the unexpected model are installed-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---The DDR-T memory modules of the unexpected model are installed-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
Memory configuration is incorrect. The DCPMMs are incompatible with the server. |
Recommended action |
1. Access the component compatibility query tool at http://www.h3c.com/cn/Service/Document_Software/Document_Center/Server/ and verify that the DIMMs are compatible with the server. 2. If the issue persists, contact Technical Support. |
Configuration error---Failed to set the VDD voltage of the DIMM
Event code |
0x0c7f0010 |
Message text |
Configuration error---Failed to set the VDD voltage of the DIMM |
Variable fields |
N/A |
Severity level |
Minor |
Example |
Configuration error---Failed to set the VDD voltage of the DIMM |
Explanation |
Memory configuration is incorrect. Failed to set the DIMM VDD voltage. |
Recommended action |
1. Replace the DIMMs. 2. Replace the system board. 3. If the issue persists, contact Technical Support. |
Configuration error---Too many RIR rules
Event code |
0x0c7f9010 |
Message text |
Configuration error---Too many RIR rules |
Variable fields |
N/A |
Severity level |
Minor |
Example |
Configuration error---Too many RIR rules |
Explanation |
Memory configuration is incorrect. Too many RIR rules. |
Recommended action |
1. Upgrade the BIOS to the latest version. 2. Verify that the DIMMs and processors are installed correctly according to the user guide for the server. 3. Access the BIOS setup utility and verify that the memory interleaving and NUMA settings are correct. 4. If the issue persists, contact Technical Support. |
Configuration error---The DIMMs for the CPU exceeded the limit
Event code |
0x0c7fa010 |
Message text |
Configuration error---The DIMMs for the CPU exceeded the limit-Location:CPU:$1 CH:$2 DIMM:$3 Rank:$4 |
Variable fields |
$1: CPU number. $2: Channel number. $3: DIMM number. $4: Rank number. |
Severity level |
Minor |
Example |
Configuration error---The DIMMs for the CPU exceeded the limit-Location:CPU:1 CH:2 DIMM:A0 Rank:0 |
Explanation |
Memory configuration is incorrect. The DIMMs for the processor exceeded the limit. |
Recommended action |
1. Verify that the memory configuration is supported by the processor specifications. 2. If the issue persists, contact Technical Support. |
Drive slot
Drive Presence
Event code |
0x0d0000df |
Message text |
Drive Presence --- $1: $2, HDD Slot: $3. |
Variable fields |
$1: Drive bay slot in HDD bay deployment or JBOD slot in a cabinet server. $2: ¡ When $1 is a drive bay slot, this parameter represents the drive bay slot number, which can be 1, 2, 5, 6, 9, 10, 13, or 14. ¡ When $1 is a JBOD slot, this parameter represents the JBOD slot number in the range of 1 to 8. $3: ¡ When $1 is a drive bay slot, this parameter represents the drive identifier in the range of 0 to 39. ¡ When $1 is a JBOD slot, this parameter represents the drive slot number in the range of 0 to 22. |
Severity level |
Info |
Example |
Drive Presence --- Bay Slot: 1, HDD Slot: 2 |
Explanation |
If the alarm is triggered, it indicates that the drive comes online. If the alarm is removed, it indicates that the drive goes offline. This message is not generated when the server starts up for the first time unless an error occurs. For a blade server HDD bay, the suffix is --- Bay Slot: $2, HDD Slot: $3. For a cabinet server, the suffix is --- JBOD: $2, HDD Slot: $3. |
Recommended action |
No action is required. |
Drive Fault
Event code |
0x0d1000de |
Message text |
Drive Fault --- $1: $2, HDD Slot: $3 |
Variable fields |
$1: Drive bay slot in HDD bay deployment or JBOD slot in a cabinet server. $2: ¡ When $1 is a drive bay slot, this parameter represents the drive bay slot number, which can be 1, 2, 5, 6, 9, 10, 13, or 14. ¡ When $1 is a JBOD slot, this parameter represents the JBOD slot number in the range of 1 to 8. $3: ¡ When $1 is a drive bay slot, this parameter represents the drive identifier in the range of 0 to 39. ¡ When $1 is a JBOD slot, this parameter represents the drive slot number in the range of 0 to 22. |
Severity level |
Major |
Example |
Drive Fault --- Bay Slot: 1, HDD Slot: 2 |
Explanation |
The drive cannot be identified or failed. For a blade server HDD bay, the suffix is --- Bay Slot: $2, HDD Slot: $3. For a cabinet server, the suffix is --- JBOD: $2, HDD Slot: $3. |
Recommended action |
1. Log in to HDM, view drive information, and verify that all drives in the logical drive are identified correctly. If a drive cannot be identified, re-install the drive. If the drive cannot be identified after re-installation, replace the drive. 2. View drive information and verify that the status of the drive is Unconfigured Good. 3. View drive information and verify that the drive can be identified and is normal, and the drive number on HDM is consistent with the drive number in the message. If the drive number on HDM is different from the drive number in the message, verify that the drive cables are connected correctly. 4. If multiple drives are absent, verify that the drive cables and the drive backplane are normal. 5. Verify that drive LEDs are normal, and the drive can be identified and is accessible in the OS. If a drive LED is orange, the drive is faulty. Replace the faulty components, if any. 6. Verify that the storage controller is in normal state. 7. If the issue persists, contact Technical Support. |
Predictive Failure
Event code |
0x0d2000de |
Message text |
Predictive Failure---Bay Slot: $1, HDD Slot: $2 |
Variable fields |
$1: Bay slot number, which can be 1, 2, 5, 6, 9, 10, 13, or 14. $2: Drive identifier in the range of 0 to 39. |
Severity level |
Minor |
Example |
Predictive Failure---Bay Slot: 1, HDD Slot: 2 |
Explanation |
The RAID controller reports a predictive failure, which can be a storage medium reserved block alarm, drive lifetime alarm, Prefail alarm, or bad sector alarm. If the message is generated for a drive in a bay slot of a blade server, the message is suffixed with --- Bay Slot: $1, HDD Slot: $2. The message has no suffix on other servers. |
Recommended action |
1. Log in to HDM to verify that the drive is in normal state. 2. Replace the drive. 3. If the issue persists, contact Technical Support. |
Consistency Check / Parity Check in progress. System Source Monitor: Hard Disk usage exceeds the threshold
Event code |
0x0d4000de |
Message text |
Linux: Consistency Check / Parity Check in progress. System Source Monitor: Hard Disk usage exceeds the threshold---OS:Linux/Unix,See disk details about Logical disk name, Threshold $1: ---Current usage $2 Windows: Consistency Check / Parity Check in progress. System Source Monitor: Hard Disk usage exceeds the threshold---OS:Windows, Logical disk $1:---Current usage $2 |
Variable fields |
Linux: · $1: Drive space usage threshold. · $2: Current drive space usage. Windows: · $1: Drive letter. · $2: Current drive space usage. |
Severity level |
Info |
Example |
Linux: Consistency Check / Parity Check in progress. System Source Monitor: Hard Disk usage exceeds the threshold --OS:Linux/Unix,See disk details about Logical disk name, Threshold 75%: ---Current usage 80% Windows: Consistency Check / Parity Check in progress. System Source Monitor: Hard Disk usage exceeds the threshold ---OS:Windows, Logical disk d: ---Current usage 80% |
Explanation |
Drive usage exceeded the threshold. You can configure the processor usage, memory usage, and drive usage thresholds from HDM. During operation, FIST SMS obtains system resource usage information, and sends the information to HDM through IPMI commands. HDM generates this message if a threshold is exceeded. |
Recommended action |
1. Use the HDM system resource monitoring feature to monitor the drive usage. If the usage is abnormal, contact Technical Support. 2. If the drive usage is normally high, back up data and expand the drive capacity. |
Consistency Check / Parity Check in progress. System Source Monitor: Relieve resource alarm about Hard Disk Usage
Event code |
0x0d4000df |
Message text |
Linux: Consistency Check / Parity Check in progress. System Source Monitor: System Source Monitor: Relieve resource alarm about Hard Disk Usage ---OS:Linux/Unix,See disk details about Logical disk name, Threshold $1: ---Current usage $2 Windows: Consistency Check / Parity Check in progress. System Source Monitor: System Source Monitor: Relieve resource alarm about Hard Disk Usage ---OS:Windows, Logical disk $1:---Current usage $2 |
Variable fields |
Linux: · $1: Drive space usage threshold. · $2: Current drive space usage. Windows: · $1: Drive letter. · $2: Current drive space usage. |
Severity level |
Info |
Example |
Linux: Consistency Check / Parity Check in progress. System Source Monitor: Relieve resource alarm about Hard Disk Usage ---OS:Linux/Unix,See disk details about Logical disk name, Threshold 80%: ---Current usage 75% Windows: Consistency Check / Parity Check in progress. System Source Monitor: Relieve resource alarm about Hard Disk Usage ---OS:Windows, Logical disk d: ---Current usage 80% |
Explanation |
This message is generated when the system resource usage drops below the alarm threshold. This is an alarm removal log for event 0x0d4000de. You can configure the processor usage, memory usage, and drive usage thresholds from HDM. During operation, FIST SMS obtains system resource usage information, and sends the information to HDM through IPMI commands. HDM generates this message if a threshold is exceeded. |
Recommended action |
No action is required. |
In Critical Array
Event code |
0x0d5000de |
Message text |
In Critical Array---$1:$2$3 :$4. |
Variable fields |
$1: Drive bay slot or PCIe slot. $2: When $1 is a drive bay slot, this parameter represents the drive bay slot number, which can be 1, 2, 5, 6, 9, 10, 13, or 14. When $1 is a PCIe slot, this parameter represents the slot number of the storage controller that manages the logical drive. $3: HDD slot or LDDevno. $4: When $3 represents HDD slot, this parameter represents the drive identifier in the range of 0 to 39. When $3 represents LDDevno, this parameter represents the logical drive number. |
Severity level |
Major |
Example |
In Critical Array---PCIe slot:1---LDDevno :2 |
Explanation |
A drive in a logical drive was removed or failed and the logical drive degraded. If the message is generated for a drive in a bay slot of a blade server, the message is suffixed with --- Bay Slot: $2, HDD Slot: $4. On other servers, the message is suffixed with ---PCIe slot: $2---LDDevno : $4. |
Recommended action |
1. Verify that the drive is not removed. If the drive is removed, re-install the drive and recreate the RAID array. 2. Log in to HDM, view drive information, and verify that all drives in the logical drive are identified correctly. If a drive cannot be identified, re-install the drive. If the drive cannot be identified after re-installation, replace the drive. 3. Log in to HDM, view drive information, and verify that the status of the drive is Unconfigured Good. 4. After the drive is identified correctly, recreate the RAID array. 5. If the issue persists, contact Technical Support. |
In Failed Array
Event code |
0x0d6000de |
Message text |
In Failed Array---$1:$2$3 :$4. |
Variable fields |
$1: Drive bay slot or PCIe slot. $2: When $1 is a drive bay slot, this parameter represents the drive bay slot number. When $1 is a PCIe slot, this parameter represents the slot number of the storage controller that manages the logical drive. $3: HDD slot or LDDevno. $4: When $3 represents HDD slot, this parameter represents the drive identifier. When $3 represents LDDevno, this parameter represents the logical drive number. |
Severity level |
Major |
Example |
In Failed Array---PCIe slot:1---LDDevno :2 |
Explanation |
A drive in a logical drive was removed or failed and the logical drive was totally corrupted. If the message is generated for a drive in a bay slot of a blade server, the message is suffixed with --- Bay Slot: $2, HDD Slot: $4. On other servers, the message is suffixed with ---PCIe slot: $2---LDDevno : $4. |
Recommended action |
1. Verify that the drive is not removed. If the drive is removed, re-install the drive and recreate the RAID array. 2. Log in to HDM, view drive information, and verify that all drives in the logical drive are identified correctly. If a drive cannot be identified, re-install the drive. If the drive cannot be identified after re-installation, replace the drive. 3. Log in to HDM, view drive information, and verify that the status of the drive is Unconfigured Good. 4. After the drive is identified correctly, verify that the RAID array is normal. If the RAID array is faulty, recreate the RAID array. 5. If the issue persists, contact Technical Support. |
Rebuild/Remap in progress
Event code |
0x0d7000de |
Message text |
Rebuild/Remap in progress---Bay Slot: $1, HDD Slot: $2. |
Variable fields |
$1: Drive bay number, including 1, 2, 5, 6, 9, 10, 13, and 14. $2: Drive number on a drive bay in the range of 0 to 39. |
Severity level |
Info |
Example |
Rebuild/Remap in progress---Bay Slot: 1, HDD Slot: 2 |
Explanation |
If the message is generated for a drive in a bay slot of a blade server, the message is suffixed with ---Bay Slot: $1, HDD Slot: $2. On other servers, the message has no suffix. |
Recommended action |
No action is required. |
The disk triggered a media error
Event code |
0x0da000de |
Message text |
The disk triggered a media error--$1. |
Variable fields |
$1: Drive location. |
Severity level |
Info |
Example |
The disk triggered an media error--Front 1 |
Explanation |
The number of media errors exceeded the threshold. |
Recommended action |
1. Upgrade the firmware of the drive. 2. Replace the drive. 3. If the issue persists, contact Technical Support. |
The disk triggered an uncorrectable error
Event code |
0x0db000de |
Message text |
The disk triggered an uncorrectable error--$1. |
Variable fields |
$1: Drive location. |
Severity level |
Minor |
Example |
The disk triggered an uncorrectable error--Front 1 |
Explanation |
The number of uncorrectable errors exceeded the threshold. |
Recommended action |
1. Upgrade the firmware of the drive. 2. Replace the drive. 3. If the issue persists, contact Technical Support. |
The disk is missing
Event code |
0x0dc000de |
Message text |
The disk is missing. |
Variable fields |
N/A |
Severity level |
Major |
Example |
The disk is missing |
Explanation |
The drive cannot be identified because the drive is faulty or drive cables are connected incorrectly. |
Recommended action |
1. Log in to HDM, view drive information, and verify that all drives in the logical drive are identified correctly. 2. Verify that the drive data cables, power cords, and signal cables are connected correctly. 3. Re-install the drive. 4. Replace the drive. 5. If the issue persists, contact Technical Support. |
System firmware Progress
System Firmware Error (POST Error)---CPU matching failure
Event code |
0x0f0000de |
Message text |
System Firmware Error (POST Error)---CPU matching failure. |
Variable fields |
N/A |
Severity level |
Major |
Example |
System Firmware Error (POST Error)---CPU matching failure |
Explanation |
The BIOS detected a CPU frequency, microcode, or UPI matching error at POST. |
Recommended action |
1. Verify that the processors are installed correctly as required in the user guide for the server. 2. Verify that the CPUs have the same model. 3. If the issue persists, contact Technical Support.. |
System Firmware Error (POST Error)---Firmware (BIOS) ROM corruption detected
Event code |
0x0f0000de |
Message text |
System Firmware Error (POST Error)---Firmware (BIOS) ROM corruption detected. |
Variable fields |
N/A |
Severity level |
Major |
Example |
System Firmware Error (POST Error)---Firmware (BIOS) ROM corruption detected |
Explanation |
The BIOS detected ROM corruption at POST. The BIOS firmware is damaged when this message is generated. |
Recommended action |
1. Upgrade the BIOS firmware. 2. Upgrade the BIOS with the factory defaults (if any) or default settings of the BIOS restored. 3. If the issue persists, contact Technical Support. |
System Firmware Error (POST Error)---Load microcode failed
Event code |
0x0f0000de |
Message text |
System Firmware Error (POST Error)---Load microcode failed. |
Variable fields |
N/A |
Severity level |
Minor |
Example |
System Firmware Error (POST Error)---Load microcode failed |
Explanation |
The BIOS detected errors at POST because CPU microcode failed to be loaded, but the system did not hang. |
Recommended action |
1. Power off and then power on the server. 2. Upgrade HDM and the BIOS firmware to the latest version. 3. If the issue persists, contact Technical Support. |
System Firmware Error (POST Error)---No system memory or invalid memory configuration
Event code |
0x0f0000de |
Message text |
System Firmware Error (POST Error)---No system memory or invalid memory configuration. |
Variable fields |
N/A |
Severity level |
Major |
Example |
System Firmware Error (POST Error)---No system memory or invalid memory configuration |
Explanation |
No DIMM was detected during BIOS startup. This symptom might occur if DIMMs are installed incorrectly. |
Recommended action |
1. Verify that the DIMMs are installed correctly as required in the user guide of the server. Re-install all DIMMs if needed. 2. If the issue persists, contact Technical Support. |
System firmware error (POST error)---Firmware (BIOS) ROM corruption detected:Image is unsigned or Certificate is invalid
Event code |
0x0f0000de |
Message text |
System firmware error (POST error)---Firmware (BIOS) ROM corruption detected:Image is unsigned or Certificate is invalid. |
Variable fields |
N/A |
Severity level |
Major |
Example |
System firmware error (POST error)---Firmware (BIOS) ROM corruption detected:Image is unsigned or Certificate is invalid |
Explanation |
The BIOS detected ROM corruption at POST. |
Recommended action |
1. Verify if the BIOS boot mode meets the requirements of secure boot. If not, change the boot mode to UEFI. 2. Verify that the BIOS firmware is upgraded successfully. 3. Upgrade the BIOS with the factory defaults (if any) or default settings of the BIOS restored. 4. If the issue persists, contact Technical Support. |
System firmware error (POST error)---Firmware (BIOS) ROM corruption detected:Image Certificate not found in Authorized database(db)
Event code |
0x0f0000de |
Message text |
System firmware error (POST error)---Firmware (BIOS) ROM corruption detected:Image Certificate not found in Authorized database(db). |
Variable fields |
N/A |
Severity level |
Major |
Example |
System firmware error (POST error)---Firmware (BIOS) ROM corruption detected:Image Certificate not found in Authorized database(db) |
Explanation |
The BIOS detected ROM corruption at POST. |
Recommended action |
1. Verify if the BIOS boot mode meets the requirements of secure boot. If not, change the boot mode to UEFI. 2. Verify that the BIOS firmware is upgraded successfully. 3. Upgrade the BIOS with the factory defaults (if any) or default settings of the BIOS restored. 4. If the issue persists, contact Technical Support. |
System firmware error (POST error)---Firmware (BIOS) ROM corruption detected:Image Certificate is found in Forbidden database(dbx)
Event code |
0x0f0000de |
Message text |
System firmware error (POST error)---Firmware (BIOS) ROM corruption detected:Image Certificate is found in Forbidden database(dbx). |
Variable fields |
N/A |
Severity level |
Major |
Example |
System firmware error (POST error)---Firmware (BIOS) ROM corruption detected:Image Certificate is found in Forbidden database(dbx) |
Explanation |
The BIOS detected ROM corruption at POST. |
Recommended action |
1. Verify if the BIOS boot mode meets the requirements of secure boot. If not, change the boot mode to UEFI. 2. Verify that the BIOS firmware is upgraded successfully. 3. Upgrade the BIOS with the factory defaults (if any) or default settings of the BIOS restored. 4. If the issue persists, contact Technical Support. |
System Firmware Error (POST Error)---Memory Population Rule Error
Event code |
0x0f002170 |
|
Message text |
System Firmware Error (POST Error)---Memory Population Rule Error |
|
Variable fields |
N/A |
|
Severity level |
Major |
|
Example |
System Firmware Error (POST Error)---Memory Population Rule Error |
|
Explanation |
DIMM Faulty Parts Tracking error occurred because of incorrect DIMM population. |
|
Recommended action |
1. Verify that DIMMs are installed correctly base on the user guide of the server. Re-install the DIMMs if needed. 2. If the issue persists, contact Technical Support. |
System firmware error (POST error)---DIMM installation or compatibility error occurred
Event code |
0x0f003070 |
Message text |
System firmware error (POST error)---DIMM installation or compatibility error occurred. |
Variable fields |
N/A |
Severity level |
Major |
Example |
System firmware error (POST error)---DIMM installation or compatibility error occurred |
Explanation |
DIMMs were installed incorrectly. |
Recommended action |
1. Log in to HDM, access the Memory page, and verify that the no faulty DIMMs exist. 2. Verify that DIMMs are installed correctly as required in the user guide for the server. 3. Verify that a minimum of one DIMM operates correctly for each processor. |