This Reference Guide has information about iDRAC–only Error and Event Messages (EEMs)
pertaining to 12G, 13G, 14G, 15G, 16G, and 17G PowerEdge servers. These EEMs are
displayed on User Interface (UI), Command-Line Interface (CLI), Rest API, and sometimes
stored in the data log files. EEMs are displayed or stored as a result of user action,
automatic event occurrence, or for data logging purposes. EEMs are prefixed with an
event type to help in identification. For example PSU, STOR, and VLT representing Power
Supply Unit, Storage, and Voltage categories respectively.
These messages are displayed on Graphical User
Interface (UI), Command-Line Interface (CLI), Rest API, and sometimes stored in the
data log files. EEMs are displayed or stored as a result of user action, automatic
event occurrence, or for data logging purposes.
This guide serves as a vital tool for IT
administrators and support personnel to diagnose, respond to, and resolve system
alerts efficiently, ensuring optimal server performance and reliability. An EEM has
three elements:
Message: Indicates the message, and possible causes,
wherever applicable.
Recommended Response Action: Indicates the remedial
tasks that you can perform to resolve an issue. Where applicable, the iDRAC
user interface, RACADM, or Redfish methods of resolving an issue is
described.
Detailed Description: Provides more info about the
error or event, where appropriate.
EEMs are classified according to hardware and
software categories in the server and its management applications such as iDRAC,
iSM, CMC, and Lifecycle Controller. The category and severity of each message is
also specified.
Use this EEM Reference Guide to understand more
about the events and resolve issues that are associated with your PowerEdge servers.
Messages are prefixed with event type to help
identification. For example PSU, STOR, and VLT representing power supply unit,
storage, and voltage respectively.
NOTE:Event types that are enabled by
default are identified using an asterisk (*) symbol in the Applicable
Event Types section.
The expansion of acronyms that are used for classifying error messages is listed in the table
here:
Alert category definition
System Health: Represents all the alerts that are related to hardware within the system chassis. For example, temperature errors, voltage errors, and device errors.
Storage Health: Represents
alerts that are related to the storage subsystem. For example, controller
errors, hard drive errors, and virtual drive errors.
Configuration: Represents alerts that are related to hardware, firmware, and software configuration changes. For example, PCI-e card added or removed, RAID configuration changed, and iDRAC license changed.
Audit: Category represents the audit log. Examples include, user login/logout information, Password authentication failures, session info, power states.
Update: Represents alerts that are generated because of firmware or driver upgrades and downgrades. This does not represent firmware inventory.