Unsolved

Closed

1 Rookie

 • 

17 Posts

797

August 3rd, 2023 03:00

Not alerting

Hello, several years ago I set up OME at a different company and it was literally type in iDRAC IP address, username and password and that was that it worked and added into OME

I'm now at a different company and i'm not receiving alerts from the iDRAC's which I have entered.  I done the same thing of adding the devices by idrac root password.  These iDRACS are showing in OME and as healthy

These servers all have SMTP set up, and as they are going down for reboots I'm receiving emails directly from the host about like NIC1 down, but i'm NOT getting the same alerts from OME and unsure why.

Emails from OME is setup and working, so it's not that, they just never get registered inside OME, so don't get an email directly from OME about a managed iDRAC

I did watch a video on setting up Incoming Alerts, but i'm 99% sure I never done that previously, is this an absolute must now?

Unfortunately I do not have the same luxry as I did in my previous job where I could pull out a redundancy PSU for testing purposes.

From the iDRAC I did find the Test ID so I put in CPU0002 and I get an email from the iDRAC, but nothing in OME.

Thanks

Moderator

 • 

4.7K Posts

August 3rd, 2023 08:00

Hello DanSty,

 

When Discovering make sure that the device being monitored in OME has it's SNMP alerts configured to be sent to OME.  This part sometimes gets missed when doing Discovery.

When you discover the device make sure they are "Managed with Alerts". Check the "Enable trap reception..." check box

Step 8 page 76  OpenManage Enterprise 3.10 User's Guide

https://dell.to/45eSlmG

 

 

To receive Alerts on OMEnt the iDRAC needs SNMP Traps Configuration:

  1. iDRAC9 > Configuration > System Settings > Alert Configuration > SNMP Traps Configuration… Alert Destination check/ensure SNMP Traps include the OMEnt IP
  2. iDRAC9 > Configuration > System Settings > Alert Configuration > Alerts: make sure Alerts Enabled

 

Additional resources:

 

OpenManage Enterprise Alert Actions and Configuration 

https://dell.to/45aoQCg

 

iDRAC E-mail Alerts and SNMP Traps

https://dell.to/3rVqc5x

 

How to configure Integrated Dell Remote Access Controller (iDRAC) Email Alerts

https://dell.to/45eSnuO

 

1 Rookie

 • 

17 Posts

August 4th, 2023 08:00

Hello,

I removed all devices and ran a discovery again making sure I selected "Enable trap reception from discovered iDRAC servers and MX7000 chassis.

I can now confirm that alerts seem to be coming through, I looked at the iDRACS afterwards and can see that OME has put in the trap destination for me, so I didn't need to set up the traps manually on each iDRAC.

P.S, is this a new requirement to click on the option? I'm sure I never clicked that in the past a few years ago

Thanks for your help

Moderator

 • 

4.7K Posts

August 4th, 2023 08:00

Hello DanSty,

 

Glad to see it is working.

As far as I recall it was there from initial release or very shortly after.

1 Rookie

 • 

17 Posts

August 7th, 2023 04:00

Hello, just when I thought it was all sorted we had an alert over the weekend for a predicted failed disk.

The iDRAC itself sent an email about predicted failed disk which said the following


"System Host Name: REMOVED

Event Message: A predictive failure detected on drive 22 in disk drive bay 1.

Date/Time: Sat, 05 Aug 2023 03:34:56 -0500

Severity: Warning

 

Detailed Description: The controller received a SMART error from the drive. The drive is operational but needs replacement.

Recommended Action: The drive will need replacement at the next service window.

Message ID: PDR1002"

However, OME only sent this

"Event occurred for Device Name: REMOVED, Device IP Address: REMOVED, Identifier: REMOVED, UTC Time: 2023-08-05 02:35:08.758, Severity: Warning, Message ID: CDEV6175, Device health has deteriorated.

Check the device subsystems for components that require immediate attention. For information about device health statuses, see the Online Help by clicking the help icon. Also see the User's Guide available on the support site. "

Why didn't Dell OME send the information about the content of the issue rather than a generic message, do I need to configure something else?

Thanks

Moderator

 • 

3.5K Posts

August 7th, 2023 07:00

It seems like the difference in the content of the alert messages between iDRAC and Dell OpenManage Essentials (OME) might be due to the way each system handles and presents alert notifications.

iDRAC is a Dell remote access controller that provides out-of-band management capabilities. It is designed to be more low-level and hardware-focused. When it detects a predictive failure on a disk drive, it generates a detailed alert that includes specific information about the failure, such as the drive number, disk drive bay, and the SMART error received from the drive.

On the other hand, Dell OpenManage Essentials (OME) is a more comprehensive management tool that provides centralized monitoring and management of Dell servers, storage, and networking devices. It gathers data from various hardware components and management interfaces, including iDRAC. OME may aggregate and summarize alerts to provide a more high-level view of the overall system health. This is likely why the alert message from OME appears more generic and does not include the detailed information about the specific disk drive failure.

To receive more detailed and specific alert messages like the one from iDRAC, you may need to configure OME to provide more granular alerts or include specific alert settings for disk drive issues. OME should have some configuration options or settings where you can customize the level of detail you want to receive in the alerts.

You can check the OME documentation or contact Dell support for guidance on how to configure the alert notifications to get more detailed information about disk drive issues. Keep in mind that alert configurations might be different depending on the version of OME you are using, so make sure to refer to the appropriate documentation for your specific version.

No Events found!

Top