3 Apprentice

2.8K Posts

April 12th, 2012 07:00

Hi and thanks for the post.

I think to troubleshoot this we will need to know the details of the hard disk alert and compare them to the email alert filter that you have set up.

1. Confirm that the email alert filter you set up matches the *time* that the alert happened. Does your filter apply to a window of time, or is it set up to always trigger?

2. Confirm that the alert category from the disk alert matches the criteria you set up in the filter. Did you set up the filter for a limited set of alerts or for *all* of them?

3. Confirm that the server that triggered the alert is included in the alert filter.
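
The three checks above can be sketched in code. This is only an illustrative model, not how OME implements its filters: the `Alert` and `AlertFilter` classes, their field names, and the sample category and server names are all hypothetical. The point is that every criterion has to match for the action to fire, so a single mismatch is enough to suppress the email.

```python
from dataclasses import dataclass
from datetime import datetime, time
from typing import List, Optional, Set, Tuple


@dataclass
class Alert:
    timestamp: datetime
    severity: str
    category: str
    device: str


@dataclass
class AlertFilter:
    # None means "no restriction" for that criterion (hypothetical model).
    time_window: Optional[Tuple[time, time]] = None
    severities: Optional[Set[str]] = None
    categories: Optional[Set[str]] = None
    devices: Optional[Set[str]] = None

    def mismatches(self, alert: Alert) -> List[str]:
        """Return the names of the criteria the alert fails to match."""
        reasons = []
        if self.time_window is not None:
            start, end = self.time_window
            if not (start <= alert.timestamp.time() <= end):
                reasons.append("time")
        if self.severities is not None and alert.severity not in self.severities:
            reasons.append("severity")
        if self.categories is not None and alert.category not in self.categories:
            reasons.append("category")
        if self.devices is not None and alert.device not in self.devices:
            reasons.append("device")
        return reasons


# Made-up values: severity and device match, but the category does not.
disk_alert = Alert(datetime(2012, 4, 12, 6, 30), "Critical", "Storage", "server01")
email_filter = AlertFilter(
    severities={"Warning", "Critical"},
    categories={"System Events"},
    devices={"server01"},
)
print(email_filter.mismatches(disk_alert))  # ['category']
```

An empty list means the alert satisfies every criterion in this model, so the next place to look would be the email settings rather than the filter.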

Let's start with that.

Thanks,

Rob

6 Posts

April 12th, 2012 07:00

hello Rob

thanks for your quick answer!

1. no time or date filter is set at all

2. in the category filter, warning and critical are enabled

3. the server is included in the group named "Servers", which is set up in the devices filter

Reto

3 Apprentice

2.8K Posts

April 12th, 2012 10:00

#2: category is not the same as severity.  Be sure you check both.

Confirm the alert has the correct time, severity, category, etc. to match the filter.

You may have to look at the enterprise OID in the event console, then go to the category browser to see whether the alert meets the criteria.

Ensure email settings are accurate.

Use the test action function of the action to ensure the email server is set up correctly.  Also make sure the email did not get put in junk mail.

You can also go to email preferences in OME and turn on logging to see that the email got sent out.  But be sure *NOT* to leave the log checkbox on, as it will slow things down.
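
If you want to rule out the mail server independently of OME, a small script run from the OME machine can send a message through the same relay. This is a minimal sketch using Python's standard `smtplib`; the host name, port, and addresses are placeholders, and it assumes the relay accepts unauthenticated mail from that machine.

```python
import smtplib
from email.message import EmailMessage


def build_test_message(sender: str, recipient: str) -> EmailMessage:
    """Build a plain-text test message."""
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg["Subject"] = "SMTP relay test"
    msg.set_content("This is a test message to verify the SMTP relay.")
    return msg


def send_test_message(host: str, port: int, msg: EmailMessage,
                      timeout: float = 10.0) -> None:
    """Send the message; raises an smtplib or socket error on failure."""
    with smtplib.SMTP(host, port, timeout=timeout) as smtp:
        smtp.send_message(msg)


if __name__ == "__main__":
    # Placeholder values: replace with your own relay and mailbox.
    message = build_test_message("ome@example.com", "admin@example.com")
    send_test_message("smtp.example.com", 25, message)
```

If this message arrives but the OME alert email does not, the problem is more likely in the alert filter than in the mail path.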

Rob

6 Posts

April 13th, 2012 00:00

ok sorry, my fault!

in the category filter, everything is selected.

email settings are correct. the test message has been sent to the configured mailbox.

the log is almost empty. there are only a few errors because a server was down for a couple of minutes.

Reto

3 Apprentice

2.8K Posts

April 13th, 2012 07:00

Ok, so are you saying all is working ok now?

If not, you should:

1. enable the SMTP log

2. trigger a h/w failure (perhaps by simulating a temperature threshold change in OMSA: go into the OMSA GUI and temporarily change the temperature slider to cause an alert)

3. see if the email gets sent out and review the log.

Rob

6 Posts

April 16th, 2012 00:00

no, i'm not, sorry!

the test message has been sent, yes! and other warnings are also getting sent. the only warning that i don't get is the alert when a disk fails and the virtual disk becomes degraded!

Reto

3 Apprentice

2.8K Posts

April 16th, 2012 07:00

That's ok.  Alright, so please copy/paste the details of the alert into the thread here.  Be sure to mask your IP address and hostname.

Rob

6 Posts

April 18th, 2012 02:00

the alert looks like this:

2049 Tue Apr 17 14:50:18 2012 Storage Service Physical disk removed: Physical Disk 0:0:1 Controller 0, Connector 0  

 2048 Tue Apr 17 14:50:18 2012 Storage Service Device failed: Physical Disk 0:0:1 Controller 0, Connector 0
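
To compare such entries against a filter, it can help to split each line into fields. The sketch below is based only on the two lines pasted above: it treats the leading number as an event ID, the next five tokens as the timestamp, and splits the rest at the first colon. The function and field names are made up for illustration, and other log formats may need a different pattern.

```python
import re
from datetime import datetime

# Pattern inferred from the two sample lines; not an official log format.
ALERT_LINE = re.compile(
    r"^\s*(?P<event_id>\d+)\s+"
    r"(?P<when>\w{3} \w{3} +\d{1,2} \d{2}:\d{2}:\d{2} \d{4})\s+"
    r"(?P<message>.+?)\s*$"
)


def parse_alert_line(line):
    """Return a dict of fields, or None if the line does not match."""
    m = ALERT_LINE.match(line)
    if m is None:
        return None
    # Text before the first ": " is the summary, the remainder is the target.
    summary, _, target = m.group("message").partition(": ")
    return {
        "event_id": int(m.group("event_id")),
        "timestamp": datetime.strptime(m.group("when"), "%a %b %d %H:%M:%S %Y"),
        "summary": summary,
        "target": target,
    }


sample = ("2049 Tue Apr 17 14:50:18 2012 Storage Service Physical disk removed: "
          "Physical Disk 0:0:1 Controller 0, Connector 0")
print(parse_alert_line(sample)["event_id"])  # 2049
```

Note that the pasted lines contain no severity, category, or enterprise OID, so those still have to be read from the alert details.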

Reto

3 Apprentice

2.8K Posts

April 18th, 2012 07:00

Ok, but you need more info. Click on the alert details: What is the category?  What is the source?  And what is the enterprise OID?

Then you need to go to the alert console and view the categories to ensure they are defined.  And finally, look at your filter to be sure that they are all 'checked' in the right place.

Thanks!

Rob

6 Posts

April 18th, 2012 08:00

ok, i think i figured it out. in "Dell OpenManage Server Administrator" on the server that is to be monitored, do i have to check all the alerts and warnings so that they are forwarded?

Reto
