March 8th, 2013 07:00
Missing something - Health status red X - System Administration
Good morning,
I'm just getting into OME. I have installed it, patched it to 1.1.1, and installed OMSA on several of my Dell servers. The servers show up in OME, but every one of them has a red X for health status. The inventory info shows a red X beside Server Administrator, with version 7.2.
I can access the OMSA by typing in http://Server_IP:1311 from my OME box and can log in, etc.
I'm missing something simple - any ideas?
Thanks.
Damian
DELL-Rob C
March 8th, 2013 10:00
Hi Damian.
A few ideas.
First off, when you get a red X next to SA, right-click that server in the device tree and launch OMSA (or open it the way you described). The details in OMSA should indicate the cause of the critical status.
Depending on the situation, you may also have received an SNMP trap for this, so check your alert console.
One last idea is to go to the hardware log tab and look at the hardware logs for that server.
That should be a start.
Good luck,
Rob
DamianBailey
March 8th, 2013 15:00
Rob,
Thanks - excellent suggestions. I think you were right:
Server #1 actually has a bad DIMM (good to know).
Servers #2 and #3 each have only one power supply plugged in.
Which leads me to my next question: why did I get only a red health X and no alerts for these conditions? Is something misconfigured?
The servers are a mix of Windows Server 2012 and 2008, if that matters.
Thanks again.
Damian
DamianBailey
March 8th, 2013 15:00
Just a follow-up on server #1 above: according to chat support, it wasn't a bad DIMM after all. They had me run: C:\Program Files\Dell\SysMgt\omsa\bin\dcicfg32 command=clearmemfailures
which reports whether the DIMMs are OK; in my case they are. They also recommended that I make sure my BIOS is up to date (currently 2.7.0).
All green in the OME now. Thanks again.
DELL-Rob C
March 8th, 2013 18:00
Hey Damian,
Good to know.
So to answer your question on why you did not get the alerts...
#1. Be sure you have configured the managed nodes to send SNMP traps to OME. This is discussed in the OME tutorial section. Basically, you go into the SNMP properties on each managed node (server) and enter the IP address or name of the OME server so that traps are sent to the OME box.
#2. Of course, if the failure condition in the DIMM pre-dated your OME installation, then OME would not have received the alert :-) *Now* that you have OME installed, if your server detects another similar condition, you will see the alert in the OME alert console.
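To confirm that traps configured in #1 actually reach the receiving machine, something like the Python sketch below could help. It is not part of OME or OMSA; the function names `open_trap_socket` and `wait_for_datagram` are made up for illustration. It assumes SNMP traps arrive as UDP datagrams on the standard port 162, that the script runs with enough privileges to bind that port, and that no other trap receiver (such as the Windows SNMP Trap service) is already holding it. It only reports who sent a datagram and how large it was; it does not decode the trap.

```python
import socket


def open_trap_socket(port=162, bind_addr="0.0.0.0"):
    """Bind a UDP socket on the given port (162 is the standard SNMP trap port)."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    sock.bind((bind_addr, port))
    return sock


def wait_for_datagram(sock, timeout=30.0):
    """Wait for one datagram.

    Returns (sender_ip, payload_length), or None if nothing arrives
    before the timeout expires. The payload is not parsed.
    """
    sock.settimeout(timeout)
    try:
        data, (host, _port) = sock.recvfrom(65535)
    except socket.timeout:
        return None
    return host, len(data)


# Example use (assumption: run on the box that should receive the traps,
# with any existing trap receiver on port 162 stopped first):
#
#   sock = open_trap_socket()
#   print(wait_for_datagram(sock, timeout=60.0))
```

If a test trap sent from a managed node shows up here with that node's IP address, the trap destination on the node is set correctly and any remaining problem is on the receiving side.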
Regards,
Rob