Start a Conversation

Unsolved

A

2 Posts

577

February 27th, 2019 23:00

Mechanics of how server works

Hi,

I need to understand how the Dell server responds to any hardware failure. Eg, lets say there is a raid array/controller failure in the server, I need to understand from how the server detects the failure, to how it sends the alert to the iDrac logs and then displays the service lights that are related to the error, eg red, amber or green light.

I tried googling for solutions but not getting the proper information. The reason I need to know is because I am researching on how to implement IoT in the servers for predictive maintenance, i.e., we are alerted of any hardware problem in the server, before the server crashes.

Thanks.

Moderator

 • 

6.2K Posts

February 28th, 2019 09:00

Hello

You can find information about the server on the system support page.

http://www.dell.com/support/

You can find information about individual components on the support page of that component. We do not update the documentation with every firmware release, so if there is no documentation on the latest firmware then keep going backward in firmware until you find the documentation.

http://www.dell.com/storagecontrollermanuals/

http://www.dell.com/idracmanuals/

It sounds like you are requesting design level information, that information is not likely available. If you are just trying to find out what methods a device/application can communicate(SNMP, IPMI, Redfish/OData, etc) with a server to gather information, you should be able to find that in the iDRAC manual. Most external communication to manage and monitor the server would go through the iDRAC.

Thanks

2 Posts

March 14th, 2019 20:00

How does iDRAC detect any faults in the server hardware, eg Smart Array failure, and send the alerts to the logs?

In this case, the situation will be that the iDRAC detects that the Smart Array has failed, and then only send the alert, thus the server already in a very bad state, whereby it crashes.

Does iDRAC send a warning to before the actual failure alert? So that the server crash can be prevented?

No Events found!

Top