Unsolved

This post is more than 5 years old

1 Rookie

 • 

24 Posts

612512

June 20th, 2013 16:00

Power Supply Redundancy Lost with no corresponding PSU Issue

Looking for troubleshooting advise for this issue being alerted as Critical in OME.

I am receiving solely the Power Supply redundancy is lost alert without any sort of issues with either of the redundant power supplies. There isn't anything correlating in the hardware or event logs concerning power or the PSUs. However, I am seeing a "The memory module 1 temperature is greater than the upper warning threshold." at the same time frame (only thing consistently mapping up over past week. The redundancy is lost for 8-10 hours and then comes back for 16 or so hours over past few weeks.

All firmware and drivers are updated to current Dell pack. that seemed to address it for a day or two...

Before I send a field person to look at it, any ideas what this might be???

Moderator

 • 

9.4K Posts

June 20th, 2013 16:00

What model server is this on? Do the same errors show up in the server administrator logs, drac logs or on the front display on the server?

2 Intern

 • 

793 Posts

June 20th, 2013 17:00

If the Hardware Logs and Alert Logs disagree in OME, then make sure your OMSA and BMC/ESM/DRAC firmwares are up to date.  Odd error combinations like this are usually caused by out of sync OMSA and embedded server management because they aren't validated to work with each other.

1 Rookie

 • 

24 Posts

June 21st, 2013 09:00

Sorry if I was unclear. The logs all have the same error solo error. The issues is that it is JUST a redundancy lost alarm and absolutely no issues in any logs with the associated power supplies.

OME, OMSA (on machine), iDRAC, ESM all show the Redundancy Lost alarm. (don't want to truck the 20 minutes to check the front display, hence remote management tools :) )

The only other alarm that matches up in any of the logs is that memory high temp warning (and it does match up)

It is a DELL PowerEdge R710 with Server 2003 R2. I have used OME to fully update the machine to current packs.

So how is there a redundancy lost when both PSU are working perfectly fine? And a consistent and recurring redundancy lost?

1 Rookie

 • 

24 Posts

June 21st, 2013 10:00

Nope. Not is OME or OMSA.

 

Moderator

 • 

9.4K Posts

June 21st, 2013 10:00

If you look under the FRU tab in OMSA what does it show for the model and firmware version of the power supplies? Some of them have a firmware update that is not done with the normal updates.

Moderator

 • 

9.4K Posts

June 21st, 2013 10:00

Does it have a part number for the power supply?

1 Rookie

 • 

24 Posts

June 21st, 2013 10:00

The power supply lists the version as 08.05.00

Moderator

 • 

9.4K Posts

June 21st, 2013 10:00

Here is the update for the 570W power supply, if it is a flextronics branded one this update may help, if it is a different brand it wont update: Flextronics 570W (VPR1M) Power Supply Firmware Version 1.32, Released 1/12/12, Optional

http://ftp.us.dell.com//FOLDER39977M/1/PSU_FRMW_WIN_R242159.EXE

 

You may be able to get the part number under the FRU section seperate from the power supply section on the left pane in OMSA.

1 Rookie

 • 

24 Posts

June 21st, 2013 11:00

The power supplies are not listed on the FRU on OMSA or OME.

And that wasn't the right package for the system. :(

Moderator

 • 

9.4K Posts

June 21st, 2013 13:00

Can you run a DSET report http://downloads.dell.com/FOLDER01378061M/1/Dell_DSET_3.4.0.271.exe on the server and send it to me? Josha_craig@dell.com

It sounds like it might be a motherboard issue with the odd reporting and the memory temperature errors.

1 Rookie

 • 

24 Posts

June 21st, 2013 13:00

Um yeah, exactly what I originally said. Redundancy lost with no issue for Power supplies :)

Yes, the memory high temp happens almost every time.

Yes it is under warranty

Moderator

 • 

9.4K Posts

June 21st, 2013 13:00

In that screenshot it shows redundancy lost but each of the power supplies show healthy. Do the memory temperatures happen repeatedly as well when the power supply redundancy is lost? Is the system under warranty?

1 Rookie

 • 

24 Posts

June 21st, 2013 14:00

Doing that now. I also checked the box that says "upload automatically to dell site' not sure what that will do :)

14 Posts

June 4th, 2014 08:00

Did you ever find a solution to this? I just experienced the same issue on an R620. Power supply redundancy lost and at the same a time, a memory temp alarm. However, the power supplies show up as normal/green in open manage.

1 Message

July 28th, 2014 14:00

The error is caused by a firmware bug in the 8.05 package. 

To resolve the issue you will need to apply a power supply firmware update. 

Windows:

http://ftp.us.dell.com/power/PSU_FRMW_WIN_R213337.EXE

 

Linux:

http://ftp.us.dell.com/power/PSU_FRMW_LX_R213337.BIN

No Events found!

Top