Start a Conversation

Unsolved

This post is more than 5 years old

K

4191

December 6th, 2016 06:00

Dell Poweredge SC1435 temp sensor failure ??

Hi

I Have a problem with my Dell Poweredge SC1435. It boots on one CPU only, the fans are all the time in full speed and HW monitor application don't show any air temperature. What it comes to my mind is that the air temp sensor collapsed (as fans don't slow down). Is it possible to fix it somehow ??

Krzysztof

December 6th, 2016 12:00

When it happen Bios was 2.2.3 and BMC 2.2.2. I updated Bios to 2.2.5 yesterday and after your mail reinstalled BMC to 2.2.2 but it didn't help. During installation Fans slowed down for a second and speed up to full again.

Krzysztof

Moderator

 • 

8.5K Posts

December 6th, 2016 12:00

Krzysiaczek99,

I would actually start by seeing if the server is up to date on its BIOS and ESM to start, the reason being is that the Fans are primarily controlled by the BMC. If the server is really out of date I would NOT suggest running the very latest update, as that can cause unrecoverable issues. If you let me know what revision they are at currently I can get you the links you will need. 

Let me know what you see.

Moderator

 • 

8.5K Posts

December 7th, 2016 07:00

With it only booting to the single CPU, have you tried swapping the CPU's to see if it is a slot or processor issue? May also be that the thermal paste between heatsink and processor is bad as well.

December 7th, 2016 08:00

Yes, I was swaping CPU so all of them work well  in 1st slot. After downgrading BIOS from 2.2.5 to 2.2.3 i got fan problem fixed temporary i.e. on one cpu fans were working ok but when I added 2nd cpu problem returned i.e. no boot and after fans in full on one cpu afterwards. Than I repeated those procedure i.e. upgraded to 2.2.5 and downgraded 2.2.3 but I didnt got  good fans for one cpu anymore.

I have another sc1435 and BIOS there is Phoenix 2.2.3 PLUS just wonder what PLUS means...It has no Dell logo during booting

Krzysztof

December 8th, 2016 06:00

Finally i got BMC IPMI viever up and running and it shows me ambient temperature -83 deg so I believe it can be a reason of fans full speed. Any idea how to fix it ?? Another sc1435 has ambient temp 26deg

Krzysztof

Moderator

 • 

8.5K Posts

December 8th, 2016 08:00

Are you seeing an amber light or an error being displayed? Anything in the hardware regarding sensor failures?

December 8th, 2016 09:00

Yes, its blinking even that it booted OK.

Krzysztof

December 8th, 2016 12:00

During booting with both CPUs even if booting don't start BMC is working. Here are the sensors

CPU2 looks OK here. So is it possible that due to ambient temp. sensor failure booting with 2 CPUs is stopped and it boots with one CPU only, puts fans in full and blinks the led ?? Is this sensor reparailable ?? or maybe is easier to patch BMC software and hardcode ambient temperaature ??

Does anybody has expirience with it ??

Krzysztof

C:\Program Files (x86)\Dell\SysMgt\bmc>ipmish -ip 192.168.1.4 -u Krzysztof -p 05
1263KF sensor
Index : 1
Status : Good
Probe Name : System Board CMOS Battery

Index : 2
Status : Good
Probe Name : CPU1 VCORE

Index : 3
Status : Good
Probe Name : System Board VDDIO

Index : 4
Status : Good
Probe Name : CPU1 VDDA

Index : 5
Status : Good
Probe Name : CPU1 VTT

Index : 6
Status : Good
Probe Name : CPU2 VCORE

Index : 7
Status : Good
Probe Name : System Board VDDIO

Index : 8
Status : Good
Probe Name : CPU2 VDDA

Index : 9
Status : Good
Probe Name : CPU2 VTT

Index : 10
Status : Good
Probe Name : System Board VDD 1.2V PG

Index : 11
Status : Good
Probe Name : System Board Linear PG

Index : 12
Status : Normal
Probe Name : System Board FAN MOD 1A RPM
Reading : 14100 RPM
Minimum Warning Threshold : N/A
Maximum Warning Threshold : N/A
Minimum Failure Threshold : 2175 RPM
Maximum Failure Threshold : N/A

Index : 13
Status : Normal
Probe Name : System Board FAN MOD 1C RPM
Reading : 11625 RPM
Minimum Warning Threshold : N/A
Maximum Warning Threshold : N/A
Minimum Failure Threshold : 2175 RPM
Maximum Failure Threshold : N/A

Index : 14
Status : Normal
Probe Name : System Board FAN MOD 1B RPM
Reading : 14700 RPM
Minimum Warning Threshold : N/A
Maximum Warning Threshold : N/A
Minimum Failure Threshold : 2175 RPM
Maximum Failure Threshold : N/A

Index : 15
Status : Normal
Probe Name : System Board FAN MOD 1D RPM
Reading : 11625 RPM
Minimum Warning Threshold : N/A
Maximum Warning Threshold : N/A
Minimum Failure Threshold : 2175 RPM
Maximum Failure Threshold : N/A

Index : 16
Status : Normal
Probe Name : System Board FAN MOD 2A RPM
Reading : 14400 RPM
Minimum Warning Threshold : N/A
Maximum Warning Threshold : N/A
Minimum Failure Threshold : 2175 RPM
Maximum Failure Threshold : N/A

Index : 17
Status : Normal
Probe Name : System Board FAN MOD 2C RPM
Reading : 11775 RPM
Minimum Warning Threshold : N/A
Maximum Warning Threshold : N/A
Minimum Failure Threshold : 2175 RPM
Maximum Failure Threshold : N/A

Index : 18
Status : Normal
Probe Name : System Board FAN MOD 2B RPM
Reading : 14775 RPM
Minimum Warning Threshold : N/A
Maximum Warning Threshold : N/A
Minimum Failure Threshold : 2175 RPM
Maximum Failure Threshold : N/A

Index : 19
Status : Normal
Probe Name : System Board FAN MOD 2D RPM
Reading : 11400 RPM
Minimum Warning Threshold : N/A
Maximum Warning Threshold : N/A
Minimum Failure Threshold : 2175 RPM
Maximum Failure Threshold : N/A

Index : 20
Status : Present
Probe Name : CPU1 Presence

Index : 21
Status : Present
Probe Name : CPU2 Presence

Index : 22
Status : Presence Detected
Probe Name : CPU1 Status

Index : 23
Status : Presence Detected
Probe Name : CPU2 Status

Index : 24
Status : None
Probe Name : System Board OS Watchdog

Index : 25
Status : Chassis is open
Probe Name : System Board Intrusion


C:\Program Files (x86)\Dell\SysMgt\bmc>

Moderator

 • 

8.5K Posts

December 9th, 2016 13:00

So the server boots without error if only one CPU is installed, regardless of if it is CPU1 or CPU2 in slot 1, but when both are installed at same time it does error?

December 9th, 2016 14:00

yes but with one cpu fans are in full and amber diode is blinking, ambient temp. sensor in both cases (1 and 2 CPUs) is -83 deg. So I believe it boots on one cpu with error notification. (blinking diode).

Krzysztof

December 11th, 2016 15:00

Cris,

So can you tell me where the ambient temp sensor is located at sc1435 MB  ??

Unfortunately its not possible to insert image here otherwise I would post scree from IPMIViev program.

Krzysztof

Moderator

 • 

8.5K Posts

December 14th, 2016 08:00

The sensor itself is embedded to the motherboard, so it isn't replaceable without the replacing the entire board.

No Events found!

Top