Highlighted
dpetrek
Bronze

CPU Machine Chk: processor sensor, transition to non-recoverable

New PowerEdge 2950 server, freshly installed with all the latest firmware and drivers.
Rebooted today with following error in ESM log:
 
"CPU Machine Chk: processor sensor, transition to non-recoverable"
 
Why did this happen, how can I resolve the issue?
I have DSET report available if somene cares to take a look.
 
Thank you,
Drazen
0 Kudos
30 Replies
majagual
Copper

Re: CPU Machine Chk: processor sensor, transition to non-recoverable

I have the same problem can somebody help us.
thanks.
Majagual
0 Kudos
bowo
Copper

CPU Machine Chk E1422 on poweredge 2950

So do I,  in my office, Poweredge 2950 , running Windows Server 2003 Standard Edition SP1 , BOD and rebooted at least 3 times a day with the same error message "E1422  CPU machine Chk ". While the load is average 25% , because it only serve as File Server + exchange server+ DNS + Wins + DHCP + Domain Controller + Application Server.

 

The funny thing is our previous machine with the exactly same settings, purposes and load and is only a "home made" server using old Pentium 4 and common desktop mainboard, it never hung,  at least for a week....

0 Kudos
njaure
Copper

Re: CPU Machine Chk E1422 on poweredge 2950

Has anybody found out something? I have the same issue with a PE2950 - 2 CPU Dualcore

I've just open a Support Case, and they asked me to do some HW tests, basically boot the machine with just one CPU, and then the other one. In order to know which CPU is working wrong.

I'll do it this afternoon. If I have some news I will let you know.

0 Kudos
CodeMan47
Copper

Re: CPU Machine Chk E1422 on poweredge 2950

Anyone have any luck fixing this?  Similar issue, same error message on the front lcd thing.  Machine is currently running just fine, but with this message I'm kind of concerned.  I talked to a Dell rep and they just suggested upgrading the bios, perc driver, perc firmware - personally i don't know what this would accomplish with it being a CPU error...  The bios maybe, but the raid controller???

Anyway, if anyone was able to correct this could you please let me know what the issue was.  Many thanks!

 

-Jon

0 Kudos
csorel
Copper

Re: CPU Machine Chk E1422 on poweredge 2950

Jon,

My server just started this as well and did you find a solution?

 

Chris

0 Kudos
CodeMan47
Copper

Re: CPU Machine Chk E1422 on poweredge 2950

Chris,

I'm not 100% sure what ended up fixing it, but what was suggested by Dell and what I did was update the Bios, BMC, Perc5 driver and Perc5 firmware.  After these updates I rebooted then hit CTRL+E and cleared the log in there - this is where the error was actually logged, so it's possible that all I had to do was clear this log and may have been fine, but figured since they suggested doing the updates that I might as well just do them.

 

Here are the links I was given to the updates:

BIOS:

BMC:

 

PERC5 driver:

<ADMIN NOTE: Broken link has been removed / replaced from this post by Dell>

PERC5 Firmware:

 

 

unclejose
Copper

Re: CPU Machine Chk E1422 on poweredge 2950

FYI, on a PE2950III it still took 9 minutes to flash the BMC. I thought it crashed and restarted, that's bad news. But if you make this error, just restart and retry it, give it all the time it needs and it should work. Note that during update, it will thrash, fans will phase up and down madly, and the LCD will go nuts. All SOP.

 

All the same other CPU errors as the rest of you guys too, even with the newest firmwares.

- Joe

0 Kudos
rmjds73
Copper

Re: CPU Machine Chk: processor sensor, transition to non-recoverable

Same issue here, after contacting Dell support, they told me to run the dset application and send them the report.  They called me back saying the system was fine, that it had no erros, and it could be a bug from a sensor so i was told to update the BIOS and BMC, and to run the dset application again with option 3 (clear esm logs)

I did that and the system is running again, and the error code is gone!!!   I think that simply clearing the logs it will remove the error, but it may eventually come up once again, so its best to do the firmware upgrades.  This error is very generic, and in order to know what the problem is you need to run a diagnostics, it can be a problem with any of the hardware on the system or it may not be a problem at all, sometimes just a bug.

RoyceRacer
Bronze

Re: CPU Machine Chk: processor sensor, transition to non-recoverable

We, too, have had the E1422 CPU Machine Chk error show up on one of our Dell 2950's 3 times.  We've called Dell and had them come to replace parts two times.  The error still comes up.   This is on a SLES 11 box.  We've confirmed that the BIOS and BMC are up-to-date.  Still happening.

RLR:-)

0 Kudos