Highlighted
exevolium
1 Copper

Chassis management controller (CMC) redundancy is lost. - DELL M1000e

Hello,

Sometimes the mails come up like this:

1.Critical Chassis management controller (CMC) redundancy is lost.
2.Critical Failure detected on Removable Flash Media.

The system logs and cmc logs on the M1000e are as follows:

System log:
CMC-BLADE03 Aug 8 15:31:25 EMAIL alert (re. Global Chassis) sent successfully to: arge @ (Arge)
CMC-BLADE03 Aug 8 15:30:34 Failed to query Blade-13 Memory information
CMC-BLADE03 Aug 8 15:30:34 Failed to query Blade-13 CPU information
CMC-BLADE03 Aug 8 15:30:18 Failed to query Blade-11 Memory information
CMC-BLADE03 Aug 8 15:30:18 Failed to query Blade-11 CPU information
CMC-BLADE03 Aug 8 15:29:50 Failed to query Blade-6 Memory information
CMC-BLADE03 Aug 8 15:29:50 Failed to query Blade-6 CPU information
CMC-BLADE03 Aug 8 15:26:09 Chassis health is Critical
CMC-BLADE03 Aug 8 15:25:47 EMAIL alert (re. Global Chassis) sent successfully to: arge @ Arge)
CMC-BLADE03 Aug 8 15:25:41 EMAIL alert (re.CMC Redundancy) sent successfully to: arge @ (Arge)
CMC-BLADE03 Aug 8 15:23:29 CMC2: failover was initiated by mond
CMC-BLADE03 Aug 8 15:23:14 System health failover / reboot request RC = 00008008

CMC Log:
Tue Aug 08 2017 15:31:24 Failure detected on Removable Flash Media.
Tue Aug 08 2017 15:25:15 The chassis management controller (CMC) is redundant.
Tue Aug 08 2017 15:25:14 Chassis management controller (CMC) redundancy is lost.
Tue Aug 08 2017 15:24:47 Failure detected on Removable Flash Media.
Tue Aug 01 2017 23:33:10 The chassis management controller (CMC) is redundant.
Tue Aug 01 2017 23:33:09 Chassis management controller (CMC) redundancy is lost.


The firmware of cmc cards is up to date. CMC1 and CMC2 Firmware:5.22
What exactly is the problem?

0 Kudos
3 Replies
Moderator
Moderator

RE: Chassis management controller (CMC) redundancy is lost. - DELL M1000e

Hello

CMC-BLADE03 Aug 8 15:23:14 System health failover / reboot request RC = 00008008

What is in the log before this? This is a system health event that initiated the failover. Typically the health event that caused the failover would be listed just before the detection of the health event is logged.

www.dell.com/support/article/sln283238/

Thanks

Daniel Mysinger
Dell EMC, Enterprise Engineer

Get support on Twitter @DellCaresPRO

exevolium
1 Copper

RE: Chassis management controller (CMC) redundancy is lost. - DELL M1000e

Hello Daniel,

Thank you for the answer but that's not exactly the answer I'm looking for.

In which situations does the following log occur?
"Critical Failure detected on Removable Flash Media."
"Critical Chassis management controller (CMC) redundancy is lost."

Unfortunately, these are all logs. No log before.
I think it's a problem with cmc cards, but they seem seamless in the iDRAC.

Regards.

0 Kudos
Moderator
Moderator

RE: Chassis management controller (CMC) redundancy is lost. - DELL M1000e

Everything that is happening after the event I quoted is a result of that event. The CMC is initiating a failover because of some health event. If it is happening on both CMCs then it is likely something they have in common. My first guess would be a networking problem. If the CMC loses network connectivity it will fail over.

Thanks

Daniel Mysinger
Dell EMC, Enterprise Engineer

Get support on Twitter @DellCaresPRO

0 Kudos