Connectrix:Cisco MDS 多个接口丢帧导致服务性能问题
Summary: 多个接口丢帧导致服务性能问题
Symptoms
多个接口丢弃帧导致客户环境出现服务性能问题。
MDS 交换机记录以下 PMON 警报:-
logging log:PMON-SLOT1-3-RISING_THRESHOLD_REACHED: Credit Loss Reco has reached the rising threshold PMON-SLOT1-3-RISING_THRESHOLD_REACHED: TX Credit Not Available has reached the rising threshold
Cause
板载日志记录日志审查:
以下 IPA 错误表示数据包在接收后损坏。 应更换发生这些问题的模块`show hardware internal errors`|------------------------------------------------------------------------|| Device:Tbird Que Driver Role:QUE Mod: 1 || Device Statistics Category :: ERROR|------------------------------------------------------------------------|Instance:0Cntr Name Value Ports----- ---- ----- ----- 24 THB_IPA_IPA0_CNT_BAD_CRC 0000000000637620 1-4 -
25 THB_IPA_IPA0_CNT_CORRUPT 0000000000637620 1-4 -
以下 EBM 错误与出口线卡上收到的损坏数据包相同。这些数据包可能会出现在接收这些损坏数据包的多个模块上。这些模块没有故障。更换生成 IPA 错误的模块,这些错误将停止:`show hardware internal errors`|------------------------------------------------------------------------|| Device:Tbird Fwd Driver Role:L2 Mod: 2 || Device Statistics Category :: ERROR|------------------------------------------------------------------------|Instance:1Cntr Name Value Ports----- ---- ----- ----- 300 THB_EBM0_ACC_ERR_QUEUE_DROP_PKT 0000000000000017 9-12 - <--------
301 THB_EBM0_CNT_ERR_QUEUE_DROP_SF 0000000000000010 9-12 - <--------13414 TBIRD_FWD_EBM_0_PACK0_EPR_DROP 0000000000000001 9-12 - 13422 TBIRD_FWD_EBM_0_PACK1_EPR_DROP 0000000000000005 9-12 - 13430 TBIRD_FWD_EBM_0_PACK2_EPR_DROP 0000000000000001 9-12 -
13438 TBIRD_FWD_EBM_0_PACK3_EPR_DROP 0000000000000003 9-12 - 13727 TBIRD_FWD_EPR0_PKT_CRC_ERR 0000000000000001 9-16 - 13755 TBIRD_FWD_EPR1_PKT_CRC_ERR 0000000000000005 9-16 - 13783 TBIRD_FWD_EPR2_PKT_CRC_ERR 0000000000000001 9-16 -
13811 TBIRD_FWD_EPR3_PKT_CRC_ERR 0000000000000003 9-16 - |------------------------------------------------------------------------|| Device:Tbird Fwd Driver Role:L2 Mod: 3 || Device Statistics Category :: ERROR|------------------------------------------------------------------------|Instance:2Cntr Name Value Ports----- ---- ----- ----- 300 THB_EBM0_ACC_ERR_QUEUE_DROP_PKT 0000000000068016 17-20 -
301 THB_EBM0_CNT_ERR_QUEUE_DROP_SF 0000000000057196 17-20 - |------------------------------------------------------------------------|| Device:Tbird Fwd Driver Role:L2 Mod: 4 || Device Statistics Category :: ERROR|------------------------------------------------------------------------|Instance:0Cntr Name Value Ports----- ---- ----- ----- 300 THB_EBM0_ACC_ERR_QUEUE_DROP_PKT 0000000000004504 1-4 -
301 THB_EBM0_CNT_ERR_QUEUE_DROP_SF 0000000000003598 1-4 - |------------------------------------------------------------------------|| Device:Tbird Fwd Driver Role:L2 Mod: 5 || Device Statistics Category :: ERROR|------------------------------------------------------------------------| 300 THB_EBM0_ACC_ERR_QUEUE_DROP_PKT 0000000000020069 1-4 -
301 THB_EBM0_CNT_ERR_QUEUE_DROP_SF 0000000000019715 1-4 -
EBM 错误与出口线卡上收到的损坏数据包相同。这些数据包可能会出现在接收这些损坏数据包的多个模块上。这些模块没有故障。更换生成 IPA 错误的模块,这些错误将停止:
仅在 MDS DS-X9232-256K9 和 DS-X9248-256K9 模块上发生。
Resolution
修复:
更换线卡,记录以下错误:THB_IPA_IPA0_CNT_BAD_CRC THB_IPA_IPA0_CNT_CORRUPT
提醒:port-monitor 有 3 个特殊计数器,可以在上述情况下发出警报。由于大多数情况下使用默认的“慢漏”策略,因此不会监视这些策略。
计数器为:
err-pkt-from-port
err-pkt-to-xbar
err-pkt-from-xbar
另请参阅其他信息:
CSCum09652 将第 4 代 IPA 和 EFI 错误添加到端口监视器 xbar 错误