
OneFS: In rare situations, Mellanox 40 Gb and InfiniBand cards may stop responding to commands

Summary: OneFS: In rare situations, Mellanox 40 Gb and InfiniBand cards may stop responding to commands.


Article Content


Symptoms



An internal error in the Mellanox 40 Gb or InfiniBand card may cause the card to fail. When the failure occurs, the interface no longer responds to commands such as ifconfig or pciconfig. In addition, if the card is configured for an external network, FlexNet and SmartConnect are unable to assign IP addresses to the interface.
 
Evidence of the failure appears in the messages file and includes entries like the following:
 
Errors indicating that the driver can no longer post commands (note the driver number, mlx4_core1):
mlx4_core1: mlx4_cmd_post:cmd_pending failed

Or
 
Messages indicating that an internal error was detected (note the driver number, mlx4_core1):
2018-12-26T16:31:34-08:00 <0.7> isilon-1 /boot/kernel.amd64/kernel: mlx4_core1: Internal error detected:
2018-12-26T16:31:34-08:00 <0.7> isilon-1 /boot/kernel.amd64/kernel: mlx4_core1:   buf[00]: ffffffff
.
.
2018-12-26T16:31:34-08:00 <0.3> isilon-1 /boot/kernel.amd64/kernel: mlx4_en mlx4_core1: Internal error detected, restarting device
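
The two signatures above can be searched for programmatically. The following is a minimal sketch, not a supported tool: the function name find_mlx4_failures and the signature labels are hypothetical, and the patterns are derived only from the log excerpts shown in this article. The driver index (the digit after mlx4_core) is captured rather than hard-coded, since it may differ from card to card.

```python
import re

# Patterns derived from the log excerpts in this article (an assumption:
# other OneFS releases may word these messages differently).
SIGNATURES = {
    "cmd_post_failed": re.compile(r"(mlx4_core\d+): mlx4_cmd_post:cmd_pending failed"),
    "internal_error": re.compile(r"(mlx4_core\d+): Internal error detected"),
}


def find_mlx4_failures(lines):
    """Return (line_number, driver, signature_name) for each matching line."""
    hits = []
    for number, line in enumerate(lines, start=1):
        for name, pattern in SIGNATURES.items():
            match = pattern.search(line)
            if match:
                # group(1) is the driver instance, for example mlx4_core1
                hits.append((number, match.group(1), name))
                break
    return hits
```

To use it, pass any iterable of log lines, for example an open file handle for a copy of the node's messages file. Each hit reports the line number, the driver instance, and which signature matched.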

Cause

This occurs when Cisco BiDi QSFP+ optics are used with this card. The optic can draw up to 3.5 W of power, while the NIC can supply a maximum of only 1.5 W. Because the difference is too great for the input rail to handle, the NIC stops functioning, which causes the node to panic.

Resolution

Workaround: Use a non-BiDi optical cable to avoid excessive power draw.

Solution: Shut down the node and replace the NIC. Replacement NICs are available with a larger fuse and power capacity.

Additional Information

This content is available in the following languages:
https://downloads.dell.com/TranslatedPDF/AR-SA_530469.pdf
https://downloads.dell.com/TranslatedPDF/DE_530469.pdf
https://downloads.dell.com/TranslatedPDF/ES_530469.pdf
https://downloads.dell.com/TranslatedPDF/ES-XL_530469.pdf
https://downloads.dell.com/TranslatedPDF/FR_530469.pdf
https://downloads.dell.com/TranslatedPDF/IT_530469.pdf
https://downloads.dell.com/TranslatedPDF/JA_530469.pdf
https://downloads.dell.com/TranslatedPDF/KO_530469.pdf
https://downloads.dell.com/TranslatedPDF/NL_530469.pdf
https://downloads.dell.com/TranslatedPDF/PT_530469.pdf
https://downloads.dell.com/TranslatedPDF/PT-BR_530469.pdf
https://downloads.dell.com/TranslatedPDF/RU_530469.pdf
https://downloads.dell.com/TranslatedPDF/SV_530469.pdf
https://downloads.dell.com/TranslatedPDF/ZH-CN_530469.pdf
https://downloads.dell.com/TranslatedPDF/ZH-TW_530469.pdf

Article Properties


Product

Isilon Gen6, Isilon HD400, Isilon NL410, Isilon S210, Isilon X210, Isilon X410

Last Published Date

22 May 2023

Version

4

Article Type

Solution