PowerEdge: IR7000: M7725 ConnectX-8 Disconnection with PCI3040 Error During High System Load

Summary: In rare cases, such as the CPU, memory, NVIDIA ConnectX-8 InfiniBand card, and other PCIe cards are under a heavy workload, the ConnectX-8 connection drops for about five seconds. When this happens, a PCI3039 or PCI3040 error is logged in the System event log (SEL). ...

This article applies to This article does not apply to This article is not tied to any specific product. Not all product versions are identified in this article.

Symptoms

In rare corner cases, when the M7725 is configured with a NVIDIA ConnectX-8 (P/N: NYKN5), the LC/SEL logs report a PCI3039 or PCI3040: A high-severity issue is detected error on the CX-8 InfiniBand card.

screenshot or error in SELlog

Concurrently, the OS logs recorded a CmpltTO (Completion TimeOut) PCIe fatal error on the ConnectX-8 card, which then recovers successfully after about five seconds.

OSlog hardware error

OSLog link active after recovery

Cause

Unknown

Resolution

Follow the NVIDIA Performance Tuning Guide This hyperlink is taking you to a website outside of Dell Technologies. to enable Relaxed Ordering (RO) on the ConnectX-8 card.

NVIDIA Relaxed Ordering

Affected Products

Mellanox Family of Adapters, PowerEdge M7725
Article Properties
Article Number: 000357077
Article Type: Solution
Last Modified: 21 Nov 2025
Version:  2
Find answers to your questions from other Dell users
Support Services
Check if your device is covered by Support Services.