NVIDIA H100 GPU: nvidia-smi Output Reports an Incorrect Aggregate SRAM Correctable Value

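Summary: On the NVIDIA H100 graphics processing unit (GPU), nvidia-smi output may report an incorrect value for the aggregate static random-access memory (SRAM) correctable error counter.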


Symptoms

Example output from the commonly used command "nvidia-smi -q -d ECC":

 

[Figure: nvidia-smi log output showing an Aggregate SRAM Correctable value of 18446744073709551615]

In the example above, the Aggregate SRAM Correctable value of 18446744073709551615 is abnormally high and incorrect.
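The reported value equals 2^64 - 1, the largest number an unsigned 64-bit integer can hold, so it can be recognized programmatically. The following is a minimal Python sketch, not a Dell or NVIDIA tool, that scans saved "nvidia-smi -q -d ECC" text for counters with exactly that value. The function name find_suspect_counters and the abridged SAMPLE text are hypothetical and only illustrate the general indented "name : value" layout; real output contains more fields.

```python
import re

# 2**64 - 1 == 18446744073709551615, the value shown in this article.
UINT64_MAX = 2**64 - 1

# Matches lines of the form "<name> : <unsigned integer>".
_COUNTER = re.compile(r"^\s*(?P<name>[^:]+?)\s*:\s*(?P<value>\d+)\s*$")

# Hypothetical, abridged sample; not a verbatim capture from a real system.
SAMPLE = """\
    ECC Errors
        Volatile
            SRAM Correctable              : 0
        Aggregate
            SRAM Correctable              : 18446744073709551615
            SRAM Uncorrectable            : 0
"""


def find_suspect_counters(ecc_text):
    """Return (section, counter name) pairs whose value equals UINT64_MAX."""
    suspects = []
    section = None
    for line in ecc_text.splitlines():
        stripped = line.strip()
        # Track which block of counters we are in (assumed section names).
        if stripped in ("Volatile", "Aggregate"):
            section = stripped
            continue
        match = _COUNTER.match(line)
        if match and int(match.group("value")) == UINT64_MAX:
            suspects.append((section, match.group("name")))
    return suspects


print(find_suspect_counters(SAMPLE))
```

A counter flagged this way matches the symptom described here; the article does not state the exact mechanism behind the miscalculation, so treat a match as a prompt to check the driver version rather than as proof of hardware errors.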

 

Cause

The SRAM counter is calculated incorrectly.

Resolution

Update to NVIDIA H100 driver package version 570.124.06 or later.
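To confirm whether a host already runs a driver at or above this version, the installed version can be compared against 570.124.06. The sketch below is one possible check, assuming Python 3 and that nvidia-smi is on the PATH; the helper names parse_version, has_fix, and installed_driver_versions are hypothetical. It relies on the standard nvidia-smi query "--query-gpu=driver_version --format=csv,noheader".

```python
import subprocess

# Minimum driver version named in this article: 570.124.06.
FIXED_VERSION = (570, 124, 6)


def parse_version(text):
    """Turn a dotted version string such as '570.124.06' into (570, 124, 6)."""
    return tuple(int(part) for part in text.strip().split("."))


def has_fix(driver_version):
    """True if the driver version is 570.124.06 or later."""
    return parse_version(driver_version) >= FIXED_VERSION


def installed_driver_versions():
    """Ask nvidia-smi for the driver version reported for each GPU."""
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=driver_version", "--format=csv,noheader"],
        capture_output=True,
        text=True,
        check=True,
    )
    return [line.strip() for line in result.stdout.splitlines() if line.strip()]
```

On a system with the driver installed, calling [has_fix(v) for v in installed_driver_versions()] returns one boolean per GPU; any False entry indicates a driver older than the version listed above.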

Affected Products

PowerEdge XE8640, PowerEdge XE9640, PowerEdge XE9680

Article Properties

Article Number: 000317812
Article Type: Solution
Last Modified: 12 May 2025
Version: 1