跳至主要內容
  • 簡單快速地下訂單
  • 檢視訂單及追蹤商品運送狀態
  • 建立並存取您的產品清單
  • 使用「公司管理」來管理您的 Dell EMC 網站、產品和產品層級連絡人。

Excessive SMART Error Rates Logged for Read and Verify ECC Errors on Certain Enterprise Hard Drives

摘要: High SMART error rates on some Enterprise HDD are informational and have no bearing on hardware health.

本文可能採用自動翻譯。如果您對翻譯品質有任何寶貴意見,請使用此頁面底部的表單告訴我們,謝謝。

文章內容


症狀

-

原因

-

解析度

Table of Contents

1. Introduction
2. Description
3. Solution
4. Further Information

 


Introduction

S.M.A.R.T. (Self-Monitoring, Analysis and Reporting Technology; often written as SMART) is a health monitoring system for hard-drives and solid-state drives which has been internationally standardised.

The primary function of SMART is to detect and report various indicators of drive reliability, with the intent of anticipating imminent hardware failures, allowing the hardware to be replaced before failure and the integrity of the data to be maintained.

Dell EMC have collaborated with our hard-drive vendors in the interpretation of these values.

 


Description

It has been found, whilst reviewing the SMART status of some enterprise hard-drives, high read and verify ECC (Error Correction Code) correction rates may be observed on some models. If these values are compared against other hard-drives, it may appear to indicate that some particular models of hard-drives have a much higher error rate than others which may report zero ECC corrections. In some cases this error rate will appear as hundreds of millions of ECC corrections and may increment rapidly as more I/O transactions occur.

 

An example of this situation is provided below. This example was gathered by running the command "smartctl -a /dev/sdX" under Linux OS.
SLN316623_en_US__1image(9511)
Figure 1: Error Counter Log

SLN316623_en_US__2icon Note: The Smartctl application is a component of smartmontools, an open-source tool-set for querying the health of physical disks.
SLN316623_en_US__2icon Note: SMART logs which are presented by enterprise hard-drives and are interpreted by third-party utilities such as smartmontools may not represent the actual count of ECC errors for the devices.


 


Solution


The SMART specification allows vendors to provide these counters, such as the ones shown in the above example, for informational purposes. The counters are not necessarily a count of soft or hard faults within the ECC logic. This allows each drive vendor flexibility as to what will be displayed in the available SMART fields. For some vendors, no error data is presented in the ECC read or verify categories. In the example above, the vendor has chosen to use the counters for monitoring the ECC functionality. The values which are presented do not represent an error-rate. Similarily, a higher rate of events on some disks in comparison to others does not indicate that a performance problem exists.
 
SLN316623_en_US__2icon Note: SMART parameters should not be compared across a mixed install-base of vendors or models; only similar models and firmware should be compared.

If you have a specific query in relation to the health counters on a certain model of Enterprise HDD, please contact our support technicians who can work to obtain an answer from Dell EMC's engineering teams.

 


Further Information

  • For more information on the international standardisation of SMART values and other SCSI Storage Interfaces please see the homepage of the T10 technical committee located here.
  • Smartmontools is a set of utility programs to control and monitor computer storage systems using the Self-Monitoring, Analysis and Reporting Technology system built into most modern ATA, Serial ATA, SCSI/SAS and NVMe hard drives. It is not a DellEMC tool. More information on Smartmontools can be found here.


 


 

 

文章屬性


受影響的產品

PowerEdge, C Series, PowerVault, Legacy Storage Models

上次發佈日期

08 10月 2021

版本

4

文章類型

Solution