Knowledge Base

How to Troubleshoot SMART Errors on a Dell PowerEdge™ RAID Controller (PERC)



Article Summary: This article provides information on how to troubleshoot SMART Errors on a Dell PowerEdge RAID Controller



Issue 1:

If a hot spare is present, the rebuild starts with the hot spare after the drive is forced offline. SMART errors If a hot spare is present, the rebuild starts with the hot spare after the drive is forced offline.
The Self-Monitoring Analysis and Reporting Technology (SMART) feature monitors the internal performance of all motors, heads, and physical disk electronics to detect predictable physical disk failures. The SMART feature helps monitor physical disk performance and reliability.

SMART-compliant physical disks have attributes for which data (values) can be monitored to identify changes in values and determine whether the values are within threshold limits. Many mechanical and electrical failures display some degradation in performance before failure.

A SMART failure is also referred to as a predicted failure. There are numerous factors that relate to predicted physical disk failures, such as a bearing failure, a broken read/write head, and changes in spin-up rate. In addition, there are factors related to read/write surface failure, such as seek error rate and excessive bad sectors. For more information, click (Here).

Issue 2:

Drive carrier status LED's

Green/amber flashing (Green On 500 ms / Amber On 500 ms, Off 1000 ms)
Predicted failure reported by drive.

Issue 3: A SMART error is detected on a physical disk in a redundant virtual disk

Perform the following steps:

  1. Force the physical disk offline.
  2. Replace it with a new physical disk of equal or higher capacity. Perform the Replace Member operation. The Replace Member operation allows you to copy data from a source physical disk of a virtual disk to a target physical disk that is not a part of the virtual disk.
Note: If a hot spare is present, the rebuild starts with the hot spare after the drive is forced offline.

Issue 4:

A SMART error is detected on a physical disk in a non-redundant virtual disk

Perform the following steps:
  1. Back up your data.
  2. Use Replace Member or set up a global hot spare to replace the disk automatically.
  3. Replace the affected physical disk with a new physical disk of equal or higher capacity.
  4. Restore from the backup.

Issue 5:

A SMART error occurs during a consistency check (CC)


Specify how the CC operation must perform when a SMART error is encountered. There are two settings, Yes and No. No is the default setting and allows CC to continue when the first error is encountered. The Yes setting halts CC when the first error is encountered. Events are generated in the Event Log when errors are encountered during CC.



Quick Tips content is self-published by the Dell Support Professionals who resolve issues daily. In order to achieve a speedy publication, Quick Tips may represent only partial solutions or work-arounds that are still in development or pending further proof of successfully resolving an issue. As such Quick Tips have not been reviewed, validated or approved by Dell and should be used with appropriate caution. Dell shall not be liable for any loss, including but not limited to loss of data, loss of profit or loss of revenue, which customers may incur by following any procedure or advice set out in the Quick Tips.

Article ID: SLN156053

Last Date Modified: 07/30/2015 01:49 PM


Rate this article

Accurate
Useful
Easy to understand
Was this article helpful?
Yes No
Send us feedback
Comments cannot contain these special characters: <>()\
Sorry, our feedback system is currently down. Please try again later.

Thank you for your feedback.