Data Domain: Hardware Healthcheck
Summary: This article provides detailed instructions on using the support healthcheck hardware command to perform comprehensive diagnostics on Data Domain systems.
Instructions
# support healthcheck hardware
The support healthcheck hardware command is designed to initiate a comprehensive hardware health check on Data Domain systems. The command conducts thorough diagnostics on various hardware components, including storage disks, power supply units, fans, SAS controllers, QAT, NVRAM, DIMMs, I/O cards, and CPUs. The output provides the status of each component. If any component fails the check, the output includes detailed information and recommended remediation steps.
Supported Platforms and Versions.
This command is compatible with the following platforms:
- DD6400
- DD6410
- DD6900
- DD9400
- DD9900
- DD9410
- DD9910
This command is supported on the following versions:
- DDOS 7.10.1.40 and later
- DDOS 7.13.1.10 and later (LTS)
- DDOS 8.1.0.10 and later
How to run this command:
To run this command, log in to the Data Domain system as an admin user using SSH or a direct console connection. Enter the command support healthcheck hardware. The process may take up to a minute to complete, during which a message indicates that the health check is in progress. The results are displayed on the CLI, showing the status of each hardware component. A detailed report is saved in a file located at /ddr/var/log/healthcheck/.
If any issues are identified, generate a complete Support Bundle and upload it for review by the Data Domain Support team. Follow this article for instructions on collecting the Support Bundle, Data Domain: How to collect and upload a support bundle (SUB) from a Data Domain Restorer (DDR) (Log in to Dell Support as a registered user is required to view this article.)
Starting with DDOS 8.4.0, the hardware health check framework has been enhanced into a comprehensive system health check feature.
This functionality can now be accessed using the "system health check" command, providing a unified way to assess overall system health beyond just hardware components.