Avamar: How to Apply the "Avamar troubleshooting hierarchy" Approach Correctly

Summary: This article aims to guide to the correct troubleshooting prioritization when multiple issues are concurrently affecting the Avamar product.

Ez a cikk a következő(k)re vonatkozik: Ez a cikk nem vonatkozik a következő(k)re: Ez a cikk nem kapcsolódik egyetlen konkrét termékhez sem. Ez a cikk nem azonosítja az összes termékverziót.

Instructions

The approach of the Avamar troubleshooting hierarchy:
  • While reviewing the proactive check results and when troubleshooting multiple Avamar issues, understanding how the various issues and failures affect one another.
  • The resolution to many issues depends on underlying operations completing successfully.
For example:
  • If checkpoint validation (hfscheck) is failing, the checkpoint overhead building up on the system increases the operating system capacity utilization.
  • This increase in operating system capacity utilization causes garbage collection to fail once the operating system capacity utilization reaches the disknogc limit.
  • The garbage collection failure, in turn, leads to high GSAN capacity utilization and sooner or later, the system goes into admin mode.
  • The underlying hfscheck issue must be resolved first to free up enough operating system capacity to allow garbage collection to run before resolving the issue with GSAN capacity utilization.
The following is an Avamar "hierarchy of needs."
  • If there are multiple issues, they must be worked through and resolved in the order listed below in order (and return the Avamar grid back into a healthy state).
  • Correcting each issue in this list requires that all the issues above it be resolved first.

Note: Keep this hierarchy in mind whenever working on a grid that has encountered multiple issues.
 
Hierarchy of Avamar Needs:
  • Critical hardware failures
  • Stripes or nodes offline or suspended
  • Checkpoint failures
  • hfscheck failures
  • Operating system capacity issues (high fs-percent-full, freespaceunbalance issues or stripe pool exhaustion)
  • Garbage collection failing or failing to run
  • High GSAN capacity utilization
  • Capacity-related backup failures (For example, where the server has reached the diskreadonly threshold)

Once the above hierarchy has been satisfied, additional considerations may be present (for example, a long-running hfscheck may cause operational issues on the system) but it is always critical to move down through this hierarchy first in order to ensure that a sick grid becomes a healthy grid as quickly as possible.

Here is a graphical representation of which type of issue must be addressed first based on the Avamar priority, starting from the bottom and working upwards:

Graphical representation of Avamar Troubleshooting Hierarchy

Additional Information

Érintett termékek

Avamar

Termékek

Avamar
Termék tulajdonságai
Article Number: 000013832
Article Type: How To
Utoljára módosítva: 31 okt. 2025
Version:  7
Választ kaphat kérdéseire más Dell-felhasználóktól
Támogatási szolgáltatások
Ellenőrizze, hogy a készüléke rendelkezik-e támogatási szolgáltatással.