Data Domain: Too Frequent Cleaning Causing Poor Data Locality

Summary: Cleaning too often can cause fragmentation (poor locality), which degrades backup, restore, and cleaning performance.


Symptoms

  • Customer reports cleaning is taking too long or appears to be stuck.
  • Poor cache hit rate
  • High level of disk utilization
  • Poor restore and/or backup performance

Cause

This article helps diagnose cases of slow cleaning/garbage collection (GC) caused by poor file locality. Check other possible causes using the slow cleaning troubleshooting document (linked below). This is one step in that troubleshooting procedure.

Resolution

The following steps are ordered in the most appropriate sequence to isolate the issue and identify the proper resolution. Do not skip a step.
  1. Although too-frequent cleaning can lead to file fragmentation, cleaning every day may be appropriate if the files have a short retention period such as 24 hours.
    Therefore, consider whether the system holds many old files while cleaning is scheduled more often than once per week. If so, the system may be experiencing poor file locality.
  2. Next, check the efficiency of the cleaning copy phase, reported as the percentage of live data in the copied containers. If this percentage is low, the retention period is likely short, and the copy phase reclaims space efficiently because little live data has to be copied forward. This information can be found by searching for "GC copy phase details" in a recent autosupport (ASUP).
    GC copy phase details:                                   Recent    Cumulative
    Number of containers selected:                         13091499     13091499
    Number of containers copied:                           13091499     13091499
    Number of GC containers copied:                         6131917      6131917
    Number of containers generated:                         4173221      4173221
    Number of Data containers copied:                       6959582      6959582
    Percentage of selected copied:                            100.0%       100.0%
    Percentage of live data in copied:                         31.9%        31.9%	<==
    Copy selected speed in containers/second:                 724.2        724.2
    Space freed in MiB:                                  40132251.0   40132251.0
  3. Check whether the time spent in the pre-enumeration GC phase is trending upward from one cleaning run to the next, which implies that locality is becoming worse. This information can be found by searching for "GC stats for Physical Cleaning" in a recent ASUP.
    GC stats for Physical Cleaning on Active Success 1 Aborted 0 Skipped 0
    Most recent successful GC container range: 575 to 442476459
    
    GC phase:        pre-merge    time:     566 average:     566    seg/s:        0    cont/s:   23129
    GC phase:     pre-analysis    time:    6576 average:    6576    seg/s:        0    cont/s:    1990
    GC phase:  pre-enumeration    time:    7722 average:    7722    seg/s: 27104949    cont/s:    1695	<==
    GC phase:       pre-select    time:    6292 average:    6292    seg/s: 18046557    cont/s:   16754
    GC phase:             copy    time:   18078 average:   18078    seg/s:        0    cont/s:     724
    GC phase:          summary    time:    8059 average:    8059    seg/s:        0    cont/s:    1624
  4. Review physical space usage trends.
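
The ASUP checks in steps 2 and 3 can be scripted. The following is a minimal Python sketch, not a Dell-provided tool. It assumes the ASUP text has exactly the layout of the excerpts above, and the function names (parse_copy_phase, parse_gc_phases, is_trending_up) are hypothetical. It extracts the live-data percentage and the per-phase times, and checks whether a series of pre-enumeration times gathered from successive ASUPs keeps increasing. No thresholds are applied, because this article does not define any.

```python
import re

# Sample text copied from the ASUP excerpts in steps 2 and 3 of this article.
SAMPLE = """\
GC copy phase details:                                   Recent    Cumulative
Number of containers selected:                         13091499     13091499
Number of containers copied:                           13091499     13091499
Number of GC containers copied:                         6131917      6131917
Number of containers generated:                         4173221      4173221
Number of Data containers copied:                       6959582      6959582
Percentage of selected copied:                            100.0%       100.0%
Percentage of live data in copied:                         31.9%        31.9%  <==
Copy selected speed in containers/second:                 724.2        724.2
Space freed in MiB:                                  40132251.0   40132251.0

GC phase:        pre-merge    time:     566 average:     566    seg/s:        0    cont/s:   23129
GC phase:     pre-analysis    time:    6576 average:    6576    seg/s:        0    cont/s:    1990
GC phase:  pre-enumeration    time:    7722 average:    7722    seg/s: 27104949    cont/s:    1695  <==
GC phase:       pre-select    time:    6292 average:    6292    seg/s: 18046557    cont/s:   16754
GC phase:             copy    time:   18078 average:   18078    seg/s:        0    cont/s:     724
GC phase:          summary    time:    8059 average:    8059    seg/s:        0    cont/s:    1624
"""

# "label:   recent   cumulative", where values may carry a trailing "%".
COPY_LINE = re.compile(r"^(.+?):\s+(\d+(?:\.\d+)?)%?\s+(\d+(?:\.\d+)?)%?")
# One "GC phase:" line; only the phase name, time, and average are captured.
PHASE_LINE = re.compile(r"GC phase:\s+(\S+)\s+time:\s+(\d+)\s+average:\s+(\d+)")


def parse_copy_phase(text):
    """Return {label: (recent, cumulative)} from the 'GC copy phase details' block."""
    stats = {}
    in_block = False
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("GC copy phase details"):
            in_block = True
            continue
        if not in_block:
            continue
        match = COPY_LINE.match(line)
        if match is None:
            break  # first non-matching line (for example a blank line) ends the block
        stats[match.group(1)] = (float(match.group(2)), float(match.group(3)))
    return stats


def parse_gc_phases(text):
    """Return {phase name: {'time': int, 'average': int}} for every 'GC phase:' line."""
    return {
        m.group(1): {"time": int(m.group(2)), "average": int(m.group(3))}
        for m in PHASE_LINE.finditer(text)
    }


def is_trending_up(times):
    """True if each value is larger than the one before it (needs at least two values)."""
    return len(times) >= 2 and all(b > a for a, b in zip(times, times[1:]))


if __name__ == "__main__":
    copy_stats = parse_copy_phase(SAMPLE)
    phases = parse_gc_phases(SAMPLE)
    print("Live data in copied (recent, cumulative):",
          copy_stats["Percentage of live data in copied"])
    print("Pre-enumeration time:", phases["pre-enumeration"]["time"])
    # In practice, collect one pre-enumeration time per ASUP, oldest first,
    # and pass the list to is_trending_up(); the sample holds a single run.
```

Requiring a strictly increasing series is a deliberately simple criterion chosen for this sketch; a real review would look at more runs and tolerate noise. As a side observation on the sample in step 2, containers generated divided by containers copied (4173221 / 13091499) comes to about 31.9%, which matches the reported live-data percentage. That is an observation on these numbers only, not a documented formula.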
If your problem persists after performing the steps in this article, contact your contracted support provider.

Affected Products

Data Domain

Article Properties
Article Number: 000028842
Article Type: Solution
Last Modified: 12 Jan 2026
Version:  6