Article Number: 000019552
2017-06-09 23:00:25 avtar Info <5586>: Loading cache files from C:\Program Files\avs\var 2017-06-09 23:00:25 avtar Info <8650>: Opening filename cache file 'C:\Program Files\avs\var\f_cache2.dat' 2017-06-09 23:00:25 avtar Info <5573>: - Loaded filename cache file (6,532,792 bytes) 2017-06-09 23:00:26 avtar Info <8650>: Opening hash cache file 'C:\Program Files\avs\var\p_cache.dat' 2017-06-09 23:00:28 avtar Info <5573>: - Loaded hash cache file (402,653,728 bytes) 2017-06-09 23:01:01 avtar Info <6426>: Done loading cache files
2017-06-09 23:04:32 avtar Info <19008>: Obtaining available VSS providers 2017-06-09 23:04:32 avtar Info <8776>: Freezing volumes now... 2017-06-09 23:04:32 avtar Info <8780>: Creating the shadow copy set (DoSnapshotSet) ... 2017-06-09 23:14:33 avtar Info <8781>: Shadow copy set successfully created. 2017-06-09 23:14:34 avtar Info <6074>: VSS snapshot set creation successful
Taking the above stages we split them into 'phases' which have the greatest impact on backup performance:
Phase 0. Create VSS snapshots.
The Volume Shadowcopy Service (VSS) creates snapshots of volumes specified within the source dataset. Applications can continue to write to the volume whilst the backup runs.
Avamar backs up the read-only 'frozen' snapshot of the volume rather than the write-able volume. This ensures it has a consistent set of data to back up.
VSS snapshots take seconds to complete. If a client is experiencing VSS issues, this delay or prevents the backup from proceeding.
Phase 1. File scan phase. The avtar process stats all files in the target dataset
For clients with millions of files, this phase may be the most time consuming.
Database data contains few, larger files so the file scan phase takes little time. Database clients typically consume their time during phase #2.
For a client with rotational disks in RAID 5 configuration, file scan performance of ~1 million files per hour is typical. This varies from 300,000 to 3 million per hour. It depends on the client environment and the characteristics of the data being backed up.
From v7.3, Linux clients backing up to Data Domain can take advantage of Linux Fast Incremental (LFI) functionality. This avoids scanning the entire dataset each time the backup runs.
Critical resources: random-seek performance of the disk where the backup data is stored.
Phase 2. Avtar reads changed files and then chunks, compresses, and hashes the data.
A lot of computation occurs during this phase. For each modified or new file, avtar breaks it into small chunks. It compresses each chunk and calculates a hash as a 'fingerprint' to identify the chunk.
Critical resources: Client disk and CPU
For LAN backups where there are no bottlenecks in sending data to the Avamar server, phases #1 and #2 take the most time.
In the following chart, consider that the amount of area in the bars of the graph corresponds to how long the backup takes. Changed files can drastically increase the amount of time required, especially if those files are large.
For file system datasets, expect ~0-3% of files to change on a daily basis.
Avtar must 'stat()' each file that changes by performing two I/O operations, one to check the file attributes, another for the security attributes.
To achieve the benchmark scan rate performance of one ~1 million files/hour for file system backups avtar requires approximately two million seek operations per hour, or 600 seek operations per second.
For example: If a backup has a 3% change rate, 97 out of a 100 files require two disks seek operations in order to identify whether they changed. The remaining three which did change, must be scanned, chunked, compressed, and hashed.
This considers only the file scan phase and does not take into account I/O resources required for processing any files which were modified.
The more data within the modified files, the more work is needed to complete the backup.
Phase 3. Checking the existence of hashes on the Avamar server
Phases #1 and #2 produce hashes which point to elements of the backup. These elements could be unique file chunks, file systems or entire backups.
Avamar Client
06 Feb 2024
5
How To