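Avamar: RMCP Does Not Remove Checkpoints

Summary: This article describes the behavior observed when checkpoints are not removed from Avamar even though checkpoint validation is successful.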

Symptoms

During maintenance activities, checkpoints are not removed. In addition, if Avamar is integrated with Data Domain, the snapshots do not expire.

admin@av-srv-prod:~/>: cplist --full
cp.20241021171415 Mon Oct 21 13:14:15 2024   valid --- del  nodes   1/1 stripes    277
cp.20241022164600 Tue Oct 22 12:46:00 2024   valid rol del  nodes   1/1 stripes    277
cp.20241022171838 Tue Oct 22 13:18:38 2024   valid --- del  nodes   1/1 stripes    277
cp.20241022193333 Tue Oct 22 15:33:33 2024   valid rol del  nodes   1/1 stripes    277
cp.20241024164621 Thu Oct 24 12:46:21 2024   valid rol ---  nodes   1/1 stripes    277
cp.20241024171054 Thu Oct 24 13:10:54 2024   valid --- ---  nodes   1/1 stripes    277
admin@av-srv-prod:~/>:
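
To see at a glance which checkpoints are still waiting to be removed, the listing can be filtered. The following is a minimal sketch, assuming the `cplist --full` column layout shown above (tag, five date fields, validity, hfscheck flag, delete flag) and assuming that `del` in the ninth column marks a checkpoint as pending removal. The function name `pending_delete` is hypothetical.

```shell
# Sketch: print the tags of checkpoints whose ninth column is "del".
# Assumes the column layout of the `cplist --full` output shown above.
pending_delete() {
    awk '$1 ~ /^cp\.[0-9]+$/ && $9 == "del" { print $1 }'
}

# Example (on the Avamar server):
#   cplist --full | pending_delete
```

If the same tags keep appearing after several maintenance windows, rmcp is probably not completing.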

With the mccli command, several checkpoints that were validated (rolling HFS check) are displayed as "Failed":

admin@av-srv-prod:~/>: mccli checkpoint show --verbose
0,23000,CLI command completed successfully.
Tag               Time                    Validated Deletable Nodes Stripes Validation Start Time   Validation Finished Time Errors
----------------- ----------------------- --------- --------- ----- ------- ----------------------- ------------------------ ------
cp.20241021171415 2024-10-21 13:14:15 EDT           No        1     277     Not Validated           Not Validated            N/A
cp.20241022164600 2024-10-22 12:46:00 EDT Failed    No        1     277     2024-10-22 12:53:44 EDT 2024-10-22 13:09:46 EDT  1
cp.20241022171838 2024-10-22 13:18:38 EDT           No        1     277     Not Validated           Not Validated            N/A
cp.20241022193333 2024-10-22 15:33:33 EDT Failed    No        1     277     2024-10-22 15:42:07 EDT 2024-10-22 15:56:48 EDT  1
cp.20241024164621 2024-10-24 12:46:21 EDT Failed    No        1     277     2024-10-24 12:53:09 EDT 2024-10-24 13:08:04 EDT  1
cp.20241024171054 2024-10-24 13:10:54 EDT           No        1     277     Not Validated           Not Validated            N/A
admin@av-srv-prod:~/>: 
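
The mismatch between `cplist` (valid) and `mccli` (Failed) is easier to spot when the failed rows are extracted. This is a rough sketch, assuming each timestamp in the `mccli checkpoint show --verbose` output is three tokens (date, time, zone), so that the Validated value is the fifth field. The function name `failed_validations` is hypothetical.

```shell
# Sketch: print the tags that mccli reports with Validated = "Failed".
# Rows with an empty Validated column have "No" (Deletable) as field 5
# and are skipped.
failed_validations() {
    awk '$1 ~ /^cp\./ && $5 == "Failed" { print $1 }'
}

# Example (on the Avamar server):
#   mccli checkpoint show --verbose | failed_validations
```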

When the remove checkpoint (rmcp) command is run, no checkpoints are removed:

admin@av-srv-prod:~/>: avmaint rmcp --full --ava
<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<checkpointrmlist has-approved-checkpoint="false">
  <checkpoint
    tag="cp.20241021171415"
    deleted="false"
    ddr-deleted="false"/>
  <checkpoint
    tag="cp.20241022164600"
    deleted="false"
    ddr-deleted="false"/>
  <checkpoint
    tag="cp.20241022171838"
    deleted="false"
    ddr-deleted="false"/>
  <checkpoint
    tag="cp.20241022193333"
    deleted="false"
    ddr-deleted="false"/>
  <checkpoint
    tag="cp.20241024164621"
    deleted="false"
    ddr-deleted="false"/>
  <checkpoint
    tag="cp.20241024171054"
    deleted="false"
    ddr-deleted="false"/>
</checkpointrmlist>
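
The XML can be summarized with a small filter instead of being read entry by entry. This is a sketch only, assuming the output keeps one attribute per line as shown above. It counts the `deleted` attribute and ignores `ddr-deleted`. The function name `rmcp_summary` is hypothetical.

```shell
# Sketch: count checkpoints reported as removed vs. kept by rmcp.
# Matches only lines that begin with deleted="..."; the ddr-deleted
# lines do not match because of the leading "ddr-".
rmcp_summary() {
    awk '
        /^[[:space:]]*deleted="true"/  { removed++ }
        /^[[:space:]]*deleted="false"/ { kept++ }
        END { printf "removed=%d kept=%d\n", removed, kept }
    '
}

# Example (on the Avamar server):
#   avmaint rmcp --full --ava | rmcp_summary
```

For the output above, every entry is `deleted="false"`, so the removed count would be zero.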

On the Data Domain, snapshots do not expire automatically. They must be expired manually:

avboost@dd-srv-prod# snapshot list mtree /data/col1/avamar-1234567890
Snapshot Information for MTree: /data/col1/avamar-1234567890
----------------------------------------------
Name                Pre-Comp (GiB)   Create Date         Retain Until        Status
-----------------   --------------   -----------------   -----------------   -------
cp.20241015171741          69287.4   Oct 15 2024 13:19   Oct 22 2024 13:13   expired
cp.20241015194118          69287.4   Oct 15 2024 15:43   Oct 22 2024 13:13   expired
...
...
cp.20241020164654          65247.4   Oct 20 2024 12:49
cp.20241020171602          65262.9   Oct 20 2024 13:18
cp.20241021164757          65257.4   Oct 21 2024 12:50
cp.20241021171415          65272.9   Oct 21 2024 13:16
cp.20241022164600          65280.0   Oct 22 2024 12:48
-----------------   --------------   -----------------   -----------------   -------
...
avboost@dd-srv-prod# 
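
When the snapshot list is long, the entries that have not yet expired can be picked out from a saved copy of the output. This is a minimal sketch, assuming the listing format shown above, where an expired snapshot ends with the word `expired` and an unexpired one ends with its creation time. The function name `unexpired_snapshots` and the file name in the example are hypothetical.

```shell
# Sketch: print checkpoint snapshots whose last column is not "expired".
# Header, separator, and "..." lines are skipped because they do not
# start with a cp.<digits> tag.
unexpired_snapshots() {
    awk '$1 ~ /^cp\.[0-9]+$/ && $NF != "expired" { print $1 }'
}

# Example, with the Data Domain output saved to a local file:
#   unexpired_snapshots < dd_snapshots.txt
```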

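Another observed behavior is that commands run slowly on the Avamar server. The load average remains high even though the server is not running any jobs or backups.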
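
One way to put a number on "high" is to compare the load average with the number of CPU cores. The sketch below uses a common rule of thumb (load above the core count suggests saturation). That threshold is an assumption, not a value from this article, and the function name `load_exceeds_cores` is hypothetical.

```shell
# Sketch: succeed (exit 0) when the load average is above the core count.
#   $1 = load average, $2 = number of CPU cores
load_exceeds_cores() {
    awk -v load="$1" -v cores="$2" 'BEGIN { exit !(load + 0 > cores + 0) }'
}

# Example on a Linux host:
#   read -r one_min _ < /proc/loadavg
#   if load_exceeds_cores "$one_min" "$(nproc)"; then
#       echo "load average $one_min exceeds core count"
#   fi
```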

Cause

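Several factors can cause this behavior. In each case, the problem was found only after a thorough analysis of the processes running on the Avamar server (using top or ps -ef). Some scenarios include:

  • Old Perl processes
  • Stale custom replication scripts
  • Custom reports
  • Old avtar processes

Evidence found in some of these scenarios: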

admin    15007  0.0  0.0   9664  2812 ?        Ss    2023   0:00 bash -c export TERM=${TERM:-dumb} ; /usr/bin/ssh-agent /tmp/dpnctl-run-self.14963.aux
admin    15042  0.0  0.0   9528  2192 ?        S     2023   0:00  \_ /bin/bash /tmp/dpnctl-run-self.14963.aux
admin    15043  0.0  0.0  30792   680 ?        Ss    2023   0:52      \_ /usr/bin/ssh-agent /tmp/dpnctl-run-self.14963.aux
admin    15049 99.6  0.1  81996 39340 ?        R     2023 272656:21      \_ /usr/bin/perl /usr/local/avamar/bin/dpnctl --rerun --mcs_user=root stop 
admin    26975     1  0  80   0 -  3440 -      Oct08 ?        00:00:00 bash -c ./avReplication.40 --report --csv --quiet
admin    27290 25935  0  80   0 -  3440 -      Oct08 ?        03:55:24 bash -c ./avReplication.40 --quiet --report --short-status
admin    27761 26975  0  80   0 -  3440 -      Oct08 ?        03:50:39 bash -c ./avReplication.40 --report --csv --quiet
root      9046  0.0  0.0 314212  6792 ?        SNl  Nov08   0:00 /usr/local/avamar/bin/avtar.bin --vardir=/usr/local/avamar/var --bindir=/usr/local/avamar/bin --sysdir=/usr/local/avamar/etc --sysdir="/usr/l
root     20385  0.0  0.0 314212  6624 ?        SNl  Nov08   0:00 /usr/local/avamar/bin/avtar.bin --vardir=/usr/local/avamar/var --bindir=/usr/local/avamar/bin --sysdir=/usr/local/avamar/etc --sysdir="/usr/l
root     22784  0.0  0.0 314212  6544 ?        SNl  Nov08   0:00 /usr/local/avamar/bin/avtar.bin --vardir=/usr/local/avamar/var --bindir=/usr/local/avamar/bin --sysdir=/usr/local/avamar/etc --sysdir="/usr/l
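
Stale processes like these can be shortlisted by age. The following is a sketch under stated assumptions: the input is `PID ELAPSED_SECONDS USER COMMAND...` per line, and the name patterns (`perl`, `avtar`, `avReplication`) are taken from the evidence above and should be adjusted for the server at hand. The function name `stale_candidates` is hypothetical, and the `etimes` column used in the example is not available in every `ps` build.

```shell
# Sketch: list processes at least N days old that match known patterns.
#   stdin: "PID ELAPSED_SECONDS USER COMMAND..." (one process per line)
#   $1:    minimum age in days
# Prints: PID, user, age in whole days, and the first word of the command.
stale_candidates() {
    awk -v days="$1" '
        $2 + 0 >= days * 86400 && /perl|avtar|avReplication/ {
            print $1, $3, int($2 / 86400) "d", $4
        }
    '
}

# Example, where ps supports the etimes column:
#   ps -eo pid=,etimes=,user=,args= | stale_candidates 7
```

The output is only a list of candidates to review. It does not decide which processes are safe to terminate.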

 

Resolution

1. Log in to the Avamar server as admin and switch user to root:

su -

2. Run the following commands to analyze the processes thoroughly:

top
ps aux --forest
ps -ef

 

Warning: If in any doubt, do not terminate any process.

 

3. Once the processes are identified, terminate them using the process ID (PID):

kill <pid>

4. If the process does not terminate, force it:

kill -9 <pid>
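
Steps 3 and 4 can be combined into one helper that tries a normal termination first and forces it only if the process is still present after a grace period. This is a sketch, not a supported tool: the function name `stop_pid` and the default 10-second grace period are assumptions. The warning above still applies, so run it only against a PID that has already been reviewed.

```shell
# Sketch: send SIGTERM, wait up to N seconds, then send SIGKILL.
#   $1 = PID, $2 = grace period in seconds (default 10)
stop_pid() {
    local pid=$1 wait_secs=${2:-10} i=0
    # If the signal cannot be sent, the process is gone or not ours to signal.
    kill "$pid" 2>/dev/null || return 0
    while [ "$i" -lt "$wait_secs" ]; do
        kill -0 "$pid" 2>/dev/null || return 0   # exited after SIGTERM
        sleep 1
        i=$((i + 1))
    done
    kill -9 "$pid" 2>/dev/null                   # still present: force it
}

# Example:
#   stop_pid 15049 30
```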

5. Commands should begin to respond faster again.

6. Run rmcp:

avmaint rmcp --full --ava

7. The following two commands display the checkpoints correctly again:

cplist --full
mccli checkpoint show --verbose

Example:

admin@av-srv-prod:~/>: cplist --full
cp.20241024164621 Thu Oct 24 12:46:21 2024   valid rol ---  nodes   1/1 stripes    277
cp.20241024171054 Thu Oct 24 13:10:54 2024   valid --- ---  nodes   1/1 stripes    277
admin@av-srv-prod:~/>: 
admin@av-srv-prod:~/>: mccli checkpoint show --verbose
0,23000,CLI command completed successfully.
Tag               Time                    Validated Deletable Nodes Stripes Validation Start Time   Validation Finished Time Errors
----------------- ----------------------- --------- --------- ----- ------- ----------------------- ------------------------ ------
cp.20241024164621 2024-10-24 12:46:21 EDT Validated No        1     277     2024-10-24 12:53:09 EDT 2024-10-24 13:08:04 EDT  0
cp.20241024171054 2024-10-24 13:10:54 EDT           No        1     277     Not Validated           Not Validated            N/A
admin@av-srv-prod:~/>: 

8. Verify that the snapshots on the Data Domain show the "expired" status.

Affected Products

Avamar, Avamar Server
Article Properties
Article Number: 000255751
Article Type: Solution
Last Modified: 16 Apr 2025
Version:  3