Avamar:Data Domain:由于 AVE 部署不正确,无法创建检查点备份

Summary: 由于 AVE 部署不正确,未能创建检查点备份。

This article applies to This article does not apply to This article is not tied to any specific product. Not all product versions are identified in this article.

Symptoms

AVE 部署不正确

  • 无法创建检查点备份
  • 所有维护作业均已成功完成
  • 已确认 Av 和 DD 之间的连接
    mccli event show | grep -i "failed to create"
    71546 2019-02-21 09:09:31 CET ERROR       31034 SYSTEM      PROCESS  /                          Failed to create a checkpoint backup.
    71229 2019-02-20 09:09:22 CET ERROR       31034 SYSTEM      PROCESS  /                          Failed to create a checkpoint backup.
    70926 2019-02-19 09:26:29 CET ERROR       31034 SYSTEM      PROCESS  /                          Failed to create a checkpoint backup.
    70340 2019-02-18 09:10:41 CET ERROR       31034 SYSTEM      PROCESS  /                          Failed to create a checkpoint backup.
  • 在 cpbackup 日志中针对某个数据分区上某个 cp 返回的错误:data07
    admin@*****:/usr/local/avamar/var/client/>: less cpbackup-cp.20190221080716-28111.log
    .
    .
    [Thu Feb 21 09:09:17 2019] Backup data07 finished in 00:00:01.
    [Thu Feb 21 09:09:17 2019] Cleanup backup for data07
    [Thu Feb 21 09:09:17 2019] Backup data07 returned with exit code 158
    [Thu Feb 21 09:09:17 2019] Execute: ps -o pid,ppid,cmd --no-headers --ppid 28226 || true
    > [Thu Feb 21 09:09:17 2019] Killing all child processes: 28226
    > [Thu Feb 21 09:09:17 2019] Killing PIDs (28226) with signal 15.
    [Thu Feb 21 09:09:17 2019] Sleeping for 10 seconds after killing processes...
    [Thu Feb 21 09:09:27 2019] Execute: ps -o pid,ppid,cmd --noheaders --pid 28226 || true
    [Thu Feb 21 09:09:27 2019] Execute output:
    28226 28111 [sh] <defunct>
    .
    .
    [Thu Feb 21 09:09:28 2019] Backup data06 finished in 00:00:12.
    [Thu Feb 21 09:09:28 2019] Cleanup backup for data06
    [Thu Feb 21 09:09:28 2019] Backup data06 returned with exit code 158
    [Thu Feb 21 09:09:28 2019] Finished backing up files in 00:00:12.
    [Thu Feb 21 09:09:28 2019] Execute: /usr/local/avamar/bin/mccli event publish --code=31034 --attribute="checkpoint" --value="
    cp.20190221080716" --attribute="logfile" --value="/space/avamar/var/client/cpbackup-cp.20190221080716-28111.log" --attribute=
    "cache" --value="OK" --attribute="data06 elapsed" --value="00:00:12" --attribute="data06 fail" --value="Exit code: 158. Signa
    l: 0." --attribute="data07 elapsed" --value="00:00:01" --attribute="data07 fail" --value="Exit code: 158. Signal: 0." --attri
    bute="max ddr streams" --value="6" --attribute="max parallel avtars" --value="2" --attribute="parallel running avtars" --valu
    e="2" --attribute="pass thru flags" --value="--id=root --ap=******** --hfsaddr=***** --hfsport=27000" --attribute="total
    elapsed time" --value="00:00:12" --attribute="volumes" --value="data01 data02 data03 data04 data05 data06 data07"
     
    [Thu Mar 22 09:10:03 2018] Execute: /usr/local/avamar/bin/mccli event publish --code=31034 --attribute="checkpoint" --value="cp.20180322150523" --attribute="logfile" --value="/data01/avamar/var/cpbackup-cp.20180322150523-34518.log" 
    --attribute="abort reason" --value="Flag --max-ddr-streams must be greater than zero" --attribute="cache" --value="OK" --attribute="max ddr streams" --value="0" --attribute="max parallel avtars" --value="2" --attribute="pass thru flags" --value="--id=root --ap=******** --hfsaddr=avamar" --attribute="volumes" --value="data01 data02 data03"
     abort reason        Flag --max-ddr-streams must be greater than zero
    [Thu Feb 21 09:09:31 2019] Execute output:
    0,23000,CLI command completed successfully.
     Attribute               Value
     ----------------------- -------------------------------------------------------------
     checkpoint              cp.20190221080716
     cache                   OK
     parallel running avtars 2
     logfile                 /space/avamar/var/client/cpbackup-cp.20190221080716-28111.log
     max ddr streams         6
     pass thru flags         --id=root --ap=******** --hfsaddr=***** --hfsport=27000
     volumes                 data01 data02 data03 data04 data05 data06 data07
     data06 fail             Exit code: 158. Signal: 0.
     total elapsed time      00:00:12
     data07 elapsed          00:00:01
     max parallel avtars     2
     data06 elapsed          00:00:12
     data07 fail             Exit code: 158. Signal: 0.
  • 查看报告的 cpbackup 失败事件
    mccli event show --id=71546
    0,23000,CLI command completed successfully.
    Attribute   Value                                                                                                            
    ----------- ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
    ID          71546
    Date        2019-02-21 09:09:31 CET
    Type        ERROR
    Code        31034
    Category    SYSTEM
    Severity    PROCESS
    Domain      /
    Summary     Failed to create a checkpoint backup.
    SW Source   MCS:BS
    For Whom    All Users
    HW Source   *****
    Description Failed to create a checkpoint backup.
    Remedy      No action required.
    Notes       N/A
    Data        <data><entry key="checkpoint" type="text" value="cp.20190221080716" version="1"/><entry key="cache" type="text" value="OK" version="1"/><entry key="parallel running avtars" type="text" value="2" version="1"/><entry key="logfile" type="text" value="/space/avamar/var/client/cpbackup-cp.20190221080716-28111.log" version="1"/><entry key="max ddr streams" type="text" value="6" version="1"/><entry key="pass thru flags" type="text" value="--id=root --ap=******** --hfsaddr=***** --hfsport=27000" version="1"/><entry key="volumes" type="text" value="data01 data02 data03 data04 data05 data06 data07" version="1"/><entry key="requestor" type="xml" value="&lt;requestor domain=&quot;/&quot; product=&quot;NONE&quot; role=&quot;Administrator&quot; user=&quot;MCUser&quot;/&gt;" version=""/><entry key="data06 fail" type="text" value="Exit code: 158. Signal: 0." version="1"/><entry key="total elapsed time" type="text" value="00:00:12" version="1"/><entry key="data07 elapsed" type="text" value="00:00:01" version="1"/><entry key="max parallel avtars" type="text" value="2" version="1"/><entry key="data06 elapsed" type="text" value="00:00:12" version="1"/><entry key="data07 fail" type="text" value="Exit code: 158. Signal: 0." version="1"/></data>
  • 检查其中一个受影响数据分区的 cpbackup 日志文件,发现 data06 和 data07 的 cpbackup 日志中由于 diskfull 而导致服务器为只读的消息
    admin@*****:/usr/local/avamar/var/client/>: less cpbackup-cp.20190221080716-data06.log
    .
    .
    
    2019-02-21 09:09:16 avtar Info <5554>: Connecting to one node in each datacenter
    2019-02-21 09:09:16 avtar Info <5993>: - Connect: Connected to 172.27.7.3:29000, Priv=0, SSL Cipher=AES256-SHA
    2019-02-21 09:09:16 avtar Info <5993>: - Datacenter 0 has 1 nodes: Connected to 172.27.7.3:29000, Priv=0, SSL Cipher=AES256-SHA
    2019-02-21 09:09:16 avtar Info <42862>: - Server is in read-only mode due to diskfull
    2019-02-21 09:09:16 avtar Info <17972>: - Server is in Read-only mode.
    2019-02-21 09:09:16 avtar FATAL <8604>: Fatal server connection problem, aborting initialization. Verify correct server address and login credentials.
    2019-02-21 09:09:16 avtar FATAL <8941>: Fatal server connection problem, aborting initialization. Verify correct server address and login credentials.
    2019-02-21 09:09:16 avtar Info <6149>: Error summary: 2 errors: 8604, 8941
    2019-02-21 09:09:16 avtar Info <6645>: Not sending wrapup anywhere.
    2019-02-21 09:09:16 avtar Info <5314>: Command failed (2 errors, exit code 10008: cannot establish connection with server (possible network or DNS failure))
  • 运行 fs-perc 显示 data01 与其他分区之间的很大差异:
    avmaint nodelist |grep -i "fs-perc"
     fs-percent-full="37.0"
     fs-percent-full="8.8"
     fs-percent-full="8.7"
     fs-percent-full="8.7"
     fs-percent-full="8.8"
     fs-percent-full="8.7"
     fs-percent-full="8.7"
  • 运行 df -h 显示 data01 的大小比其他分区小,但已满 37%,而其他分区已满 9%
    df -h
    
    Filesystem      Size  Used Avail Use% Mounted on
    /dev/sda2        16G  5.0G   10G  34% /
    udev             18G  184K   18G   1% /dev
    tmpfs            18G     0   18G   0% /dev/shm
    /dev/sda1      1011M   91M  869M  10% /boot
    /dev/sda6       7.6G  287M  6.9G   4% /var
    /dev/sda8        62G   14G   45G  25% /space
    /dev/sdb1       250G   93G  158G  37% /data01
    /dev/sdc1       1.0T   90G  934G   9% /data02
    /dev/sdd1       1.0T   89G  935G   9% /data03
    /dev/sde1       1.0T   90G  935G   9% /data04
    /dev/sdf1       1.0T   91G  934G   9% /data05
    /dev/sdg1       1.0T   90G  935G   9% /data06
    /dev/sdh1       1.0T   90G  935G   9% /data07

 

Cause

分区 /data01 为 250G,而其他分区为 1 TB,因此其利用率高于其他分区的原因。我们不支持在同一 AVE 中具有不同的数据分区大小。

由于这是一个 4TB AVE,因此应该有六个数据分区(每个分区 1 TB);来源: https://www.delltechnologies.com/asset/en-us/products/data-protection/technical-support/docu91853.pdf (“AVE 虚拟磁盘要求”,第 17 页)

同一页面中有一条注释:“
由于 AVE .ova 安装会创建三个 250 GB 存储分区以及作系统磁盘,因此在安装时需要大约 900 GB 的可用磁盘空间。但是,AVE .ovf 安装不会在安装过程中创建存储分区。因此,在安装时作系统磁盘只需要足够的磁盘空间,并且可以在其他数据存储上创建后续存储分区”

由于此 AVE 有 7 个分区(其中一个分区为 250 GB,其余分区为 1 TB),而不是 6 个分区(1 TB),因此 AVE 部署错误。

 

Resolution

在这种情况下,客户必须部署新的 4 TB AVE(具有 6 个 1 TB 数据分区,无 250 GB 分区)。然后将其所有数据复制到新系统。

 

Additional Information

新的部署将由当地团队和专业服务部门负责。

 

Affected Products

Avamar

Products

Avamar, Avamar Server, Avamar Virtual Edition
Article Properties
Article Number: 000056258
Article Type: Solution
Last Modified: 11 Dec 2025
Version:  6
Find answers to your questions from other Dell users
Support Services
Check if your device is covered by Support Services.