ECS:容器在节点上永久关闭:症状代码:2021 年
Summary: 本知识库文章介绍了 ECS 报告容器在节点上永久关闭的原因。
This article applies to
This article does not apply to
This article is not tied to any specific product.
Not all product versions are identified in this article.
Symptoms
系统已呼叫总部,并显示以下警报:
Clarify Id: APMxxxxxxxx Site Name: UNKNOWN Vendor: EMC DeviceType: ElasticCloudStorageApp Model: ElasticCloudStorage SerialNumber: APMxxxxxxxx WWN: APMxxxxxxxx Platform: platform OS: SLES OS_VER: 12.4 EmbedLevel: 2 InternalMaxSize: 512800 Comment: Fabric Ucode_Ver: 3.7.0.6-7700.ed29023b ConnectType: ESRS IP_Address: Not Available IP_Name: hostname.domainname.net ConnectNum: 169.254.1.1 Port: 22 SymptomCode: 2021 Category: Status Severity: Critical Status: Failed Component: Node ComponentID: xxxxxxx-xxxx-xxxx-xxxx-xxxxxxxx SubComponent: Service SubComponentID: <docker container name> CallHome: true FirstTime: 2023-12-09T07:48:20.232Z Description: Container <container> is permanently down on node <node>
Cause
容器停止、暂停或根本未启动至少 10 分钟。
Resolution
Docker 容器(object-main、fabric-lifecycle、fabric-zookeeper、fabric-registry)停止、暂停或根本没有启动至少 10 分钟。使用以下过程确定故障容器:
- 根据组件 ID 或节点 ID,确定 ECS 群集中记录故障的节点。示例:组件 ID
4ca42022-46ed-475e-8ab7-6ef9141e5415
sudo /opt/emc/caspian/fabric/cli/bin/fcli lifecycle node.network --id 4ca42022-46ed-475e-8ab7-6ef9141e5415
{
"network": {
"hostname": "hostname.domainname.net", << Hostname
"private_ip": "169.254.1.3", << NAN IP
"mgmt_ip": "10.2.3.4", << Management IP
"public_ip": "10.241.207.59",
"data_ip": "10.241.207.73",
"replication_ip": "10.241.207.59",
"public_interface_name": "public",
"private_interface_name": "private.4",
"mgmt_interface_name": "public",
"data_interface_name": "public:data",
"replication_interface_name": "public"
},
"status": "OK",
"etag": 50
}
- 使用管理 IP、专用 IP 或主机名通过 SSH 连接到目标节点。
- 使用正确的参数验证 docker 服务器是否正常运行
# ps -ef | grep docker root 50062 1 0 Jun02 ? 00:02:11 /usr/bin/docker daemon -H fd:// --insecure-registry=0.0.0.0/0 --log-level=warn
- 接下来,我们必须验证哪个容器(NAMES 列)由于某种原因停止或根本没有启动(查看 STATUS 列):
# sudo docker ps -a CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES 7bf16df0ef15 464b97154c24 "/opt/vipr/boot/boot." 3 days ago Up 3 days object-main 0ef5cc422543 24d9d6008893 "./boot.sh lifecycle" 3 days ago Up 3 days fabric-lifecycle 87d6c77d98ca 32cce433c3dc "./boot.sh 3 1=169.25" 3 days ago Up 3 days fabric-zookeeper
- 验证结构服务是否正在运行。结构代理尝试自动恢复有问题的容器
# sudo service fabric-agent status fabric-agent.service - fabric agent Loaded: loaded (/usr/lib/systemd/system/fabric-agent.service; enabled) Active: active (running) since Thu 2016-06-02 17:56:39 UTC; 3 days ago Process: 50643 ExecStartPre=/bin/rm -f /var/run/fabric-agent.pid (code=exited, status=0/SUCCESS) Main PID: 50645 (java) CGroup: /system.slice/fabric-agent.service
- 查看已停止/故障的容器状态
# sudo docker inspect fabric-zookeeper | grep -A12 State
"State": {
"Status": "running",
"Running": true,
"Paused": false,
"Restarting": false,
"OOMKilled": false,
"Dead": false,
"Pid": 80462,
"ExitCode": 0,
"Error": "",
"StartedAt": "2016-06-06T17:29:12.968133861Z",
"FinishedAt": "2016-06-06T17:29:12.882812946Z"
},
如果我们仍不确定问题是否存在,请咨询 ECS 技术支持以获得进一步的帮助。
Affected Products
ECS Appliance Software without EncryptionProducts
ECS Appliance, ECS Appliance Software with Encryption, ECS Appliance Software without EncryptionArticle Properties
Article Number: 000064491
Article Type: Solution
Last Modified: 17 Dec 2025
Version: 6
Find answers to your questions from other Dell users
Support Services
Check if your device is covered by Support Services.