NetWorker:在 Pacemaker 更新后,Red Hat 8.x 群集 NetWorker 服务器无法启动

Summary: 部署在 Red Hat 8.x 故障切换群集上的 NetWorker 服务器无法启动,并显示“无法启动 nsrd,因为 /nsr/res (/nsr) 是本地的,并且 NetWorker 配置为群集服务器。”

This article applies to This article does not apply to This article is not tied to any specific product. Not all product versions are identified in this article.

Symptoms

  • NetWorker 服务器配置部署在 Red Hat 8.x 高可用性群集上。
  • nws 资源无法启动:
[root@NetWorker_Node_Name ~]# pcs status
...
Full List of Resources:
  * Resource Group: NW_group:
    * fs        (ocf::heartbeat:Filesystem):     Started NetWorker_Node_Name
    * ip        (ocf::heartbeat:IPaddr):         Started NetWorker_Node_Name
    * nws       (ocf::EMC_NetWorker:Server):     FAILED NetWorker_Node_Name

Failed Resource Actions:
  * nws_start_0 on NetWorker_Node_Name 'error' (1): call=25, status='Timed Out', exitreason='Resource agent did not complete within 5m', last-rc-change='DDD MM # HH:MM:SS 2022', queued=0ms, exec=300011ms
  • NetWorker 服务器服务无法启动,共享 nsr 目录的daemon.raw中记录了以下内容:
    • 示例: nsr_render_log /nsr_share/nsr/logs/daemon.raw
73248 MM/DD/YY HH:MM:SS AM/PM  5 5 0 1783289664 29561 0 NetWorker_Node_Name nsrd NSR critical Can't start nsrd because /nsr/res (/nsr) is local, and NetWorker is configured as a cluster server. Use cluster manager to check NetWorker service status. 
144354 MM/DD/YY HH:MM:SS AM/PM  1 5 0 3720415040 28971 0 NetWorker_Node_Name nsrctld NSR notice Daemon nsrdispd terminated. 
144359 MM/DD/YY HH:MM:SS AM/PM  3 5 0 3720415040 28971 0 NetWorker_Node_Name nsrctld NSR error Scheduling restart of daemon nsrdispd in 5 seconds 
137911 MM/DD/YY HH:MM:SS AM/PM  5 5 0 3720415040 28971 0 NetWorker_Node_Name nsrctld NSR critical Aborting startup sequence: Process nsrd exited in less than 10 seconds at startup: exit code 1 
127108 MM/DD/YY HH:MM:SS AM/PM  5 5 0 3720415040 28971 0 NetWorker_Node_Name nsrctld NSR critical Failed to start all daemons; shutting down... 
  • lcmap 不显示群集 IP 地址或拥有的路径:
[root@NetWorker_Node_Name ~]# lcmap
type: NSR_CLU_TYPE;
clu_type: NSR_LC_TYPE;
interface version: 1.0;

[root@NetWorker_Node_Name ~]#
  • NetWorker 服务器为 19.8.0.1 或更低版本。

Cause

此问题是在以下情况下发现的: pcs 版本为 0.10.14(或更高版本)。 lcmap 未正确映射共享位置。

[root@NetWorker_Node_Name ~]# pcs --version
0.10.14

Resolution

解决方案:

此问题已在 NetWorker 19.8.0.4 中得到解决;但是,19.8 是截至 2025 年 11 月 11 日终止支持期限 (EOSL)。升级到 19.9.0.2 或更高版本以获取代码修复: https://www.dell.com/support/home/product-support/product/networker/drivers

提醒:NetWorker 产品页面的概述选项卡中列出了 NetWorker 版本的支持终止日期。

请参阅:NetWorker:Red Hat Pacemaker 群集 如何升级 NetWorker 服务器和最佳实践
 

解决办法:

yum downgrade pcs

提醒:Red Hat 系统更新会自动更新 pcs 找到较新的版本时。在代码修复可用之前,可以通过排除 pcs 在 yum.conf。从 中删除此条目 yum.conf NetWorker 升级到上面列出的版本之一(或更高版本)后,该文件。
root@NWrhelNodeA:~# echo exclude=pcs >> /etc/yum.conf
root@NWrhelNodeA:~# cat /etc/yum.conf | grep pcs
exclude=pcs

曾经在 0.10.12 上 lcmap 正确查看拥有的路径,并 nws 资源开始:

[root@NetWorker_Node_Name ~]# pcs --version
0.10.12

[root@NetWorker_Node_Name ~]# lcmap
type: NSR_CLU_TYPE;
clu_type: NSR_LC_TYPE;
interface version: 1.0;

type: NSR_CLU_VIRTHOST;
hostname: Cluster_IP;
local: TRUE;
owned paths: /nsr_share;

[root@NetWorker_Node_Name ~]# pcs resource status
  * Resource Group: NW_group:
    * fs        (ocf::heartbeat:Filesystem):     Started NetWorker_Node_Name
    * ip        (ocf::heartbeat:IPaddr):         Started NetWorker_Node_Name
    * nws       (ocf::EMC_NetWorker:Server):     Started NetWorker_Node_Name

Additional Information

Affected Products

NetWorker

Products

NetWorker Family, NetWorker Series
Article Properties
Article Number: 000205728
Article Type: Solution
Last Modified: 04 Nov 2025
Version:  12
Find answers to your questions from other Dell users
Support Services
Check if your device is covered by Support Services.