Isilon: How to Manually Restart Processes That Failed to Restart After Maintenance

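Summary: After maintenance on an Isilon node, use SSH and the isi_for_array command to check whether the failed process is running. If it is running, resolve the issue by clearing the alert. If it is not, get help restarting it manually.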

Instructions

After a maintenance activity, a process may fail to start automatically.

The following alert may appear:

Event : Process <Process Name> of service <Service Name> has failed to restart after multiple attempts
Example:
Process isi_healthcheck of service isi_healthcheck has failed to restart after multiple attempts
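When the alert text is available as a string, the process and service names can be pulled out of it before running the checks. The following is a minimal sketch, not part of OneFS: `parse_restart_alert` is a hypothetical helper name, and the pattern assumes the message has exactly the wording shown above and that neither name contains spaces.

```shell
#!/bin/sh
# Hypothetical helper (not a OneFS command): print "<process> <service>"
# extracted from a "failed to restart" alert message.
# Assumes the exact wording of the alert above and names without spaces.
parse_restart_alert() {
    printf '%s\n' "$1" | sed -n \
        's/^Process \([^ ]*\) of service \([^ ]*\) has failed to restart after multiple attempts$/\1 \2/p'
}
```

Calling it with the example alert above prints `isi_healthcheck isi_healthcheck`. A message that does not match the pattern prints nothing.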

Normally, the process restarts automatically. If it does not, restarting it manually usually resolves the issue.


To verify whether the process is running:

  1. Open an SSH connection to any node in the cluster and log in as "root".
  2. Check whether the process is now running on the node that reported the error:
# isi_for_array -n <Node Number> 'ps auxwww | grep -i <Process Name> | grep -v grep'
Where <Node Number> is the node on which the process is not running.
Example:
# isi_for_array -n1 'ps auxwww | grep isi_healthcheck | grep -v grep'
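The check above can be wrapped in a small function so the result is available as an exit status. This is a sketch under stated assumptions: `process_is_running` is a hypothetical helper name, it runs the same pipeline as the manual command, and it treats any non-empty output as "running", since the exit status of `isi_for_array` itself is not documented here.

```shell
#!/bin/sh
# Hypothetical helper (not a OneFS command): succeed if the named process
# appears in the ps output of the given node, fail otherwise.
process_is_running() {
    node="$1"
    proc="$2"
    # Same pipeline as the manual check; "grep -v grep" drops the grep
    # process itself from the listing.
    output=$(isi_for_array -n "$node" "ps auxwww | grep -i '$proc' | grep -v grep")
    if [ -n "$output" ]; then
        printf '%s\n' "$output"
        return 0
    fi
    echo "No '$proc' process found on node $node" >&2
    return 1
}
```

For example, `process_is_running 1 isi_healthcheck && echo "running"` prints the matching ps lines followed by `running` when the process is found on node 1.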


If the process is running:

  1. Get the event group ID with the following command:
# isi event groups list
  2. Clear the alert:
# isi event groups modify --resolved=yes --ignore=yes --id=<Group ID from Step 1>
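The clearing step can be guarded against a mistyped ID with a small wrapper. This is a sketch, not a definitive implementation: `resolve_event_group` is a hypothetical helper name, the `isi event groups modify` invocation is copied unchanged from the step above, and the check assumes that event group IDs are plain integers.

```shell
#!/bin/sh
# Hypothetical helper (not a OneFS command): mark one event group as
# resolved and ignored, using the command given in this article.
# Assumption: event group IDs are plain integers.
resolve_event_group() {
    group_id="$1"
    case "$group_id" in
        ''|*[!0-9]*)
            echo "usage: resolve_event_group <numeric group ID>" >&2
            return 2
            ;;
    esac
    isi event groups modify --resolved=yes --ignore=yes --id="$group_id"
}
```

Run `isi event groups list` first to find the ID, then call the function with that value, for example `resolve_event_group <Group ID>`.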


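If the process is not running:

The exact restart procedure varies by process. If you need help restarting the process manually, contact Dell Technical Support.


Note: Restarting a service incorrectly, or on the wrong node, can cause serious problems.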

Affected Products

Isilon

Article Properties
Article Number: 000019770
Article Type: How To
Last Modified: 04 Mar 2025
Version:  5