Article Number: 000168665
[root@NodeX:~] tail /var/log/vmkernel.log
2018-12-06T14:55:30.958Z cpu3:2550663)MemSchedAdmit: 470: Admission failure in path: vsanperfsvc/python.2550635/uw.2550635
2018-12-06T14:55:30.958Z cpu3:2550663)MemSchedAdmit: 477: uw.2550635 (3479393) extraMin/extraFromParent: 64/64, vsanperfsvc (2360) childEmin/eMinLimit: 38886/38912
2018-12-06T14:55:30.958Z cpu3:2550663)MemSchedAdmit: 470: Admission failure in path: vsanperfsvc/python.2550635/uw.2550635
2018-12-06T14:55:30.958Z cpu3:2550663)MemSchedAdmit: 477: uw.2550635 (3479393) extraMin/extraFromParent: 64/64, vsanperfsvc (2360) childEmin/eMinLimit: 38886/38912
2018-12-06T14:55:30.958Z cpu3:2550663)MemSchedAdmit: 470: Admission failure in path: vsanperfsvc/python.2550635/uw.2550635
2018-12-06T14:55:30.958Z cpu3:2550663)MemSchedAdmit: 477: uw.2550635 (3479393) extraMin/extraFromParent: 64/64, vsanperfsvc (2360) childEmin/eMinLimit: 38886/38912
2018-12-06T14:55:30.958Z cpu3:2550663)MemSchedAdmit: 470: Admission failure in path: vsanperfsvc/python.2550635/uw.2550635
2018-12-06T14:55:30.958Z cpu3:2550663)MemSchedAdmit: 477: uw.2550635 (3479393) extraMin/extraFromParent: 64/64, vsanperfsvc (2360) childEmin/eMinLimit: 38886/38912
2018-12-06T14:55:30.958Z cpu3:2550663)MemSchedAdmit: 470: Admission failure in path: vsanperfsvc/python.2550635/uw.2550635
2018-12-06T14:55:30.958Z cpu3:2550663)MemSchedAdmit: 477: uw.2550635 (3479393) extraMin/extraFromParent: 64/64, vsanperfsvc (2360) childEmin/eMinLimit: 38886/38912
[root@NodeX:~] /etc/init.d/vsanmgmtd status
vsanperfsvc is running
[root@NodeX:~] /etc/init.d/vsanmgmtd restart
watchdog-vsanperfsvc: Terminating watchdog process with PID 2099001
stopping timed out
vsanperfsvc started
[root@NodeX:~] ls -lah /var/core/
total 595924
drwxr-xr-x 1 root root 8 Dec 6 15:40 .
drwxr-xr-x 1 root root 8 Jan 1 1970 ..
-rwx------ 1 root root 1.3M Dec 5 13:58 cmmds-tool-zdump.000
-rwx------ 1 root root 63.8M Dec 4 19:25 hostd-worker-zdump.000
-rwx------ 1 root root 67.1M Dec 4 18:48 hostd-worker-zdump.002
-rwx------ 1 root root 69.6M Dec 4 19:04 hostd-worker-zdump.003
-rwx------ 1 root root 3.9M Dec 6 15:18 localcli-zdump.000
-rwx------ 1 root root 3.3M Dec 6 08:14 localcli-zdump.002
-rwx------ 1 root root 3.9M Dec 6 15:15 localcli-zdump.003
-rwx------ 1 root root 178.0M Dec 6 15:28 python-zdump.001
-rwx------ 1 root root 178.0M Dec 6 15:37 python-zdump.002
-rwx------ 1 root root 12.6M Dec 6 15:40 python-zdump.003
These errors occur because the vsanmgmtd service on each host runs out of memory, crashes, and then attempts to restart. Each time the service crashes in this situation, a python-zdump file is created in /var/core/. While vsanmgmtd is unresponsive or has crashed, the host can no longer report certain vSAN statistics to vCenter. As a result, vCenter reports false or incomplete information for one or more hosts, and many routine management operations time out or fail.
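To gauge how often the admission failures occur, the vmkernel.log lines shown above can be summarized with a short script. The following is a minimal sketch, not a supported tool: the function names count_admission_failures and headroom are hypothetical, the regular expressions are modeled only on the sample log lines in this article, and reading the gap between childEmin and eMinLimit as remaining headroom is an assumption (the log does not state units).

```python
import re
from collections import Counter

# Patterns modeled only on the vmkernel.log sample lines in this article.
FAILURE_RE = re.compile(r"MemSchedAdmit: \d+: Admission failure in path: (\S+)")
LIMIT_RE = re.compile(r"(\S+) \(\d+\) childEmin/eMinLimit: (\d+)/(\d+)")


def count_admission_failures(lines):
    """Count 'Admission failure' entries per top-level resource group."""
    counts = Counter()
    for line in lines:
        match = FAILURE_RE.search(line)
        if match:
            # Path looks like vsanperfsvc/python.<pid>/uw.<pid>; keep the first part.
            counts[match.group(1).split("/")[0]] += 1
    return counts


def headroom(line):
    """Return (group, eMinLimit - childEmin) for a limit line, else None.

    Treating the difference as remaining headroom is an assumption.
    """
    match = LIMIT_RE.search(line)
    if match is None:
        return None
    group, child_emin, emin_limit = match.group(1), int(match.group(2)), int(match.group(3))
    return group, emin_limit - child_emin
```

For example, passing the lines of /var/log/vmkernel.log (read with open(...).read().splitlines()) to count_admission_failures gives a per-group failure count, and a count that keeps growing for vsanperfsvc alongside new python-zdump files in /var/core/ is consistent with the behavior described above.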
The vsanmgmtd service is commonly known as the vSAN Health Service or vSAN Management Service. vCenter primarily makes API calls to this service for vSAN reporting on various components. VMware also allows other software vendors to use this API so that their software can provide reporting, monitoring, or analytical services. Because this API is generally updated with every version of vSAN, it is important to keep the external software updated to maintain compatibility. In this situation, the version incompatibility led to the vsanmgmtd service being polled too often, which in turn led to memory exhaustion and crashes.
Customers using these tools, such as Veeam ONE, may still be running versions that only support ESXi 6.5 / vSAN 6.6.1 and below at the time of upgrading to VxRail 4.7.x. For this reason, it is imperative to verify that all external software in the vSAN environment is compatible with the target vSAN version before upgrading.
VxRail Appliance Family, VxRail Software
10 May 2023
3
Solution