Isilon OneFS: A faulty DIMM causing Backend latency

Summary: A faulty memory module (DIMM) in a node may cause backend latency.

Αυτό το άρθρο ισχύει για Αυτό το άρθρο δεν ισχύει για Αυτό το άρθρο δεν συνδέεται με κάποιο συγκεκριμένο προϊόν. Δεν προσδιορίζονται όλες οι εκδόσεις προϊόντων σε αυτό το άρθρο.

Symptoms

Clients are reporting poor performance and there is high load and CPU utilization on one node due to isi_mca_dump process. 
 

# isi_for_array -s uptime
xxxxx-1:   1:37AM  up 216 days,  9:16, 1 users, load averages: 7.95, 6.69, 6.87
xxxxx-2:   1:37AM  up 216 days,  9:17, 2 users, load averages: 12.69, 7.75, 6.18
xxxxx-3:   1:37AM  up 216 days,  9:13, 0 users, load averages: 9.18, 5.96, 5.79
xxxxx-4:   1:37AM  up 128 days,  4:51, 1 users, load averages: 9.49, 6.47, 5.90
xxxxx-5:   1:37AM  up 216 days,  9:13, 0 users, load averages: 10.18, 6.25, 6.06
xxxxx-6:   1:37AM  up 216 days,  9:13, 0 users, load averages: 9.10, 6.52, 5.58
xxxxx-7:   1:37AM  up 216 days,  9:13, 0 users, load averages: 5.99, 4.45, 4.14
xxxxx-8:   1:37AM  up 90 days, 11:17, 2 users, load averages: 15.96, 26.64, 28.16 
xxxxx-9:   1:37AM  up 216 days,  9:13, 1 users, load averages: 5.99, 4.88, 5.12
xxxxx-10:  1:37AM  up 216 days,  9:13, 1 users, load averages: 12.52, 7.48, 6.28

# isi_for_array -n8 top
xxxxx-8: last pid: 75601;  load averages: 20.75, 19.19, 23.94  up 90+11:21:47    01:42:08
xxxxx-8: 135 processes: 5 running, 129 sleeping, 1 zombie
xxxxx-8:
xxxxx-8: Mem: 1310M Active, 69G Inact, 157G Wired, 96G Buf, 5078M Free
xxxxx-8: Swap:
xxxxx-8:
xxxxx-8:
xxxxx-8:   PID USERNAME       THR PRI NICE   SIZE    RES STATE   C   TIME    WCPU COMMAND
xxxxx-8: 73859 root             1 103    0   122M 10148K CPU19  19   6:00 100.00% isi_mca_dump
xxxxx-8: 74626 root             1 103    0   122M 10172K CPU26  26   3:01  98.97% isi_mca_dump
 

Cause

Machine Check Architecture (MCA) is an error reporting mechanism for the CPU and memory. Due to the faulty DIMM, there is a delay in the data written to or read from the affected node. This delay can cause backend latency and eventually affect the overall cluster performance. In worst-case scenarios, the latency can lead to a data unavailable (DU) situation.

Resolution

Replace the faulty DIMM.

Επηρεαζόμενα προϊόντα

Isilon

Προϊόντα

Isilon, PowerScale OneFS
Ιδιότητες άρθρου
Article Number: 000069748
Article Type: Solution
Τελευταία τροποποίηση: 17 Δεκ 2025
Version:  6
Βρείτε απαντήσεις στις ερωτήσεις σας από άλλους χρήστες της Dell
Υπηρεσίες υποστήριξης
Ελέγξτε αν η συσκευή σας καλύπτεται από τις Υπηρεσίες υποστήριξης.