Isilon:某些節點上的 /var/log 訊息會不時顯示記憶體不足訊息

Summary: 某些節點上的 /var/log 訊息會不時顯示這些記憶體不足 (OOM) 訊息:OOM:v_wire_count:2832982、v_active_count:516

This article applies to This article does not apply to This article is not tied to any specific product. Not all product versions are identified in this article.

Symptoms

 完整的消息集如下所示: 
2021-08-17T14:40:50.703082+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: OOM: v_wire_count: 2843879, v_active_count: 871
2021-08-17T14:40:50.703246+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: Malloc Pigs:
2021-08-17T14:40:50.703277+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: Type                   InUse   MemUse   Requests
2021-08-17T14:40:50.703303+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: 8kB dinodes           840507  647775K 13330759531
2021-08-17T14:40:50.703325+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: isi_hash               31424   88002K 1548035744
2021-08-17T14:40:50.703344+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: lbm super              17535   84150K    1435677
2021-08-17T14:40:50.703362+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: layout_hints          183473   57336K   22320273
2021-08-17T14:40:50.703379+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: iaddr_set             886694   55419K 5188138062
2021-08-17T14:40:50.703397+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: devbuf                 17379   37840K     184744
2021-08-17T14:40:50.703414+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: newblk                    26   32774K    2364548
2021-08-17T14:40:50.703432+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: bar_owner_vec259         261   32288K    2220148
2021-08-17T14:40:50.703451+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: parent_vec153         218013   27538K    4567646
2021-08-17T14:40:50.703475+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: ddvec84               183471   22840K  177809736
2021-08-17T14:40:50.703496+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: inodedep                  22   16390K    1677902
2021-08-17T14:40:50.703515+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: sysctloid             201426   10989K     201598
2021-08-17T14:40:50.703535+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: vfscache                   4    8241K          4
2021-08-17T14:40:50.703555+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: KTR                        2    7168K          2
2021-08-17T14:40:50.703574+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: ptr_llcb_map           56126    7017K  116151979
2021-08-17T14:40:50.703592+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: Unshown bins account for 84039K
2021-08-17T14:40:50.703611+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: Total: 1219801K
2021-08-17T14:40:50.703630+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: UMA Zalloc Pigs:
2021-08-17T14:40:50.703648+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: ZONE NAME       SIZE     LIMIT     COUNT   MEM USED
2021-08-17T14:40:50.703668+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: mbuf          000256, 06012268, 00017191, 00004297K
2021-08-17T14:40:50.703687+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: socket        000872, 00765000, 00002327, 00001981K
2021-08-17T14:40:50.703704+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: tcpcb         001080, 00765000, 00001652, 00001742K
2021-08-17T14:40:50.703722+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: tcp_inpcb     000488, 00765000, 00001665, 00000793K
2021-08-17T14:40:50.703740+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: udp_inpcb     000488, 00765000, 00000270, 00000128K
2021-08-17T14:40:50.703758+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: unpcb         000192, 00765000, 00000391, 00000073K
2021-08-17T14:40:50.703776+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: hostcache     000096, 00015360, 00000191, 00000017K
2021-08-17T14:40:50.703794+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: udpcb         000032, 00765000, 00000270, 00000008K
2021-08-17T14:40:50.703812+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: clpbuf        001016, 00015872, 00000006, 00000005K
2021-08-17T14:40:50.703832+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: tcptw         000096, 00027767, 00000013, 00000001K
2021-08-17T14:40:50.703853+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: ripcb         000488, 00765000, 00000001, 00000000K
2021-08-17T14:40:50.703872+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: Unshown zones account for 0K
2021-08-17T14:40:50.703890+12:00 <0.4> cluster-1(id1) /boot/kernel.amd64/kernel: Total: 9051K

Cause

OneFS 9.2 中推出的新 OneFS 功能會新增這些訊息。
每次當分頁守護程式無法將可用記憶體提升到 free_target 以上時,此新函數都會記錄分頁函數的記憶體使用方式資訊。
作業系統會主動呼叫上述功能以記錄資訊,以防記憶體進一步不足,且節點停止回應或發生錯誤。

Resolution

重要:
如果節點只有 16 GB RAM,建議使用者升級至 64 GB。

如果節點有 24 GB 或更多的 RAM,則在滿足所有這些條件的情況下,可以安全地忽略以下訊息:

  • /var/log/vmlog 顯示可用記憶體未經常低於 50% 的可用目標
  • 叢集中沒有節點發生錯誤重新開機,錯誤字串中出現BUF_TIMELOCK或其他 OOM 相關的逾時訊息
  • 可使用 freevnodes sysctl 在所有節點上顯示已啟用 (值為 1):
# isi_for_array 'sysctl vfs.vnlru_reuse_freevnodes'

注意:
記得改變 sysctl vfs.vnlru_reuse_freevnodes 如果滿足下列所有條件,則返回預設值 (0 - 零):
  • 節點有 24 GB 或更多的實體記憶體
  • 可使用 vnlru_reuse_freevnodes=1 設定最初是為了解決高 8KB Dinode 問題而添加的
  • OneFS 版本會升級至以下其中一個層級或更新版本:
    • OneFS 9.2.1.25_GA-RUP_2023-12
    • OneFS 9.4.0.17_GA-RUP_2024-02
    • OneFS 9.5.0.7_LTS2023_GA-RUP (2024 年 1 月)
Article Properties
Article Number: 000191515
Article Type: Solution
Last Modified: 03 Apr 2024
Version:  5
Find answers to your questions from other Dell users
Support Services
Check if your device is covered by Support Services.