PowerPath/VE for VMware fails to claim (all) VPLEX LUNs at boot
Summary: When the host reboots, some or all VPLEX LUNs are managed by NMP instead of PowerPath/VE.
Symptoms
Environment:
Operating system: VMware ESXi 6.0.0 Update 2 (build 3620759, build 4192238)
EMC software: PowerPath/VE for VMware vSphere 6.0
EMC software: PowerPath/VE for VMware vSphere 6.0 SP1
EMC software: PowerPath/VE for VMware vSphere 6.1
Server: HP ProLiant BL460c Gen9
Host bus adapter: Emulex Corporation Emulex OneConnect OCe14000, FCoE Initiator: 650FLB CNA
HBA driver: lpfc 11.1.145.18-1OEM.600.0.0.2768847 EMU VMwareCertified 2016-12-04
Product: VPLEX (5410, 5520)
From vmkernel.log:
2017-05-16T08:06:50.035Z cpu21:33912)ScsiClaimrule: 1165: The current claimrules indicate that path vmhba0:C0:T0:L1 should be claimed by plugin PowerPath.
2017-05-16T08:06:50.035Z cpu21:33912)ScsiClaimrule: 1169: Path vmhba0:C0:T6:L1 which appears to refer to the same physical media as path vmhba0:C0:T0:L1 is already claimed by plugin NMP.
2017-05-16T08:06:50.035Z cpu21:33912)ScsiClaimrule: 1171: If neither of these paths is being masked by ESX, this condition indicates a problem with the claimrules.
2017-05-16T08:06:50.035Z cpu21:33912)WARNING: ScsiPath: 608: Path vmhba0:C0:T0:L1 claims to be a VVol PE but has a version of 4 (expected 5 or higher). Not treating it as a PE.
2017-05-16T08:06:50.036Z cpu21:33912)ScsiPath: 5549: Plugin 'NMP' claimed path 'vmhba0:C0:T0:L1'
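The pattern in this excerpt (a claim rule names PowerPath, but NMP ends up claiming the path) can be searched for across a whole vmkernel.log. The following is a minimal Python sketch, not a supported tool. It assumes the message wording matches the excerpt above exactly; the function name `find_misclaimed_paths` is hypothetical.

```python
import re

# Both patterns are taken from the message wording in the excerpt above;
# other ESXi builds may word these messages differently (assumption).
SHOULD_CLAIM = re.compile(r"path (\S+) should be claimed by plugin (\w+)")
CLAIMED = re.compile(r"Plugin '(\w+)' claimed path '([^']+)'")


def find_misclaimed_paths(log_text):
    """Return {path: (expected_plugin, actual_plugin)} for mismatched claims.

    If a path is claimed more than once in the log, the last claim wins,
    so a path that PowerPath eventually claims is not reported.
    """
    expected = {m.group(1): m.group(2) for m in SHOULD_CLAIM.finditer(log_text)}
    actual = {m.group(2): m.group(1) for m in CLAIMED.finditer(log_text)}
    return {
        path: (want, actual[path])
        for path, want in expected.items()
        if path in actual and actual[path] != want
    }
```

Passing the contents of vmkernel.log as a string returns one entry per path whose final claiming plugin differs from the one named by the claim rules.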
Cause
During the boot process, the SCSI inquiry commands sent to the affected devices fail.
Resolution
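In this case, the host vendor replaced the two HBA cards in both servers, swapping the FLB 650 for the FLB 630.
After the servers were rebooted, the issue no longer occurred and PowerPath/VE managed the devices correctly.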
Additional Information
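The HBA model can be found in the localcli_storage-core-adapter-list.txt output.
The server make and model can be found in esxcfg-info_-a.txt.FRAG-00000.txt.
The VMware version can be found in vmware_-vl.txt.
The array model and firmware can be found in localcli_storage-core-device-list.txt.
To troubleshoot this issue properly, we used a special engineering build.
PowerPath relies on SCSI inquiry commands to claim paths. In the logs from the engineering build, the inquiry commands (cmd=0x12 in the log) initially fail with a HOST_RETRY (0xc) or HOST_NO_CONNECT (0x1) host status. Later, when ESXi presents the device again, the inquiries succeed and PowerPath claims the device. Without the engineering build, this second sequence is not seen.
In response to the HOST_RETRY error, PowerPath retries the inquiry command internally several times, at intervals of about 0.1 second. As the log shows, the commands still fail at the host adapter.
The inquiry fails initially: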
2016-12-14T11:53:51.561Z cpu24:33396)PowerPath:Claiming path vmhba0:C0:T1:L0
2016-12-14T11:53:51.561Z cpu24:33396)PowerPath:PowerPlatformScsiIoErrorIsRetryable: cmd=0x12 Failed H: 0xc S: 0x0 P: 0x0 Path=vmhba0:C0:T1:L0
2016-12-14T11:53:51.663Z cpu24:33396)PowerPath:PowerPlatformScsiIoErrorIsRetryable: cmd=0x12 Failed H: 0xc S: 0x0 P: 0x0 Path=vmhba0:C0:T1:L0
2016-12-14T11:53:51.765Z cpu24:33396)PowerPath:PowerPlatformScsiIoErrorIsRetryable: cmd=0x12 Failed H: 0xc S: 0x0 P: 0x0 Path=vmhba0:C0:T1:L0
2016-12-14T11:53:51.867Z cpu24:33396)PowerPath:PowerPlatformScsiIoErrorIsRetryable: cmd=0x12 Failed H: 0xc S: 0x0 P: 0x0 Path=vmhba0:C0:T1:L0
2016-12-14T11:53:51.969Z cpu24:33396)PowerPath:PowerPlatformScsiIoErrorIsRetryable: cmd=0x12 Failed H: 0xc S: 0x0 P: 0x0 Path=vmhba0:C0:T1:L0
2016-12-14T11:53:56.772Z cpu46:33491)ALERT: PowerPath:MpxRecognize failed. Path vmhba0:C0:T1:L0 not claimed
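The failed inquiries and their spacing can be extracted from such a log mechanically. The sketch below is illustrative only. It assumes the line format shown in the excerpt above (which comes from the engineering build), decodes only the two host status values named in this article, and uses hypothetical function names (`inquiry_failures`, `retry_gaps_ms`).

```python
import re
from datetime import datetime

# Only the two host status values named in this article are decoded;
# any other value is reported as "UNKNOWN".
HOST_STATUS = {0x1: "HOST_NO_CONNECT", 0xC: "HOST_RETRY"}

# Line format assumed from the engineering-build excerpt above.
FAILURE = re.compile(
    r"(\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}\.\d{3})Z "
    r"\S+PowerPath:PowerPlatformScsiIoErrorIsRetryable: "
    r"cmd=(0x[0-9a-fA-F]+) Failed H: (0x[0-9a-fA-F]+) "
    r"S: 0x[0-9a-fA-F]+ P: 0x[0-9a-fA-F]+ Path=(\S+)"
)


def inquiry_failures(log_text):
    """Return one dict per failed command logged by PowerPath, in log order."""
    events = []
    for m in FAILURE.finditer(log_text):
        host = int(m.group(3), 16)
        events.append({
            "time": datetime.strptime(m.group(1), "%Y-%m-%dT%H:%M:%S.%f"),
            "cmd": int(m.group(2), 16),  # 0x12 is the SCSI INQUIRY opcode
            "host_status": HOST_STATUS.get(host, "UNKNOWN"),
            "path": m.group(4),
        })
    return events


def retry_gaps_ms(events):
    """Milliseconds between consecutive failures (the retry interval)."""
    return [
        round((b["time"] - a["time"]).total_seconds() * 1000)
        for a, b in zip(events, events[1:])
    ]
```

Applied to the excerpt above, the gaps between consecutive failures reflect the roughly 0.1 second internal retry interval.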
The inquiry eventually succeeds and PowerPath claims the device. This sequence does not occur with the regular GA build.
2016-12-14T11:54:08.542Z cpu12:34080)PowerPath:Claiming path vmhba0:C0:T1:L0
2016-12-14T11:54:08.545Z cpu12:34080)PowerPath:Path Claim: Successfully claimed path vmhba0:C0:T1:L0
During the same period, the lpfc driver reports link down and link up events, delayed port discovery messages, and so on.
2016-12-14T11:53:46.586Z cpu30:33490)WARNING: lpfc: lpfc_mbx_cmpl_read_topology:3271: 0:1305 Link Down Event x5 received Data: x5 x20 x800220 x0
2016-12-14T11:53:46.704Z cpu4:33493)WARNING: lpfc: lpfc_mbx_cmpl_read_topology:3271: 1:1305 Link Down Event x5 received Data: x5 x20 x800220 x0
2016-12-14T11:53:49.334Z cpu30:33490)WARNING: lpfc: lpfc_mbx_cmpl_read_topology:3247: 0:1303 Link Up Event x6 received Data: x6 x0 x5 x0 x0
2016-12-14T11:53:52.337Z cpu25:33493)WARNING: lpfc: lpfc_mbx_cmpl_read_topology:3247: 1:1303 Link Up Event x6 received Data: x6 x0 x5 x0 x0
2016-12-14T11:53:52.452Z cpu25:33493)WARNING: lpfc: lpfc_sli4_async_fip_evt:5702: 1:2546 New FCF event, evt_tag:x7, index:x0
2016-12-14T11:53:52.479Z cpu24:33396)PowerPath:PowerPlatformScsiIoErrorIsRetryable: cmd=0x12 Failed H: 0xc S: 0x0 P: 0x0 Path=vmhba0:C0:T1:L0
2016-12-14T11:53:52.505Z cpu25:33493)WARNING: lpfc: lpfc_do_scr_ns_plogi:8098: 1:3334 Delay fc port discovery for 10 seconds
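To see how the adapter events and the PowerPath failures interleave, the relevant messages can be merged into a single ordered timeline. This is a rough Python sketch under stated assumptions: the marker strings are copied from the excerpts in this article, the labels are made up for readability, and `boot_timeline` is a hypothetical helper name.

```python
from datetime import datetime

# (substring to look for, label to report). Substrings come from the log
# excerpts in this article; the labels are illustrative only.
MARKERS = [
    ("Link Down Event", "link down"),
    ("Link Up Event", "link up"),
    ("Delay fc port discovery", "discovery delayed"),
    ("PowerPlatformScsiIoErrorIsRetryable", "inquiry failed"),
    ("MpxRecognize failed", "path not claimed"),
    ("Successfully claimed path", "path claimed"),
]


def boot_timeline(log_text):
    """Return a time-sorted list of (timestamp, label) for known events."""
    events = []
    for line in log_text.splitlines():
        stamp, _, rest = line.partition("Z ")
        try:
            when = datetime.strptime(stamp, "%Y-%m-%dT%H:%M:%S.%f")
        except ValueError:
            continue  # line does not start with a vmkernel timestamp
        for needle, label in MARKERS:
            if needle in rest:
                events.append((when, label))
                break
    return sorted(events)
```

Inquiry failures that fall between a link down event and the end of the delayed port discovery are consistent with the adapter-side transient errors described here, although the timeline alone does not prove causation.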
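To claim a path, the inquiry must succeed. Because of host adapter errors during the boot process, the inquiry fails, so PowerPath does not claim the device.
This is not a PowerPath issue.
Our recommendation is to contact VMware or the adapter vendor to determine the cause of these transient failures during host boot: host retry (0xc) and no connect (0x1). Once these adapter-related transient errors are resolved, PowerPath should have no problem claiming the devices.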