It looks like this could be due to an unsupported operating system. You indicate it does not occur when booting to Windows.Dell has only validated up to ESXi 6.5 for the R320. I do understand you say you have had ESXi 6.7 working, however this is an unsupported OS and do not know what the behavior would be on this OS.
If you roll back the U3 update does it run like it did before the update?
If you don't have one, you may look at getting a dedicated port card. If this OS has a lock on the NIC ports then giving the DRAC a dedicated port may help.
Pn#2827M - ASSY Riser Card, Server Chassis, Advanced Management Expansion Adapter , iDRAC Port Card
My last changes on this server was last month, but I don't think that's the cause: - Changing CPU to a E5-2470V2 - Updating ESXi 6.7 to 6.7U3, enabling SNMP on the iDRAC interface to monitor it.
This “updating and enabling part” I’m wondering how you proceeded, if you could give us more detail.
One thing I could think of now is to reset idrac to troubleshoot.
First, thank you for your response. I updated ESXi by SSH with the following command: "esxcli software profile update".
For the SNMP part, I simply enabled the service option in the iDRAC webUI.
I tried the RACADM resets (racreset and even racresetcfg) but the problem is sill present.
I'm not particularly familiar with the RACADM utility but I tried some "get" command to view config and I noticed that the link to the LOM interface seems to be not detected when ESXi is running:
LOM Status: NIC Selection = LOM1 Link Detected = No Speed = Unknown Duplex Mode = Unknown Active NIC = None Active LOM in Shared Mode = None FailOver LOM in Shared Mode = None
I have 2 build-in NIC connected to a switch (first used for iDRAC & ESXi Management, second for generic VM Network). I also have 2 PCIe network cards with 4xGbE for my others VMs in my VLAN.
I tried to change the ESXi Management NIC in to another interface not used by iDRAC but the problem persists.
Thanks for your response. Effectively, I haven't checked the compatibility with the version 6.7 and it's probably the problem. The network drivers seems to have changed between the 6.7U1 and the U3.The new drivers seems to have a lock on the NICs, like you said.
I tried to rollback my updates but I always got an error, so I tried another thing:
I changed the ESXi "ntg3" driver with the old "tg3" driver, I rebooted twice and then the iDRAC started to respond again. It has been working for 3 days now. I did not dare to reboot one more time, but now I know that is probably a driver problem.
I might consider getting a dedicated iDRAC card if the problem persists. As long as it works like this, I won't change anything. It is anyway a homelab and not a production server.
DELL-Charles R
Moderator
•
4.7K Posts
0
June 22nd, 2021 08:00
Hello gabriel-d,
Do you have a DRAC Dedicated port you can try?
It looks like this could be due to an unsupported operating system. You indicate it does not occur when booting to Windows. Dell has only validated up to ESXi 6.5 for the R320. I do understand you say you have had ESXi 6.7 working, however this is an unsupported OS and do not know what the behavior would be on this OS.
Supported OS : https://dell.to/3d3lOaa
VMware ESXi 6.5
If you roll back the U3 update does it run like it did before the update?
If you don't have one, you may look at getting a dedicated port card. If this OS has a lock on the NIC ports then giving the DRAC a dedicated port may help.
Pn#2827M - ASSY Riser Card, Server Chassis, Advanced Management Expansion Adapter , iDRAC Port Card
DELL-Young E
Moderator
•
5.4K Posts
0
June 20th, 2021 19:00
Hi, thanks for choosing Dell.
From what you said
My last changes on this server was last month, but I don't think that's the cause:
- Changing CPU to a E5-2470V2
- Updating ESXi 6.7 to 6.7U3, enabling SNMP on the iDRAC interface to monitor it.
This “updating and enabling part” I’m wondering how you proceeded, if you could give us more detail.
One thing I could think of now is to reset idrac to troubleshoot.
https://dell.to/3wNiUOI
gabriel-d
3 Posts
0
June 22nd, 2021 08:00
Hi,
First, thank you for your response.
I updated ESXi by SSH with the following command: "esxcli software profile update".
For the SNMP part, I simply enabled the service option in the iDRAC webUI.
I tried the RACADM resets (racreset and even racresetcfg) but the problem is sill present.
I'm not particularly familiar with the RACADM utility but I tried some "get" command to view config and I noticed that the link to the LOM interface seems to be not detected when ESXi is running:
LOM Status:
NIC Selection = LOM1
Link Detected = No
Speed = Unknown
Duplex Mode = Unknown
Active NIC = None
Active LOM in Shared Mode = None
FailOver LOM in Shared Mode = None
I have 2 build-in NIC connected to a switch (first used for iDRAC & ESXi Management, second for generic VM Network). I also have 2 PCIe network cards with 4xGbE for my others VMs in my VLAN.
I tried to change the ESXi Management NIC in to another interface not used by iDRAC but the problem persists.
gabriel-d
3 Posts
0
June 29th, 2021 12:00
Hello,
Thanks for your response. Effectively, I haven't checked the compatibility with the version 6.7 and it's probably the problem. The network drivers seems to have changed between the 6.7U1 and the U3.The new drivers seems to have a lock on the NICs, like you said.
I tried to rollback my updates but I always got an error, so I tried another thing:
I changed the ESXi "ntg3" driver with the old "tg3" driver, I rebooted twice and then the iDRAC started to respond again. It has been working for 3 days now. I did not dare to reboot one more time, but now I know that is probably a driver problem.
I might consider getting a dedicated iDRAC card if the problem persists. As long as it works like this, I won't change anything. It is anyway a homelab and not a production server.
Thank you again for helping me!