OME 1.2 Server Discovery/Inventory Fails at 10%

Question

I was pretty excited to download 1.2 this morning, having been waiting on it for a while now. It has been installed on a WS 2012 VM under VMware. Sadly though I have added a wildcarded range which allows for SNMP, WS-Man and IPMI Discovery and at 10% the discovery service on the machine crashes, stopping the discovery process. Interestingly, this appears to be happening in an IP range which is mostly ESXi Hosts. Given theres a known VMware issue with WS-Man memory leaks, I wonder if a broken WS-Man instance may be causing the discovery process to crash?

DELL-Rob C · Answer

Hi and thanks for the post. Sorry you are running into trouble.

1. Is this a range that was working on OME 1.1/1.1.1 but is not working in OME 1.2?

2. is the range something like 10.1.1.1-255 or similar? Can you give me an example of the range?

3. Can you try breaking it up into several ranges for debugging purposes. Either by protocol or some way to narrow it down to a possible problematic system?

I don't think at this point the issue would be the VMware memory leak, but I can't say for sure now.

We may need to collect discovery logs to see if we can find any device that might be giving us the trouble. But we'll see.

Also, the Dell Troubleshooting tool is very helpful in these cases. But of course we need to get a better idea of a suspect IP address.

Thx

Rob

bcshort · Answer

Hi Rob,

Thanks for the prompt reply. Just so you are aware, This was a fresh install of OME 1.2, not an upgrade over 1.1.1 :) I do have an OME 1.1.1 instance that resides on a different server.

1) Yep, existing ranges are work in OME 1.1.1

2) Yep, this is a private range of 10.8.178.*

3) I have restricted the range to 12 IP addresses that belong to our ESX Cluster - I am pleased to say this is not causing the crash.

The issue appears to be isolated to a Windows 2008R2 server running as a VM. the SNMP configuration on the server looks fine. Dell troubleshooting tool shows things as fine as well.

It looks like my current workaround will be to exclude this machine IP from discovery and see how things go from there.

Is there any more detailed logs I can forward on for investigation?

DELL-Pupul M · Answer

Hi,

Thanks for trying to figure out where the problem is. Since you have said that it is because of a VM with Windows 2008R2 OS, so does this mean that a single discovery task on this server makes the service crash?

Can you provide the application logs which is there under "Logs" after the services crash?

bcshort · Answer

Hi Pupul, Have sent you a 'friend request' so I can share a trace log file offline of the forums. Cheers

DELL-Rob C · Answer

Ok, so the suspect box is a guest OS VM? Or it is ESX server hosting VMs?

If it is a regular Server with OMSA, the TS tool should return some sensible data along with the version of Server Administrator.

Whichever test you run, you may compare the results of a good server to the suspect one. That is often useful.

You should probably open a ticket as we will likely need to dig into logs to better identify the cause of the wayward server.

Rob

bcshort · Answer

Hi Rob,

The suspect box is a Windows Server 2008R2 Virtual Machine running on an ESXi Host. The troubleshooting tool reports back cleanly for SNMP, WS-Man Protocols, and the information is the same between working and non working servers.

I'll get a ticket opened :)

DELL-Pupul M · Answer

Hi,

Thanks for the response.

Since it is application logs which is there in the UI itself, it would be great if you can attach a screenshot. If detailed logs are required, it would be better if you open a trouble ticket soon as Rob suggested already.

Dell OpenManage Essentials

Was this post helpful?