11 Posts

December 23rd, 2009 17:00

OMSA: No controllers found

OMSA tools (command line and web-based) detect the PERC 5 controller but display neither it nor any of the array info. MegaCli displays everything fine.

I have been battling this problem for several days. I tried several solutions suggested in various places on the net, to no avail. Please note that the OMSA daemons appear to recognize the controller and see the disks; however, both omreport and the web-based GUI complain that no controllers were found.


Symptom of the problem:


# omreport storage controller
No controllers found


The web-based OMSA GUI also displays this message in the "Storage" folder, which is otherwise empty. The other folders ("Main System Chassis" and "Software") are populated and seem to be working fine.
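For anyone scripting a check for this symptom, here is a minimal sketch. It only looks for the "No controllers found" message shown above. The function name controller_state is made up for illustration, and treating any other non-empty output as "present" is an assumption, since this thread never shows what a successful listing looks like.

```shell
# Hypothetical helper: classify the output of `omreport storage controller`.
# It reads the command output on stdin, so it can be tried without OMSA installed.
controller_state() {
    local out
    out=$(cat)
    case "$out" in
        "")                       echo "no-output" ;;  # omreport printed nothing
        *"No controllers found"*) echo "missing" ;;    # the symptom from this thread
        *)                        echo "present" ;;    # assumption: anything else is a listing
    esac
}

# Usage (assumes omreport is on PATH):
#   omreport storage controller | controller_state
```

Reading from stdin keeps the parsing separate from the OMSA call, so the same function can be run against saved output.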

Hardware: DELL PowerEdge 2950

System:
CentOS 5.4 64-bit

# uname -a
Linux peta.swmed.edu 2.6.18-164.9.1.el5 #1 SMP Tue Dec 15 20:57:57 EST 2009 x86_64 x86_64 x86_64 GNU/Linux



PERC Controller: PERC 5/i Integrated

# ./MegaCli -AdpAllInfo -aALL

Adapter #0

==============================================================================
                    Versions
                ================
Product Name    : PERC 5/i Integrated
Serial No       : 12345
FW Package Build: 5.0.2-0003

                    Mfg. Data
                ================
Mfg. Date       : 00/00/00
Rework Date     : 00/00/00
Revision No     :
Battery FRU     : N/A

                Image Versions in Flash:
                ================
Boot Block Version : R.2.3.2
BIOS Version       : MT23
MPT Version        : MPTFW-00.06.71.00-IT
FW Version         : 1.00.02-0157
WebBIOS Version    : 1.01-021
Ctrl-R Version     : 1.02-007


OMSA Version: 6.2.0

# rpm -qa | grep -i srvadmin
srvadmin-idracadm-6.2.0-677
srvadmin-iws-6.2.0-1.18.el5
srvadmin-idrac-6.2.0-1.5.el5
srvadmin-webserver-6.2.0-1.5.el5
srvadmin-sysfsutils-6.2.0-2.1.el5
srvadmin-racsvc-6.2.0-677
srvadmin-omacore-6.2.0-1.18.el5
srvadmin-standardAgent-6.2.0-1.5.el5
srvadmin-racdrsc5-6.2.0-677
srvadmin-hapi-6.2.0-1.17.el5
srvadmin-megalib-6.2.0-1.6.el3
srvadmin-racadm5-6.2.0-677
srvadmin-deng-6.2.0-1.6.el5
srvadmin-storage-6.2.0-1.29.el5
srvadmin-idrac-components-6.2.0-677
srvadmin-storageservices-6.2.0-1.5.el5
srvadmin-racdrsc4-6.2.0-677
srvadmin-rac5-6.2.0-1.5.el5
srvadmin-storelib-6.2.0-1.11.el3
srvadmin-omilcore-6.2.0-1.9.el5
srvadmin-jre-6.2.0-1.17.el5
srvadmin-omcommon-6.2.0-1.19.el5
srvadmin-storage-populator-6.2.0-1.25.el3
srvadmin-cm-6.2.0-677
srvadmin-idracdrsc-6.2.0-677
srvadmin-rac4-6.2.0-1.5.el5
srvadmin-smweb-6.2.0-1.29.el5
srvadmin-racadm4-6.2.0-677
srvadmin-isvc-6.2.0-1.16.el5
srvadmin-base-6.2.0-1.5.el5
srvadmin-rac5-components-6.2.0-677
srvadmin-smcommon-6.2.0-1.29.el5
srvadmin-xmlsup-6.2.0-1.17.el5
srvadmin-fsa-6.2.0-1.6.el3
srvadmin-itunnelprovider-6.2.0-1.6.el5
srvadmin-rac4-components-6.2.0-677
srvadmin-all-6.2.0-1.5.el5

The OMSA software was installed as follows:
wget -q -O - http://linux.dell.com/repo/hardware/latest/bootstrap.cgi | bash
yum install srvadmin-all


Driver version:


# modinfo megaraid_sas
filename:       /lib/modules/2.6.18-164.9.1.el5/extra/megaraid_sas.ko
description:    LSI Logic MegaRAID SAS Driver
author:         megaraidlinux@lsi.com
version:        00.00.04.17
license:        GPL
srcversion:     93C6B7A0CD0B1E64061C3B4
alias:          pci:v00001028d00000015sv*sd*bc*sc*i*
alias:          pci:v00001000d00000413sv*sd*bc*sc*i*
alias:          pci:v00001000d00000071sv*sd*bc*sc*i*
alias:          pci:v00001000d00000073sv*sd*bc*sc*i*
alias:          pci:v00001000d00000079sv*sd*bc*sc*i*
alias:          pci:v00001000d00000078sv*sd*bc*sc*i*
alias:          pci:v00001000d0000007Csv*sd*bc*sc*i*
alias:          pci:v00001000d00000060sv*sd*bc*sc*i*
alias:          pci:v00001000d00000411sv*sd*bc*sc*i*
depends:        scsi_mod
vermagic:       2.6.18-164.9.1.el5 SMP mod_unload gcc-4.1
parm:           fast_load:megasas: Faster loading of the driver, skips physical devices!      (default=0) (int)
parm:           max_sectors:Maximum number of sectors per IO command (int)
parm:           cmd_per_lun:Maximum number of commands per logical unit (default=128) (int)
parm:           poll_mode_io:Complete cmds from IO path, (default=0) (int)


Same problems with driver that came with OS:
version:        00.00.04.08-RH2


All srvadmin services start OK:


# srvadmin-services.sh status
dell_rbu (module) is running
ipmi driver is running
dsm_sa_datamgrd (pid 4277) is running
dsm_sa_eventmgrd (pid 5204) is running
dsm_om_shrsvcd (pid 3706) is running
dsm_om_connsvcd (pid 5222 5221) is running

# service ipmi status
ipmi_msghandler module loaded.
ipmi_si module loaded.
ipmi_devintf module loaded.
/dev/ipmi0 exists.


Starting dsm_om_connsvcd causes the following file to be created,
indicating that the OMSA daemons do indeed see the controller:

# cat /opt/dell/srvadmin/var/log/openmanage/Inventory.xml.1


   
invcolBuild="288" timeStamp="2009-12-22T18:28:32">
    majorVersion="package redhat-release is not installed" minorVersion="2.6.18-164.9.1.el5" usingTPMmeasurements="FALSE"/>
    componentID="2331" display="OpenManage Server Administrator Managed Node">
    impactsTPMmeasurements="TRUE"> version="1.2.0" display="BIOS"/>
    subVendorID="1028" bus="2" device="e" function="0" display="PERC 5/i Integrated Controller 0" impactsTPMmeasurements="TRUE"> componentType="FRMW" version="5.0.2-0003" display="PERC 5/i Integrated Controller 0 Firmware"/>
    display="ST3750640NS"> display="ST3750640NS Firmware"/>
    display="ST3750640NS"> display="ST3750640NS Firmware"/>
    display="ST3750640NS"> display="ST3750640NS Firmware"/>
    display="ST3750640NS"> display="ST3750640NS Firmware"/>
    display="ST3750640NS"> display="ST3750640NS Firmware"/>
    display="ST3750640NS"> display="ST3750640NS Firmware"/>
    display="SAS/SATA Backplane 0:0 Backplane"> componentType="FRMW" version="1.00" display="SAS/SATA Backplane 0:0 Backplane Firmware"/>
    subDeviceID="01b2" bus="5" device="0" function="0" display="NetXtreme II BCM5708 Gigabit Ethernet rev 12 (eth0)">
    subDeviceID="01b2" bus="9" device="0" function="0" display="NetXtreme II BCM5708 Gigabit Ethernet rev 12 (eth1)">


Any help would be welcome. Sorry for the long post.

10 Posts

January 18th, 2010 10:00

I have read and followed everything in this post, but I am still having some difficulties. In my case everything installs just fine and the scripts run without errors, but whenever I restart the services, dsm_sa_datamgrd fails.

This is the only log entry I can find relating to the service:

kernel: dsm_sa_datamgrd[3375]: segfault at 00000000002e0041 rip 00000000007faff3 rsp 00000000ffa2ffb0 error 4

I have searched high and low for more information on this. There is another similar post from someone on the same OS as me (CentOS 5.4 64-bit), but no resolution there. Hoping someone here may have an idea.

Thanks!

1 Message

January 25th, 2010 16:00

I'm running 3x CentOS 5.4 servers and had the same problems as above. I tried all of the fixes suggested but things would only work sporadically and I wasn't able to pin down what was going on. In the end the solution was:

 

srvadmin-services.sh start ; srvadmin-services.sh start ; srvadmin-services.sh start

 

That's right, I ran the startup 3 times in a row, and everything works perfectly every time.
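The same idea can be written as a retry loop that stops as soon as a check passes, instead of always starting three times. This is only a sketch: retry_start and RETRY_DELAY are hypothetical names, not part of OMSA, and which check command reliably reports a healthy OMSA is not established anywhere in this thread.

```shell
# Hypothetical retry helper; the name and defaults do not come from OMSA.
# Usage: retry_start MAX_ATTEMPTS "START COMMAND" "CHECK COMMAND"
retry_start() {
    local max_attempts="$1" start_cmd="$2" check_cmd="$3" attempt=1
    while [ "$attempt" -le "$max_attempts" ]; do
        # Left unquoted on purpose so "cmd arg" splits into words.
        # The start command's own exit status is ignored; only the check decides.
        $start_cmd >/dev/null 2>&1 || true
        if $check_cmd >/dev/null 2>&1; then
            echo "ok after $attempt attempt(s)"
            return 0
        fi
        attempt=$((attempt + 1))
        sleep "${RETRY_DELAY:-5}"   # pause between attempts (seconds)
    done
    echo "still failing after $max_attempts attempt(s)" >&2
    return 1
}

# Example (assumption: the status command exits non-zero when a daemon is down,
# which this thread does not confirm):
#   retry_start 3 "srvadmin-services.sh start" "srvadmin-services.sh status"
```

Passing the check as a parameter keeps the loop independent of whichever test turns out to be trustworthy on a given box.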

 

10 Posts

January 25th, 2010 17:00

Interesting solution ; )

Didn't work here though. All I got was 3 consecutive failures.

Anyone else have any ideas?

11 Posts

January 26th, 2010 12:00

DLConsulting, I have this problem too. I start OMSA twice; please see my other thread at http://en.community.dell.com/forums/t/19313720.aspx

However, this seems to be a separate issue from what I described in this thread. Here the problem was to make OMSA just see the RAID controller. In the other thread my problem was to keep OMSA running reliably for longer than 1 minute.

48 Posts

February 10th, 2010 11:00

I had the same problem today. I logged into Dell OpenManage 6.2.0 and found no RAID storage controllers. I am running VMware ESX vSphere 4.0 (similar to Red Hat Linux). I wanted to restart the services but could not find an easy way to do so. I tried calling the Dell OM install script with "--help"; it said that was a bad option and then restarted the services (which is what I wanted anyway). After the services restarted, everything was good again.

Sounds like Dell OM 6.2.0 may not be very stable.

Here is the error from the /var/log/messages file:

Feb  5 12:11:01 kernel: [1883137.674942] dsm_sa_datamgrd[4457]: segfault at 00000000c06f0103 rip 00000000007e1fec rsp 00000000f4dc4008 error 4
Feb  5 12:11:01 Server Administrator: Instrumentation Service EventID: 1009  Systems Management Data Manager Stopped
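To see how often a daemon has crashed like this, the kernel segfault lines can be pulled out of the log with grep. A minimal sketch follows; find_segfaults is a hypothetical name, and the pattern is based only on the two log lines quoted in this thread, so other kernels may format the message differently.

```shell
# Hypothetical helper: print kernel segfault lines for one daemon.
# $1: daemon name, $2: log file (defaults to /var/log/messages)
find_segfaults() {
    # Matches lines such as "dsm_sa_datamgrd[4457]: segfault at ..."
    grep -E "$1\[[0-9]+\]: segfault" "${2:-/var/log/messages}"
}

# Usage (reading /var/log/messages usually requires root):
#   find_segfaults dsm_sa_datamgrd
#   find_segfaults dsm_sa_datamgrd /var/log/messages.1
```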

3 Posts

February 13th, 2010 21:00

"srvadmin-services.sh start ; srvadmin-services.sh start ; srvadmin-services.sh start"

 

I was having an issue with another server and I tried this solution and it worked. I am amazed.

 

thanks
