1 Rookie
•
99 Posts
0
2930
August 12th, 2022 11:00
No controller found on Poweredge M630 running CentOS 7 - Disk failure?
The machine has been running CentOS 7 fine for almost a year. But I noticed some weird disk issue with /dev/sdb1 . Many files had ????? on them and the OS gave input/output error when trying to access them. So I tried to reboot it in hopes that would fix it. But now it won't boot up. It takes me to this message on the console about "no controller found". I am not even sure what a controller is? Any tips on how to debug further? I suppose I can just buy the drives to replace, but not sure which model/sizes to get or how many?
No Events found!



DELL-Chris H
Moderator
•
9.6K Posts
1
August 18th, 2022 10:00
Thisisalloneword,
The drive can report smaller if it was configured that way, without using the whole drives space. Do you have any other Virtual Disks on the contoller, and if so is the drive in question listed in both Virtual Disk? That would explain that, as far as the issue you're seeing, I would start with makeing sure the server is up to date on BIOS, iDrac, Raid Controller, and the drives.
I wouldn't think that you would need to replace anything as of now, I would update and then see if the reporting on the drives changes and identifies an issue.
You may also want to screenshot the Virtual Disk page, as well as the Physical Disk page.
Let us know what you see.
DELL-Joey C
Moderator
•
4K Posts
1
August 15th, 2022 02:00
Hi @thisisalloneword,
I'm not able to load the second image, this might due to that the image may contain your private info, like service tag. I'm unsure if you have check the BIOS to see if the server is able to detect the RAID controller. If the server is not able to boot into the OS, mainly it is not able to communicate with the hard drive. Try to check if the RAID controller and the drives are detected on hardware level.
thisisalloneword
1 Rookie
•
99 Posts
0
August 15th, 2022 14:00
Think I resolved it by formatting the partition and mounting it again in the OS. For now things are working ok, but will need to do some deeper dive to see what caused the file corruption in the first place. Is there anything in the BIOS that can check the health of the disks? They are 5+ years old, so perhaps I should replace them?
DELL-Joey C
Moderator
•
4K Posts
1
August 15th, 2022 18:00
Hi @thisisalloneword,
For M630, you may need to access to CMC logs of the chassis M1000e. You can also check in iDRAC logs for any traces of hardware failure.
Ref: https://dell.to/3w2A0da
thisisalloneword
1 Rookie
•
99 Posts
0
August 18th, 2022 08:00
Thanks, I checked the CMC and the iDRAC logs and don't see anything about disk errors. See screenshots below. The strange is thing is under "Storage" in the iDRAC it says I only have one physical disk of size 280GB. But the disk is around 3.1TB. I believe when I install CentOS 7 it partitions the disk into a 280GB and a 3.1TB partition or something.. As a safety precaution should I just replace this disk? Not sure how to do that or what brand/size/type to get?