DELL-Chris H
Moderator
•
9.7K Posts
1
December 4th, 2019 06:00
Billeuze,
With the drive showing as an online Predicted Failure, you will need to force that drive offline (page 25 here) prior to removal. After that you should be able to remove the problem drive, wait about 20 seconds, and insert the replacement. The rebuild should start automatically; if it does not, you can assign the new drive as a hot spare with the commands found on page 34 here.
Let me know how it goes.
billeuze
1 Rookie
•
14 Posts
0
December 4th, 2019 17:00
yup, it started rebuilding immediately, thanks.
brosysadm
1 Rookie
•
36 Posts
1
June 24th, 2021 10:00
Hello there,
I have a very similar question.
Suppose a failed drive was pulled out hot and is reported as missing, and a new drive was then inserted.
How do I force a rebuild onto the new drive?
Should I add it manually to the RAID where the drive failed and rebuild it?
Should I assign it as a hot spare so that it takes over for the missing drive automatically?
Or insert the drive back into the server in the same slot and do it correctly through perccli?
Thank you
Dell-DylanJ
4 Operator
•
2.9K Posts
1
June 24th, 2021 13:00
Hello,
I would recommend marking the replacement drive as a hotspare and allowing the controller to manage the rebuild, as you had mentioned in your post.
Dell-Martin S
Moderator
•
3.6K Posts
0
September 21st, 2022 06:00
Hi,
Please check https://dell.to/3S94xPK; this should answer your questions.
Did you have any other questions?
Regards Martin
plegrand1
1 Rookie
•
19 Posts
0
September 21st, 2022 06:00
Hello,
I am aware that this thread is quite old, but I have exactly the same problem on an H840 card with 15 disks in RAID 5.
One disk has problems:
Drive has flagged a S.M.A.R.T alert : yes
Adapter 1
Enclosure Device ID: 251
Slot Number: 5
Is it possible to do the same thing with "megacli", which I usually use?
megacli -PDOffline -PhysDrv[251:5] -a1
That is: set the disk offline, physically remove it, then put the new one in its place.
Does the rebuild start by itself?
I have read online that you have to put the disk in the "missing" state and then in the "removable" state before removing it. Is that really necessary when simply replacing a disk?
Thanks for your help
DELL-Chris H
Moderator
•
9.7K Posts
1
September 21st, 2022 07:00
Plegrand1,
With regard to a Predicted Failure drive that is still in an ONLINE state, you do indeed need to OFFLINE that drive prior to replacing it. This is to ensure that the bad blocks on the Predicted Failure drive don't get moved to the replacement drive. So you would offline it, replace it, and then, if the rebuild doesn't start automatically, set the new drive as a hot spare to start it.
Let me know if this answers your question.
plegrand1
1 Rookie
•
19 Posts
1
September 21st, 2022 07:00
Hello and thank you for your answer,
My question was more about whether, in this case (the same as the one at the beginning of this thread: replacement of a defective disk), it is enough to just set the disk to Offline* before removing it and inserting the new one.
* With this megacli command: megacli -PDOffline -PhysDrv[251:5] -a1
Thanks again for your help.
DELL-Chris H
Moderator
•
9.7K Posts
1
September 21st, 2022 08:00
Pascal,
That looks correct to me. Also, as a side note, if you ever need to offline a drive when you don't have access to it via CLI or GUI, you can power off the server, remove the drive, power the server back up, and then add the replacement drive once you are back in the OS.
plegrand1
1 Rookie
•
19 Posts
0
September 21st, 2022 08:00
Thanks again for your answer.
Just to be sure, can you confirm that megacli does the same job as perccli for setting a disk offline on a PERC H840?
megacli -PDOffline -PhysDrv[251:5] -a1
perccli /c1/e251/s5 set offline
Thanks again
Pascal
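For readers comparing the two syntaxes, here is a minimal dry-run sketch that builds both command lines from the same three numbers. The function names are hypothetical, and the mapping of megacli's -a1 to perccli's /c1 is simply the pairing used in the post above, not something confirmed in this thread. Nothing is executed; the commands are only printed.

```shell
#!/bin/sh
# Dry-run helpers: they only print a command line, they never run it.
# Arguments: adapter (controller) number, enclosure device ID, slot number.
# Assumption: adapter N in megacli corresponds to /cN in perccli,
# as in the poster's own pair of commands.

megacli_offline_cmd() {
    printf 'megacli -PDOffline -PhysDrv[%s:%s] -a%s\n' "$2" "$3" "$1"
}

perccli_offline_cmd() {
    printf 'perccli /c%s/e%s/s%s set offline\n' "$1" "$2" "$3"
}

# Example with the values from this thread: adapter 1, enclosure 251, slot 5.
megacli_offline_cmd 1 251 5
perccli_offline_cmd 1 251 5
```

Printing the command first and pasting it by hand keeps a typo in the enclosure or slot number from taking the wrong disk offline.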
plegrand1
1 Rookie
•
19 Posts
0
September 21st, 2022 09:00
I have to replace a disk tomorrow, so I will let you know afterwards whether everything went OK.
plegrand1
1 Rookie
•
19 Posts
0
September 21st, 2022 23:00
I would like to take advantage of this thread to ask you two small questions:
While researching how to replace disks in a RAID 5 (DELL PERC), I found that almost every site says that, in addition to setting the disk "Offline", you should set it to "Missing" and then to "Removable".
Can you explain what this means and in which cases we should proceed this way?
Since I cannot install OMSA on my Debian server, I use Megacli.
Can you tell me which tool to use with recent cards (H840, H730...):
Megacli, Perccli or Storcli?
In any case thank you for your help.
Pascal
plegrand1
1 Rookie
•
19 Posts
1
September 21st, 2022 23:00
Here is the procedure I will follow to replace my disk.
Does it look good to you?
Should I choose megacli or perccli, or even storcli, which I don't know?
----------------------
megacli -PDInfo -PhysDrv[251:5] -a1
# perccli64 /c1/e251/s5 show all
megacli -PDLocate -start -PhysDrv[251:5] -a1
# perccli64 /c1/e251/s5 start locate
megacli -PDLocate -stop -PhysDrv[251:5] -a1
# perccli64 /c1/e251/s5 stop locate
megacli -PDOffline -PhysDrv[251:5] -a1
# perccli64 /c1/e251/s5 set offline
Replacement of the disk
megacli -PDRbld -ShowProg -PhysDrv[251:5] -a1
# perccli64 /c1/e251/s5 show rebuild
If the rebuild doesn't start
megacli -PDHSP -Set -PhysDrv[251:5] -a1
# perccli64 /c1/e251/s5 add hotsparedrive
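The megacli steps above can be laid out as a dry-run checklist. This is only a sketch under stated assumptions: the helper names show and replacement_steps are hypothetical, the adapter and drive values are the ones from this post, and the script deliberately prints each command instead of running it, because the hot spare step should only be used if the rebuild did not start on its own.

```shell
#!/bin/sh
# Dry run only: every step is printed, nothing is executed.
ADAPTER=1          # controller number, as in -a1
DRIVE='[251:5]'    # enclosure:slot of the disk being replaced

# show is a hypothetical helper that prints a command with a "+ " prefix.
show() {
    printf '+ %s\n' "$*"
}

replacement_steps() {
    show megacli -PDInfo -PhysDrv"$DRIVE" -a"$ADAPTER"          # confirm the drive
    show megacli -PDLocate -start -PhysDrv"$DRIVE" -a"$ADAPTER" # locate LED on
    show megacli -PDLocate -stop -PhysDrv"$DRIVE" -a"$ADAPTER"  # locate LED off
    show megacli -PDOffline -PhysDrv"$DRIVE" -a"$ADAPTER"       # force offline
    printf '%s\n' '# physically replace the disk here'
    show megacli -PDRbld -ShowProg -PhysDrv"$DRIVE" -a"$ADAPTER" # rebuild progress
    printf '%s\n' '# only if the rebuild did not start:'
    show megacli -PDHSP -Set -PhysDrv"$DRIVE" -a"$ADAPTER"      # set as hot spare
}

replacement_steps
```

Keeping the adapter and drive address in two variables means the same address is used in every step.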
plegrand1
1 Rookie
•
19 Posts
0
September 22nd, 2022 05:00
My original configuration was as follows:
RAID 5 on a PERC H840 card
15 disks
One spare disk: Disk [251:0]
One bad disk: Disk [251:5]
So I did the following to replace the bad disk:
megacli -PDOffline -PhysDrv[251:5] -a1
Once this disk was Offline, disk [251:0] (the spare) automatically switched to "Rebuild".
I then replaced the defective disk with a new, identical disk.
The new disk was switched to "Hotspare, Spun Up" mode.
The rebuild onto this disk did not start.
Now the disks are in the following state:
Old defective disk, replaced by a new one:
megacli -PDHSP -Set -PhysDrv[251:5] -a1
Firmware state: Hotspare, Spun Up
Old spare disk
megacli -PDInfo -PhysDrv[251:0] -a1
Firmware state: Rebuild
Is this normal? Is disk [251:5] now simply the new spare, or should I do something?
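To compare the two disks at a glance, the "Firmware state:" line can be pulled out of the -PDInfo output. A minimal sketch follows, assuming the output contains a line in exactly the form quoted above; firmware_state is a hypothetical helper name, and the sample input is the two lines from this post rather than live controller output.

```shell
#!/bin/sh
# Print the value of any "Firmware state:" line found on stdin.
firmware_state() {
    sed -n 's/^Firmware state:[[:space:]]*//p'
}

# Sample lines copied from this post. On a live system the input would
# instead come from a pipe such as:
#   megacli -PDInfo -PhysDrv[251:0] -a1 | firmware_state
printf 'Firmware state: Rebuild\n' | firmware_state
printf 'Firmware state: Hotspare, Spun Up\n' | firmware_state
```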
plegrand1
1 Rookie
•
19 Posts
0
September 22nd, 2022 06:00
So in your view, it is not a problem that the "old hot spare" became "online" and the new disk became the "hot spare"?