This post is more than 5 years old
14 Posts
0
12559
A physical disk detected a warning value
Hi-
We have a failing drive with following info gleaned from MegaCLI...
C:\>megacli64 -PDInfo -PhysDrv [32:13] -a0 -NoLog | grep -E "(Slot Number:|Count:|Firmware state:|S.M.A.R.T)"
Slot Number: 13
Media Error Count: 94
Other Error Count: 0
Predictive Failure Count: 6
Firmware state: Online, Spun Up
Drive has flagged a S.M.A.R.T alert : Yes
Can we force it "Offline" and "Replace Member" - instead, of waiting until it fails completely?
Thanks for any input.
-SP
Anonymous
5 Practitioner
5 Practitioner
•
274.2K Posts
0
June 3rd, 2016 15:00
Hello.
What server and controller do you have? A drive in a predictive fail state shall fail any ways. Go a head and replace this drive, you don't to wait for it to fail. If the drive is a hot swap, you do not have to first offline it before replacing.
Anonymous
5 Practitioner
5 Practitioner
•
274.2K Posts
0
June 3rd, 2016 15:00
There you go! Thanks..
spichelman
14 Posts
0
June 3rd, 2016 15:00
Hi-
Thanks - I would like to swap asap.
T620 w/ PERC H710 Adapter (SSA/SATA).
Flashing orange then green right away.
Have the extra drive w/ carrier/caddy ready to go...
-SP
spichelman
14 Posts
0
June 3rd, 2016 15:00
Also - we'd like to replace when there is less activity on server - rebuild will affect performance.
This is a data disk or has SQL server DB's in production on it - our primary Application utilizes it all day.
Waiting for after hours - preferably a weekend - not sure of rebuild time - 2TB(SAS 15K) VD/Array...
Anonymous
5 Practitioner
5 Practitioner
•
274.2K Posts
0
June 3rd, 2016 16:00
There is no defined rebuild time as it is dependent on the size of the disk, I/O operations, etc. Good call!
spichelman
14 Posts
0
June 6th, 2016 08:00
Finished pretty fast - in 30 min...
MegaCli64 -AdpEventLog -GetEvents -f controller_log.txt -aALL
C:\Program Files (x86)\System Utilities\Jobs\Daily Dell OMSA logs backup>type controller_log.txt
Adapter: 0 - Number of Events : 8
seqNum: 0x00001d8e
Time: Fri Jun 03 17:13:15 2016
Code: 0x0000001e
Class: 0
Locale: 0x20
Event Description: Event log cleared
Event Data:
===========
None
seqNum: 0x00001dd2
Time: Fri Jun 03 17:34:14 2016
Code: 0x00000063
Class: 0
Locale: 0x02
Event Description: Rebuild complete on VD 01/1
Event Data:
===========
Target Id: 1
seqNum: 0x00001dd3
Time: Fri Jun 03 17:34:14 2016
Code: 0x00000064
Class: 0
Locale: 0x02
Event Description: Rebuild complete on PD 0d(e0x20/s13)
Event Data:
===========
Device ID: 13
Enclosure Index: 32
Slot Number: 13
seqNum: 0x00001dd4
Time: Fri Jun 03 17:34:14 2016
Code: 0x00000072
Class: 0
Locale: 0x02
Event Description: State change on PD 0d(e0x20/s13) from REBUILD(14) to ONLINE(18)
Event Data:
===========
Device ID: 13
Enclosure Index: 32
Slot Number: 13
Previous state: 20
New state: 24
seqNum: 0x00001dd5
Time: Fri Jun 03 17:34:14 2016
Code: 0x00000051
Class: 0
Locale: 0x01
Event Description: State change on VD 01/1 from DEGRADED(2) to OPTIMAL(3)
Event Data:
===========
Target Id: 1
Previous state: 2
New state: 3
seqNum: 0x00001dd6
Time: Fri Jun 03 17:34:14 2016
Code: 0x000000f9
Class: 0
Locale: 0x01
Event Description: VD 01/1 is now OPTIMAL
Event Data:
===========
Target Id: 1
spichelman
14 Posts
0
June 6th, 2016 08:00
Robert-
Appreciate your fast response times.
Last question related to Machines w/ Hot Swap drives and Dell(LSI?) PERC/RAID Controllers...
Do we need to research or lookup the type of Dell machine(Gen 11,12,13,etc?) + Controller(H700/710, etc.) to be sure it can hot swap before failing or if in a predictive failure/Smart alert state?
Thanks again.
-SP
Anonymous
5 Practitioner
5 Practitioner
•
274.2K Posts
0
June 6th, 2016 08:00
Fantastic! Let us know when you need further assistance.
Anonymous
5 Practitioner
5 Practitioner
•
274.2K Posts
0
June 6th, 2016 09:00
Yes, you can always research on such information. Generally, hard drives connected to back-planes are hot swap drives as opposed to cabled dives
Cantel
2 Posts
0
January 3rd, 2018 07:00
we had a similar situation with a predictive failure value of 252, and Drive has flagged a S.M.A.R.T alert : Yes,
we replaced teh drive and now teh new drive has exactly teh same PF value and alert. other two drive have zero PF..
pls advise
theflash1932
9 Legend
9 Legend
•
16.3K Posts
0
January 3rd, 2018 15:00
Did you force the PF drive offline before replacing?