June 3rd, 2016 11:00

A physical disk detected a warning value

Hi-

We have a failing drive with the following info gleaned from MegaCLI...

C:\>megacli64 -PDInfo -PhysDrv [32:13] -a0 -NoLog | grep -E "(Slot Number:|Count:|Firmware state:|S.M.A.R.T)"
Slot Number: 13
Media Error Count: 94
Other Error Count: 0
Predictive Failure Count: 6
Firmware state: Online, Spun Up
Drive has flagged a S.M.A.R.T alert : Yes
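For anyone who wants to check these counters from a script, here is a minimal Python sketch. It assumes the filtered output above has been saved as text; `parse_pdinfo` and `should_replace` are hypothetical helper names, and the "replace on any predictive failure or S.M.A.R.T alert" rule is an illustrative assumption, not a Dell-documented threshold.

```python
# Sample text copied from the filtered MegaCLI -PDInfo output in this post.
SAMPLE = """\
Slot Number: 13
Media Error Count: 94
Other Error Count: 0
Predictive Failure Count: 6
Firmware state: Online, Spun Up
Drive has flagged a S.M.A.R.T alert : Yes
"""


def parse_pdinfo(text):
    """Split each 'key: value' line at the first colon into a dict entry."""
    info = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep:  # skip lines that contain no colon
            info[key.strip()] = value.strip()
    return info


def should_replace(info):
    """Hypothetical rule: flag the drive if it reports any predictive
    failure or a S.M.A.R.T alert. The threshold is an assumption."""
    predictive = int(info.get("Predictive Failure Count", 0))
    smart = info.get("Drive has flagged a S.M.A.R.T alert", "No") == "Yes"
    return predictive > 0 or smart


if __name__ == "__main__":
    drive = parse_pdinfo(SAMPLE)
    print("Slot", drive["Slot Number"], "replace:", should_replace(drive))
```

The same function could be fed the unfiltered `-PDInfo` output, since lines it does not need are simply stored and ignored.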

Can we force it "Offline" and "Replace Member" instead of waiting until it fails completely?

Thanks for any input.

-SP

5 Practitioner

 • 

274.2K Posts

June 3rd, 2016 15:00

Hello.

What server and controller do you have? A drive in a predictive failure state will fail eventually. Go ahead and replace this drive; you don't need to wait for it to fail. If the drive is hot-swappable, you do not have to take it offline before replacing it.

5 Practitioner

 • 

274.2K Posts

June 3rd, 2016 15:00

There you go! Thanks..

14 Posts

June 3rd, 2016 15:00

Hi-

Thanks - I would like to swap asap.

T620 w/ PERC H710 Adapter (SAS/SATA).

Flashing orange then green right away.

Have the extra drive w/ carrier/caddy ready to go...

-SP

14 Posts

June 3rd, 2016 15:00

Also - we'd like to replace when there is less activity on server - rebuild will affect performance.

This is a data disk that holds production SQL Server databases - our primary application uses it all day.

Waiting for after hours - preferably a weekend - not sure of rebuild time - 2TB(SAS 15K) VD/Array...

5 Practitioner

 • 

274.2K Posts

June 3rd, 2016 16:00

There is no defined rebuild time as it is dependent on the size of the disk, I/O operations, etc. Good call!

14 Posts

June 6th, 2016 08:00

Finished pretty fast - in 30 min...

MegaCli64 -AdpEventLog -GetEvents -f controller_log.txt -aALL

C:\Program Files (x86)\System Utilities\Jobs\Daily Dell OMSA logs backup>type controller_log.txt

Adapter: 0 - Number of Events : 8

seqNum: 0x00001d8e
Time: Fri Jun 03 17:13:15 2016

Code: 0x0000001e
Class: 0
Locale: 0x20
Event Description: Event log cleared
Event Data:
===========
None


seqNum: 0x00001dd2
Time: Fri Jun 03 17:34:14 2016

Code: 0x00000063
Class: 0
Locale: 0x02
Event Description: Rebuild complete on VD 01/1
Event Data:
===========
Target Id: 1


seqNum: 0x00001dd3
Time: Fri Jun 03 17:34:14 2016

Code: 0x00000064
Class: 0
Locale: 0x02
Event Description: Rebuild complete on PD 0d(e0x20/s13)
Event Data:
===========
Device ID: 13
Enclosure Index: 32
Slot Number: 13


seqNum: 0x00001dd4
Time: Fri Jun 03 17:34:14 2016

Code: 0x00000072
Class: 0
Locale: 0x02
Event Description: State change on PD 0d(e0x20/s13) from REBUILD(14) to ONLINE(18)
Event Data:
===========
Device ID: 13
Enclosure Index: 32
Slot Number: 13
Previous state: 20
New state: 24


seqNum: 0x00001dd5
Time: Fri Jun 03 17:34:14 2016

Code: 0x00000051
Class: 0
Locale: 0x01
Event Description: State change on VD 01/1 from DEGRADED(2) to OPTIMAL(3)
Event Data:
===========
Target Id: 1
Previous state: 2
New state: 3


seqNum: 0x00001dd6
Time: Fri Jun 03 17:34:14 2016

Code: 0x000000f9
Class: 0
Locale: 0x01
Event Description: VD 01/1 is now OPTIMAL
Event Data:
===========
Target Id: 1
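One thing that can look confusing in the log above: the PD state change line says REBUILD(14) to ONLINE(18), while the Event Data says 20 and 24. In this log the numbers in the description appear to be hexadecimal (0x14 = 20, 0x18 = 24). A minimal Python sketch to pull the records out of `controller_log.txt` and cross-check that; `parse_events` and `state_codes` are hypothetical helper names, and the hex reading is inferred from this one log rather than from LSI/Dell documentation.

```python
import re

# One record copied from the event log in this post.
SAMPLE_LOG = """\
seqNum: 0x00001dd4
Time: Fri Jun 03 17:34:14 2016

Code: 0x00000072
Class: 0
Locale: 0x02
Event Description: State change on PD 0d(e0x20/s13) from REBUILD(14) to ONLINE(18)
Event Data:
===========
Device ID: 13
Enclosure Index: 32
Slot Number: 13
Previous state: 20
New state: 24
"""


def parse_events(text):
    """Group 'key: value' lines into one dict per event.
    A new event starts at each 'seqNum' line."""
    events = []
    current = None
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if not sep:
            continue  # blank lines and the '=====' separator
        key, value = key.strip(), value.strip()
        if key == "seqNum":
            current = {}
            events.append(current)
        if current is not None:
            current[key] = value
    return events


def state_codes(event):
    """Read the two codes in 'from NAME(xx) to NAME(yy)' as hex.
    Returns None for events that are not state changes."""
    match = re.search(
        r"from \w+\(([0-9a-fA-F]+)\) to \w+\(([0-9a-fA-F]+)\)",
        event.get("Event Description", ""),
    )
    if match is None:
        return None
    return int(match.group(1), 16), int(match.group(2), 16)


if __name__ == "__main__":
    for event in parse_events(SAMPLE_LOG):
        print(event["Time"], state_codes(event),
              event.get("Previous state"), event.get("New state"))
```

The VD line (DEGRADED(2) to OPTIMAL(3)) reads the same in either base, so only the PD event shows the difference.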

14 Posts

June 6th, 2016 08:00

Robert-

Appreciate your fast response times.

Last question related to Machines w/ Hot Swap drives and Dell(LSI?) PERC/RAID Controllers...

Do we need to look up the type of Dell machine (Gen 11, 12, 13, etc.?) and controller (H700/H710, etc.) to be sure a drive can be hot swapped before it fails, i.e. while it is in a predictive failure/S.M.A.R.T alert state?

Thanks again.

-SP

5 Practitioner

 • 

274.2K Posts

June 6th, 2016 08:00

Fantastic! Let us know when you need further assistance.

5 Practitioner

 • 

274.2K Posts

June 6th, 2016 09:00

Yes, you can always research such information. Generally, hard drives connected to backplanes are hot-swap drives, as opposed to cabled drives.

2 Posts

January 3rd, 2018 07:00

We had a similar situation with a predictive failure value of 252 and "Drive has flagged a S.M.A.R.T alert : Yes".

We replaced the drive, and now the new drive has exactly the same PF value and alert. The other two drives have zero PF.

Please advise.

7 Technologist

 • 

16.3K Posts

January 3rd, 2018 15:00

Did you force the PF drive offline before replacing?
