Start a Conversation

Unsolved

This post is more than 5 years old

3390

May 20th, 2017 04:00

Dell MD3000 Degraded Physical Disk Channel error

Hello;

I have a Dell PowerVault MD3000, and it is showing a degraded physical disk channel error and individual physical disk - degraded path error.

The recovery guru suggests contacting a technical support representative to fix the error and do not try to fix it myself. 

Any thoughts on how to proceed with fixing my issue? I am afraid to lose connection to my DAS and lose my data.

Below is the status of my physical disk channel:

DRIVE CHANNELS----------------------------

SUMMARY


CHANNEL PORT STATUS
1 Expansion Degraded
2 Expansion Optimal

DETAILS

DRIVE CHANNEL 1

Port: Expansion
Status: Degraded
Reason: Error threshold exceeded
Max. Rate: 3 Gbps
Current Rate: 3 Gbps
Rate Control: Switched
DRIVE COUNTS

Total # of attached physical disks: 15
Connected to: 0
Attached physical disks: 15
Expansion enclosure: 0 (15 physical disks)


CUMULATIVE ERROR COUNTS

RAID Controller Module 0

Baseline time set: 5/19/17 2:16:08 PM
Sample period (hh:mm:ss): 23:32:14
RAID Controller Module detected errors: 67
Physical Disk detected errors: 33
Timeout errors: 0
Total I/O count: 1174837

RAID Controller Module 1

Baseline time set: 5/19/17 2:16:08 PM
Sample period (hh:mm:ss): 23:13:33
RAID Controller Module detected errors: 38
Physical Disk detected errors: 8
Timeout errors: 0
Total I/O count: 52110

CAPTURED INTERVAL ERROR COUNTS

RAID Controller Module 1

Start time: {0} 5/19/17 2:16:08 PM
End time: {0} 5/19/17 2:18:50 PM
RAID Controller Module detected errors: 0
Physical Disk detected errors: 0
Timeout errors: 0
Total I/O count: 0

DRIVE CHANNEL 2

Port: Expansion
Status: Optimal
Max. Rate: 3 Gbps
Current Rate: 3 Gbps
Rate Control: Switched
DRIVE COUNTS

Total # of attached physical disks: 15
Connected to: 1
Attached physical disks: 15
Expansion enclosure: 0 (15 physical disks)


CUMULATIVE ERROR COUNTS

RAID Controller Module 0

Baseline time set: 5/19/17 2:16:08 PM
Sample period (hh:mm:ss): 23:32:14
RAID Controller Module detected errors: 0
Physical Disk detected errors: 19
Timeout errors: 0
Total I/O count: 1203323

RAID Controller Module 1

Baseline time set: 5/19/17 2:16:08 PM
Sample period (hh:mm:ss): 23:13:33
RAID Controller Module detected errors: 0
Physical Disk detected errors: 9
Timeout errors: 0
Total I/O count: 267426

Your help is highly appreciable.

Regards;

Moderator

 • 

7.1K Posts

June 13th, 2017 11:00

Hello elhammoud,

When you get the Physical disk channel error what you want to do is to check to make sure all drives are online & present. Are there any expansion enclosures attached to your MD3000? Here are the commands that need to be run:

smcli -n -p password -c "clear allPhysicalDiskChannels stats;"

smcli -n -p password -c "set physicalDiskChannel [1] status=optimal;"

smcli -n -p password -c "set physicalDiskChannel [2] status=optimal;"

If after running the commands you are still getting the error then we will need to review a support bundle from your MD3000. Also looking at your Storage Array Profile log it is showing that your MD3000 firmware could be upgraded to 07.35.39.64.

Please let us know if you have any other questions.

8 Posts

October 22nd, 2018 08:00

I am having an identical problem.  Is the reset of the error count the actual fix?  Even though the recovery gury says not to attempt to fix this myself, is the command:

smcli -n -p password -c "clear allPhysicalDiskChannels stats;"

what the Dell tech would do to fix this?

Any help is appreciated.

Bruce

Moderator

 • 

7.1K Posts

October 22nd, 2018 09:00

Hello Bruce,

When you are getting this issue if you have connection to the drives, then it is best to run the following commands to clear the errors and see if the errors come back again. 

smcli -n -p password -c "clear allPhysicalDiskChannels stats;"

smcli -n -p password -c "set physicalDiskChannel [2] status=optimal;"

Please let us know if you have any other questions.

8 Posts

October 22nd, 2018 10:00

Those commands cleared one of the errors.  I've tried this one:

smcli -n name -p password -c "set physicalDisk [0,8] operationalState=optimal;"

but the error doesn't clear.  Do I need to wait for it to clear?

Bruce

Moderator

 • 

7.1K Posts

October 22nd, 2018 12:00

Hello Bruce,

What you will need to do is to clear the drive stats on both channel 1 & 2.  If that doesn’t resolve your issue then we will need to look at a support bundle to see what is going on.

Please let us know if you have any other questions.

8 Posts

October 22nd, 2018 13:00

I've run both of these commands:

smcli -n name -p password -c "clear allphysicaldiskchannels stats;"

smcli -n name -p password -c "set physicalDisk [0,8] operationalState=optimal;"

but the error persists.

I don't see an attachment link for the support bundle.  How can I get that to you?

Bruce

Moderator

 • 

7.1K Posts

October 23rd, 2018 06:00

Hello Bruce,

I will send you a private message so that you can send me the logs so that I can review them.

Please let us know if you have any other questions.

8 Posts

October 23rd, 2018 08:00

Thank you for the quick reply. I will swap out the disk and report my findings.

Moderator

 • 

7.1K Posts

October 23rd, 2018 08:00

Hello Bruce,

Thanks for the support bundle as it helps to look into your issue. What I am seeing is that the drive in 0,8 is throwing a lot of errors.  It is under the limit to trigger a predicted failure, but it is throwing enough to trigger the channel errors.  What I would do is to replace the drive in 0,8 & that should resolve your issue.  I don’t see an issue with the controller in slot 1 as all other drives can access both channels.

Please let us know if you have any other questions.

8 Posts

October 23rd, 2018 10:00

While the disk group is under construction, I had a thought. Is there anyway of using a non-Dell certified disk in this SAN? I'm finding it increasingly hard to find Dell certified ones as spares. Bruce

Moderator

 • 

7.1K Posts

October 23rd, 2018 13:00

Hello Bruce,

You can’t not use non-Dell drives in an MD3000.  If the drives don’t have dell firmware loaded on them then the drive is spun down and marked as bad.

Please let us know if you have any other questions.

8 Posts

October 24th, 2018 10:00

The new disk has rebuilt.  All seems well except for one error in the Recovery Guru:

Individual Physical Disk - Degraded Path
Channel: 0
Related physical disks: (Unknown)

I've tried these commands, but none clear the error:

smcli -n name -p password -c "clear allphysicaldiskchannels stats;"
smcli -n name -p password -c "set physicalDisk [0,8] operationalState=optimal;" (since disk 8 was the problem child)
smcli -n name -p password -c "set physicaldiskchannel [0] status=optimal;"

Ideas?

8 Posts

October 25th, 2018 08:00

It did come with its own carrier, but there was no interposer board installed.

Bruce

Moderator

 • 

7.1K Posts

October 25th, 2018 08:00

Hello Bruce,

The drive that you replaced in slot 8 did it come in its own carrier? If it didn’t come in its own carrier, was there an interposer board on the old drive.  If there was an interposer on the old drive did you put it on the new drive?

Please let us know if you have any other questions.

8 Posts

October 25th, 2018 08:00

Nor was there an interposer board on the drive I pulled out.

Bruce

No Events found!

Top