Start a Conversation

This post is more than 5 years old

Solved!

Go to Solution

16843

March 30th, 2014 11:00

Isilon unable to detect new drive

Hi,

Is anyone running on OneFS 7.0.2.4 facing issue that Isilon is unable to detect new drive, the new drive is a replacement of failed drive.

Node: NL400

Thanks,

Damal

2 Posts

May 13th, 2014 12:00

In the mean time, this issue can generally be resolved by powering the node down (not just rebooting, shut it down completely) and then turning it back on. You can leave the new drive inserted, the node should pick it up during startup.

11 Posts

March 30th, 2014 13:00

Hi Damal,

What is the status of the drive?

By status I mean, [HEALTHY], [NEW], [REPLACE], etc.

You can check that out by :

     # isi_for_array -s isi devices | grep -v HEALTHY

Try using these commands for adding the drive:

     # isi devices -a add -d :

     # isi devices -a format -d :

If it still doesn't get added, try rebooting the node.

You can also refer to this: NL 400 Drive Replacement Guide

Cheers,

Anurag

165 Posts

March 30th, 2014 15:00

The new drive installed, status shows up as empty. I am not sure if format is performed but i cant check on that.

We can drain the client connections and reboot the node, but reboot is not preferred. Actually the node rebooted by itself and not sure why it is

11 Posts

March 31st, 2014 10:00

If status is empty even after adding the drive physically, try adding it the via command:

# isi devices -a add -d :

165 Posts

March 31st, 2014 17:00

I believe that step is part of procedure that CE is following to replace the disk.

But i will double check with him.

About the node reboot, support determined its a bug which will be fixed in 7.1.x.x, reboot caused by node panics - page fault in kernel module

1.2K Posts

April 1st, 2014 03:00

Hi Damal, what is the Issue ID for this (panic/reboot)?

Thanks

-- Peter

165 Posts

April 1st, 2014 07:00

Panic.JPG.jpg

The bug number is -119807


Is this what you asked for Peter.


Also i checked with few of my other colleagues about Isilon not detecting the replacement disk until node reboot, and they never came across this. They are on 7.0.2.4 as well. Is that a know issue ?

1.2K Posts

April 2nd, 2014 02:00

Thanks Damal

Just checked, no sight of 119807 in the 7.1.0.1 MR release notes.

As for the replaced disk not being recognized,

this has happened twice for us, but on 6.5 and IQ108NL.

In one case, just another drive was shipped that worked.

In the other case, we tried out three drives without luck.

Replaced the controller, still no drives worked in the particular slot.

Finally the whole chassis got replaced.

Cheers

-- Peter

April 2nd, 2014 06:00

Is the model of the new drive the same as the old drive?  EMC has been known in the past to change drive manufacturers and then you find out your drive is no longer on the supported list for the firmware you're running.

Double-check the node and drive firmware packages you have installed and update if necessary (which unfortunately requires node reboots). 

165 Posts

April 2nd, 2014 06:00

When the node rebooted we actually had disk failures in two different nodes - one in each node and after drive replacements, both the nodes has to be rebooted to detect the new drives. So i am not sure if it a s/w or h/w fault but i noticed its happening in my environment only - OneFS 7.0.2.4.

The chassis replacement you mentioned about, were you on 7.0.2.4 ?

165 Posts

April 2nd, 2014 07:00

Not aware of drive model but i can check with the CE. Is there a way to check the drive model and manufacturer from CLI ? 

The current firmware is from image below. Is there place where i can check the drive compatibility with the firmware on node ?

Drivefirm.JPG.jpg

April 2nd, 2014 08:00

You can check the drive models with:

isi_radish -q

This also lists the firmware of the individual drives.  Go to the EMC support site, grab the drive release notes PDF, and see what's supported.

You can compare the drive models with the physical label of the drive your cluster won't see.  Your node firmware package looks recent.

1.2K Posts

April 2nd, 2014 09:00

> The chassis replacement you mentioned about, were you on 7.0.2.4 ?


6.5.5.11


-- P.

165 Posts

April 2nd, 2014 10:00

Ive gone through the docs

My understanding is we should have firmware revision: SN03 for drives, but they have SN02 which may not be compatible.

Bay 16/da26   is ST4000NM0033-9ZM170 FW:SN02

Bay 17/da8    is ST4000NM0033-9ZM170 FW:SN02

Bay 18        is not a recognized drive

Bay 19/da9    is ST4000NM0033-9ZM170 FW:SN02

Bay 20/da27   is ST4000NM0033-9ZM170 FW:SN02

Firm.JPG.jpg

117 Posts

April 2nd, 2014 11:00

There is a known issue related to drive add resulting in an 'empty' status.  Bug# 122361.  I would recommend you open an SR to validate if this is what you are hitting and the expected fix date for this issue.

No Events found!

Top