Unsolved

7 Posts

3065

December 29th, 2020 14:00

RAID 1 drive in Predictive Failure cannot switch off the server

Hi, I have PowerEdge R820 with four drives configured with two virtual drives in RAID-1 on PERC H710P Adapter.

On of the drives is in predictive failure. I have the replacement drive ready to go.

I was told that I can just pull the failing drive and replace.

However I would like to at least take the failing disc off line, however I cannot switch the server off (or reboot it), it runs ESX host with 20 VMs, which cannot be migrated or turned off.

My only access is via iDrac 7 (0.01/2.50.50.50), and it seem like that I cannot take the drive off line using this tool.

The server has on OpenManaged installed and I cannot install it.

What are my options to replace the drive without bringing the server down?

Thank you


DTG

 

 

4 Operator

 • 

3K Posts

December 29th, 2020 18:00

Can you run racadm command "racadm help storage" and check whether it is showing option for forceonline (racadm storage help forceonline) and offline(racadm storage help forceoffline). I am not sure whether iDRAC 7 support these commands. If these commands are supported then you can use these commands to make a drive offline.

If above command is supported you can run below command to make a drive offline

racadm storage forceoffline:Disk.Bay.11:Enclosure.Internal.0-1:RAID.Slot.5-1

Note : You can run the command "racadm storage get pdisks" to get the key for physical disk which need to be used in above command

Once above command is executed you need to create a job to apply the setting. You can run below command for that

racadm jobqueue create RAID.Slot.5-1 -s TIME_NOW --realtime

Note : You can run the command "racadm storage get controllers" to get the FQDD for controller which need to be used in above command

4 Operator

 • 

3K Posts

December 29th, 2020 19:00

Can you update latest iDRAC firmware  (2.65.65.65) on the server and try. 2.60.60.60 iDRAC firmware have these commands supported.

7 Posts

December 29th, 2020 19:00

Thank you, unfortunately it seems like this version of iDrac does not support the forceoffline/forceonline commands

7 Posts

December 29th, 2020 20:00

Hi I have downloaded iDRAC-with-Lifecycle-Controller_Firmware_0GHF4_LN_2.65.65.65_A00.BIN and put that on a share on windows server.

From Firmware Update menu in iDrac, I have tried the update from "Network Share", but I get  "LC023. Cannot access network share. Credentials or network share identity information provided did not result in a network connection being established. Check network share access credentials (IP address, user name, password, share type, and so on)" yet when I test the network configuration I get 'Success RAC0606 The network connection test operation was successful"

Any suggestions?

4 Operator

 • 

3K Posts

December 29th, 2020 21:00

You can download the file "iDRAC-with-Lifecycle-Controller_Firmware_0GHF4_WN64_2.65.65.65_A00.EXE" and directly upload this file to iDRAC to update iDRAC FW.

You can refer below link for more details

https://www.dell.com/support/kbdoc/en-in/000134013/dell-poweredge-update-the-firmware-of-single-system-components-remotely-using-the-idrac#idrac78 

7 Posts

December 30th, 2020 16:00

Thank you.
I have been able to finally update to version 2.65.65.65
Eventually I have used the HTTP option for updating, where for the HTTP Address field I have used downloads.dell.com, the UserName and Password fields were left blank, after couple of minutes list of available updates was shown.
The Job was delegated to a Job que and was sitting on Status Completed 0%. I left it overnight, and in the morning, it was still showing Status Completed 0%. However, after opening the details of the job, the Message RED001: Job completed successfully was shown.
This is confirmed by the Server Information, where the Firmware Version shows 2.65.65.65
I can confirm now that after executing racadm help storage command from the command line, I can see the options:
racadm raid forceonline:
racadm raid forceoffline:

However, I am not on site today, so I will not be able to test until next week. I will report the result then

DTG

7 Posts

January 3rd, 2021 13:00

I have executed the following command 

racadm storage forceoffline:Disk.Bay.2:Enclosure.Internal.0-1:RAID.Slot.7-1

The feedback after the command execution was:

RAC1040 : Successfully accepted the storage configuration operation. To apply the configuration operation, create a configuration job, and then restart the server. To create the required commit and reboot jobs, run the jobqueue command.

So next I executed the following command as suggested

racadm jobqueue create RAID.Slot.7-1 -s TIME_NOW –realtime

However I get this

ERROR: STOR081 : The job could not be created  because the reboot type selected for the job creation and the reboot type required for pending operations do not match. Change the reboot type for job creation and retry the operation.

 

Any suggestions?

4 Operator

 • 

3K Posts

January 3rd, 2021 20:00

From the error message it looks like you need to reboot the server to apply the setting using iDRAC 7.

7 Posts

January 3rd, 2021 20:00

Ok, unfortunately that is the whole problem, I cannot reboot the server

 

1 Message

August 8th, 2021 20:00

Hi, May I know if you were able to pull this off? the forceoffline command?

 

Thank you

Moderator

 • 

2.9K Posts

August 9th, 2021 01:00

Hello there,

 

I looked into the topic and then looked at the command guide for PERC here.https://dl.dell.com/topicspdf/cli_guide_en-us.pdf  In this document, I see that the [Force] command can be added to many commands, but I could not see it for offline. Frankly, I was wondering if they were able to get results from this command for offline.

 

Regards,

 
No Events found!

Top