Unsolved

This post is more than 5 years old

647

January 10th, 2020 05:00

PV MD3200i with no critical alert but not functionning

         Hi world,

     Our very old MD3200i is not functioning very well, some VM using isccsi virtual disks on it have sluggish copy rate of 3Mo/s! VMs seems to work no errors, no other performance issue could be observed but an application running on one of those can't run some sql requests  because of timeout.

As there was no critical error i decided to rebuild a clean datastore beside the defect one but before going to copy things i decided to take a look to other messages in log...

Here is what i found:

Datastore 1 the one i want to reinit

Date/heure : 09/01/20 11:06:54
Numéro de séquence : 107105
Type d'événement : 201F
Catégorie d'événement : Interne
Priorité : Informatif
Description : VDD repair completed
Codes spécifiques aux événements : 0/0/0
Type de composant : Module de contrôleur RAID
Emplacement du composant : Boîtier 0, Logement 1
Journalisé par : Module de contrôleur RAID dans le logement 1

Données brutes :
4d 45 4c 48 03 00 00 00 61 a2 01 00 00 00 00 00
1f 20 48 00 3e fb 16 5e 0a 00 01 00 f9 41 24 14
01 00 00 00 04 00 00 00 22 00 00 00 22 00 00 00
08 00 00 00 00 00 00 00 02 00 00 00 01 00 00 00
0a 00 00 00 01 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
01 00 00 00 00 00 01 01 14 00 00 00 10 00 13 06
80 20 00 04 00 00 00 00 10 da 09 01 00 00 00 00

 

Date/heure : 08/01/20 22:16:39
Numéro de séquence : 107080
Type d'événement : 5023
Catégorie d'événement : Commande
Priorité : Informatif
Description : RAID Controller Module return status/function call for requested operation
Codes spécifiques aux événements : 1/b8/0
Type de composant : Module de contrôleur RAID
Emplacement du composant : Boîtier 0, Logement 0
Journalisé par : Module de contrôleur RAID dans le logement 0

Données brutes :
4d 45 4c 48 03 00 00 00 48 a2 01 00 00 00 00 00
23 50 30 00 b7 46 16 5e 00 00 00 00 00 80 00 00
00 00 00 00 03 00 00 00 22 00 00 00 22 00 00 00
08 00 00 00 00 00 00 00 01 00 00 00 01 00 00 00
0a 00 00 00 01 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
01 00 00 00 00 00 00 01 0c 00 00 00 08 00 14 08
b8 00 00 00 01 00 00 00

Date/heure : 01/01/20 19:01:42
Numéro de séquence : 106887
Type d'événement : 210A
Catégorie d'événement : Interne
Priorité : Informatif
Description : RAID Controller Module cache not enabled or was internally disabled
Codes spécifiques aux événements : 0/0/0
Type de composant : Module de contrôleur RAID
Emplacement du composant : Boîtier 0, Logement 0
Journalisé par : Module de contrôleur RAID dans le logement 0

Données brutes :
4d 45 4c 48 03 00 00 00 87 a1 01 00 00 00 00 00
0a 21 48 00 86 de 0c 5e 00 00 00 00 00 00 00 00
00 00 00 00 04 00 00 00 22 00 00 00 22 00 00 00
08 00 00 00 00 00 00 00 01 00 00 00 01 00 00 00
0a 00 00 00 01 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
01 00 00 00 00 00 00 00 00 00 00 00

 

Datastore 2 the one in operation

Date/heure : 09/01/20 11:06:54
Numéro de séquence : 107101
Type d'événement : 1016
Catégorie d'événement : Erreur
Priorité : Informatif
Description : Physical Disk returned unrecoverable media error
Codes spécifiques aux événements : 3/11/0
Type de composant : Disque physique
Emplacement du composant : Boîtier 0, Logement 10
Journalisé par : Module de contrôleur RAID dans le logement 1

Données brutes :
4d 45 4c 48 03 00 00 00 5d a2 01 00 00 00 00 00
16 10 11 10 3e fb 16 5e 0a 00 01 00 0a 01 00 00
01 00 00 00 01 00 00 00 22 00 00 00 22 00 00 00
01 00 00 00 00 00 00 00 0b 00 00 00 01 00 00 00
0a 00 00 00 01 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
01 00 00 00 02 00 01 03 44 00 00 00 12 00 0d 01
f0 00 03 01 09 da 10 18 00 00 00 00 11 00 00 80
00 4d 00 00 20 00 0e 81 cc dd bb 28 10 01 09 da
10 00 00 06 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 03 00 0e 81 00 00 00 00

 

Date/heure : 16/12/19 17:12:27
Numéro de séquence : 106632
Type d'événement : 100A
Catégorie d'événement : Erreur
Priorité : Informatif
Description : Physical Disk returned CHECK CONDITION
Codes spécifiques aux événements : 6/2a/1
Type de composant : Disque physique
Emplacement du composant : Boîtier 0, Logement 8
Journalisé par : Module de contrôleur RAID dans le logement 1

Données brutes :
4d 45 4c 48 03 00 00 00 88 a0 01 00 00 00 00 00
0a 10 11 20 eb ac f7 5d 08 00 01 00 0a 01 00 00
03 00 00 00 01 00 00 00 22 00 00 00 22 00 00 00
01 00 00 00 00 00 00 00 09 00 00 00 01 00 00 00
0a 00 00 00 01 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
01 00 00 00 01 00 01 03 44 00 00 00 12 00 0d 01
70 00 06 00 00 00 00 0a 00 00 00 00 2a 01 00 00
00 00 00 00 20 00 0e 81 cc dd bb 4d 00 58 00 00
00 00 ff fe 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 03 00 0e 81 00 00 00 00

 

So, can i trust datastore 1? What can i do for further investigation?

Datastore 2 has disks problems, disk 8 is a new one in spare status, but disk 10 ...

Another thing i would like to know is why i have those 3 triangles when i clic show  physical disks, datastore 2 is 5,6,7,9,10,11?

Conclusion, would you keep this SAN in production any longer?

 

Greetings

Gilles

 

3 Attachments

Moderator

 • 

7.7K Posts

January 10th, 2020 10:00

Hello Gilles,

So looking at the log file that you provided I would say that you need to start by upgrading your firmware on your controllers.  the firmware that you are using is 8yrs old.  Once the firmware has been updated on both controllers and HDD’s, then we can look into your performance issues.  Here is the link to the firmware and resource dvd.

Here are the steps for doing a firmware upgrade just in case you need them.

 

Controller Firmware

https://www.dell.com/support/home/us/en/04/drivers/driversdetails?driverid=4cp5x&oscode=w12r2&productcode=powervault-md3200i

Resource dvd

https://www.dell.com/support/home/us/en/04/drivers/driversdetails?driverid=r9g1x&oscode=w12r2&productcode=powervault-md3200i

HDD/ SDD

https://www.dell.com/support/home/us/en/04/drivers/driversdetails?driverid=x50j8&oscode=w12r2&productcode=powervault-md3200i

 

Firmware Upgrade steps:

  1. Gather support bundle in MDSM
  2. WARNING: If you have a single controller PowerVault MD3200/MD3600 series storage array you must stop all I/O operations before starting the firmware upgrade.
  3. Extract the firmware to folders and remember location.
  4. Burn or mount the ISO for the resource cd.
  5. Uninstall the MDSM from hosts (reboot required)
  6. Install MDSM from resource DVD (reboot required)
  7. Clear the Major Event Log.
  8. Update to 07.75.28.60 if not already there, then update to latest firmware.

O  If you receive an error while checking the SPM database, ensure that you have an out of band management connection to both RAID controllers.

  1. Verify connection and data.
  2. Update the hard drive firmware.
  3. Reboot stack and verify all is optimal.
  4. Power down the server(s)
  5. Power down the MD32x0(i)
  6. Power down any attached storage (MD12xx)
  7. Leave the power off for 2-3 minutes

 

 Please allow for at least 60-85 minutes for the updates.  Time may vary depending on how long each reboot takes.

Please let us know if you have any other questions.

No Events found!

Top