SC Storage Customer Notification: Long Running Management Process May Cause Controller Reset

Summary: This article describes a situation where a Storage Center OS safety mechanism, triggered by a long-running management process, may proactively reset a controller to avoid a potential service-affecting event. ...


Symptoms

On Storage Center OS (SCOS) 7.3, a controller may reset due to a kernel safety mechanism. This safety mechanism recovers kernel processes that are either waiting longer than expected to run or have an excessive number of queued threads. When the mechanism is triggered, the controller reboots itself so that this state does not impact production I/O. Without the safety mechanism, the outstanding kernel processes could lead to a service-affecting event.
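The trigger conditions described above can be sketched in a few lines. This is only an illustration of the general watchdog idea, not Storage Center OS code: the threshold values, the ProcessState fields, and the should_reset function are all hypothetical names and numbers chosen for the example, since the article does not publish the actual limits.

```python
from dataclasses import dataclass

# Hypothetical limits for illustration only; the real values are not documented here.
MAX_WAIT_SECONDS = 30.0
MAX_QUEUED_THREADS = 1000


@dataclass
class ProcessState:
    name: str
    wait_seconds: float   # how long the process has been waiting to run
    queued_threads: int   # number of threads currently queued for it


def should_reset(processes):
    """Return True if any process exceeds either illustrative limit."""
    return any(
        p.wait_seconds > MAX_WAIT_SECONDS or p.queued_threads > MAX_QUEUED_THREADS
        for p in processes
    )
```

In this sketch a single process that breaches either condition is enough to request a reset, which mirrors the "either ... or" wording of the description above.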

An ongoing investigation is focused on why these kernel processes may not be reacting as expected, triggering this safety mechanism. 

This issue has been resolved in SCOS 7.4.2 and higher. Upgrading is highly recommended to avoid a recurrence of the unplanned resets.
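For administrators tracking several systems, a small script can flag which ones are below the fixed release. The following is a minimal sketch that assumes the SCOS version is available as a dotted numeric string (for example "7.3.20.19", a made-up value); has_fix and parse_version are hypothetical helper names, not part of any Dell tool.

```python
# First release stated in this article to contain the fix.
FIXED_VERSION = (7, 4, 2)


def parse_version(text):
    """Turn a dotted numeric string such as '7.4.2.10' into a tuple of ints."""
    return tuple(int(part) for part in text.strip().split("."))


def has_fix(version_text):
    """Return True if the version is 7.4.2 or higher."""
    parts = parse_version(version_text)
    # Pad short versions such as '7.4' with zeros so they compare as 7.4.0.
    if len(parts) < len(FIXED_VERSION):
        parts = parts + (0,) * (len(FIXED_VERSION) - len(parts))
    return parts >= FIXED_VERSION
```

A version string that contains non-numeric components would raise ValueError in this sketch, so real version output may need cleaning first.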

Affected Products

Compellent (SC, SCv & FS Series), Dell Compellent SC4020, Dell Storage SC8000, Dell Storage SCv2000, Dell Storage SCv2020, Dell Storage SCv2080, Dell Storage SC5020, Dell Storage SC5020F, Dell Storage SC7020, Dell Storage SC7020F, Dell Storage SC9000, Dell Storage SCv3000, Dell Storage SCv3020 ...

Article Properties
Article Number: 000129562
Article Type: Solution
Last Modified: 21 Feb 2021
Version: 3