PowerFlex How to Reset Oscillating Errors

Summary: Steps to reset PowerFlex oscillating errors.

This article applies to This article does not apply to This article is not tied to any specific product. Not all product versions are identified in this article.

Instructions

Resetting Oscillating Errors

Oscillating errors are cosmetic in nature and do not cause any issues. When customers are performing RCM upgrades, deploying a new service, or anything that bounces hosts up and down frequently enough, PowerFlex reports this as an oscillating error.

NOTE: The oscillating errors message may remain after running the reset commands below. The reason for this is that they can be cached on the current Primary MDM. In this scenario, the final step to clear the alert requires that the Primary MDM be switched to the Secondary MDM. This process is an intrusive action. This remediation step must be confirmed with the customer before proceeding. See Dell article PowerFlex Presentation Server: Unable to Clear Network Alerts Reported192262 for more information (Login in as a registered Dell Support user is required to view article).


The various oscillating errors failure counters can be reset to zero. The following command is useful when you have fixed a problem and want to ensure that an alert is no longer active in the system. We can reset counters for the entire system, per Protection Domain, or per Storage Pool.
  • SSH to the Primary MDM to run the reset command below:

Syntax

scli --reset_oscillating_failure_counters (--failure_counter (((-- protection_domain_id | --protection_domain_name ) | --all_sds) | --all_sdc | (((----protection_domain_id | -- protection_domain_name) --storage_pool_name ) | --storage_pool_id ) | --all_devices) | --all_counters)

Parameters

--failure_counter <COUNTER>


The oscillating failure counter to be reset to zero; Counters are grouped as follows:

  • Counter Group A (MDM and SDS-related counters):
mdm_sds_network_disconnections

sds_sds_network_disconnections

sds_decoupled

sds_configuration_failures

sds_receive_buffer_allocation_failures
  • Counter Group B (SDC-related counters):
sdc_mdm_network_disconnections

sdc_sds_network_disconnections

sdc_long_operations

sdc_memory_allocation_failures

sdc_socket_allocation_failures
  • Counter Group C (SDS device-related counters):
sds_device_long_successful_ios
 
--protection_domain_id <ID>         Protection Domain ID

--protection_domain_name <NAME>   - Protection Domain name

--storage_pool_name <NAME>        - Storage Pool name

--storage_pool_id <ID>            - Storage Pool ID

--all_sds                         - Reset the counter parameters on all SDSs

--all_sdc                         - Reset the counter parameters on all SDCs

--all_devices                     - Reset the counter parameters on all SDS devices

--all_counters                    - Reset all counters to zero


Example (All SDC, SDS, MDM Counters):

# scli --log in --username admin –password <MDM Password>

# scli --reset_oscillating_failure_counters --all_counters --all_sdc

# scli --reset_oscillating_failure_counters --all_counters --all_sds

# scli --reset_oscillating_failure_counters --all_counters --all_mdm


Example (Specific Counter):

# scli --log in --username admin –password <MDM Password>

# scli --reset_oscillating_failure_counters --failure_counter mdm_sds_network_disconnections --all_sds


Additional Information - Specific Counter Details:

sds_configuration_failures              - measures the number of MDM's failures to configure an SDS when connecting to an SDS (failures in the reconfiguration phase)
sds_receive_buffer_allocation_failures  - measures the number of times SDS fails to allocate buffer for receiving messages

sdc_mdm_network_disconnections
sdc_sds_network_disconnections
sdc_long_operations                     - measures the number of SDC RPC operations which took more time than a predefined threshold
sdc_memory_allocation_failures          - measures the number of memory allocation failures in each SDC
sdc_socket_allocation_failures          - measures the number of socket allocation failures in each SDC

sds_device_long_successful_ios          - measures the number of successful IOs to an SDS device that took longer than a predefined threshold

The default threshold is 250 milliseconds.
 

Affected Products

PowerFlex rack, ScaleIO
Article Properties
Article Number: 000213116
Article Type: How To
Last Modified: 21 Jan 2025
Version:  6
Find answers to your questions from other Dell users
Support Services
Check if your device is covered by Support Services.