RecoverPoint: Consistency Group in an Error State: Snap Interface Cannot Be Accessed on All Volumes
Summary: RecoverPoint Consistency Groups (CGs) can enter into an error state with the error, "Snap interface cannot be accessed on all volumes."
Symptoms
The following log print is present within the RecoverPoint Control logs:
XXXX/04/19 08:51:36.633 - #1 - 5969/5141 - WorkManager: GroupCopy(XXXXXXXXX SiteUID(0xXXXXXXXXXX) 0): Action removeSMPs failed! value.arrayRvCode() = e_OPERATION_IN_PROGRESS value.errorStrings() = [Device is in use; Operations like delete and expand cannot be performed]
XXXX/05/09 15:41:19.496 - #1 - 16628/16145 - WorkManager: GroupCopy(XXXXXXXXX SiteUID(0xXXXXXXXXXX) 0): Action removeSMPs failed! value.arrayRvCode() = e_API_FAILURE value.errorStrings() = [An internal error occurred during a SYMAPI operation. Please report to EMC]
XXXX/05/09 15:43:22.629 - #2 - 16628/16145 - WorkManager: GroupCopy(XXXXXXXXX SiteUID(0xXXXXXXXXXX) 0): Number of retries to remove SMPs exceeded maximum. Giving up
In addition, the Storage logs also contain prints highlighting the issue:
XXXX/04/19 08:52:22.766 - #1 - 6110/3773 - SymmDeleteATDev::execute: SymRPControl SYMAPI_RP_ACT_ATDEV_REMOVE for m_arraySerial = 000XXXXXXXXX m_clusterName = SB1 m_devices = Set(001BE,001C5,001C6,001C7,001C8) flag = 0 returned Device is in use; Operations like delete and expand cannot be performed
XXXX/05/08 17:23:09.514 - #1 - 20289/14917 - SymmDeleteATDev::execute: SymRPControl SYMAPI_RP_ACT_ATDEV_REMOVE for m_arraySerial = 000XXXXXXXXX m_clusterName = production m_devices = Set(002BE,002BF,0033C,0033D) flag = 0 returned An internal error occurred during a SYMAPI operation. Please report to EMC
XXXX/05/09 15:43:36.161 - #1 - 20289/14917 - GKCmdExposeConsistentSnapshot::execute: GKCmdExposeConsistentSnapshot for m_snapName = RP0XXXXXXX_7396_000XXXXXXXXX failed due to error in nice names retrieval
Cause
The system is unsuccessful in removing the Snapshot Mount Points (SMPs) from the VMAX Array when attempting to replicate RecoverPoint VMAX AF Devices. Those same SMPs are reused later on by the Array for replication purposes when this cleanup procedure does not occur. This is an illegal operation as the internal GUIDs of the SMPs are still on the blocklist within the Array and will fail to be reused.
Because these SMPs cannot be reused, the Array eventually encounters a space limitation issue, resulting in the error witnessed above.
Resolution
Workaround:
In order to resolve this issue temporarily, a reboot of all the RPAs or a restart of the Storage process on all RPAs must occur.
Resolution:
This issue is addressed in the RecoverPoint 5.1.2 code release.
To determine whether an upgrade is appropriate for your environment, contact the Dell Customer Support Center or your service representative and reference this solution ID.