Cherry251
2 Bronze

Replication Paused by System

Observed an issue when doing test failovers. The CG status displayed" Paused by System" and the following error was shown: ERROR: One or more links of group CG_TEST are set to replicate snaps and an error occurred in the snap-based replication process. The following errors were received from the storage: Link = TEST_PROD>TEST_Copy, error = [RPC failed at server.  snapset_not_found]

I had to disable and enable CG so that replication continued normally.

What could be the reason? The system is RP with XIO.

Labels (1)
0 Kudos
7 Replies
m_makadmeh
3 Silver

Re: Replication Paused by System

Hello,

Please check KB463933 (EMC Article Number 000463933😞

RecoverPoint/XtremIO: ERROR: [RPC failed at server. snapset_not_found]

Article Content                                                                                                                                                                                                                                                                                                                             

                                                         Replication will stop and go into Error state causing a DRU (Data Replication Unavailable).

Consistency Groups replication will go into Error state.

RecoverPoint GUI and CLI (get_events_log) will show (events and command get_system_status -n) following:
ERROR: One or more links of group <group_name> are set to replicate snaps and an error occurred in the snap-based replication process. The following errors were received from the storage: Link = prod<name> -> copy<name>, error = [RPC failed at server.  snapset_not_found]

                                                            
                                                        

Due to a code bug in RecoverPoint 4.1 SP2 Patch 2 (4.1.2.2), RecoverPoint's snapshot clean-up tool may delete the current working snap causing replication to go into an error state.

                                                            
                                                        

Upgrade to RecoverPoint 4.1 SP2 Patch 2 - 4.1.2.2

                                                            
                                                        

Resolution:

EMC engineering is currently investigating this issue in RecoverPoint 4.1.2.2. A permanent fix is still in progress. Contact the EMC Customer Support Center or your service representative for assistance and reference this solution ID.


                                                            
                                                        

Note:
For more information, consult Bugzilla defect number 125418, 122249. Bugzilla access is only available to authorized Customer Service Representatives.

0 Kudos
Idan
DellEMC

Re: Replication Paused by System

Hi,

Indeed this is a known issue which was fixed.

Couple of comments

1) It was fixed at 4.1.2.3 and 4.4

2) If an upgrade is planned, it's highly recommended to go to RP 4.4.1.1 and XtremIO 4.0.4-41 (XtremIO 4.0.2-80 is currently recommended if RP isn't active on that XtremIO array)

Hope that helps,

Idan Kentor

RecoverPoint Corporate Systems Engineering

idan.kentor@emc.com


Idan Kentor
Tech Staff Engineering Technologist - Data Protection


idan.kentor@dell.com
@IdanKentor
0 Kudos
Cherry251
2 Bronze

Re: Replication Paused by System

Idan,

We are at RP 4.1 SP2 P3. How come we are still observing this issue? XIO is at 4.0.2-80.

0 Kudos
Idan
DellEMC

Re: Replication Paused by System

Hi there,

Well, there are two similar errors so it's possible that you're facing the one fixed in 4.4.

To confirm, we'll need to review the logs so I'll recommend opening a SR.

Take a look at my note above on the versions to upgrade to if indeed an upgrade would be planned.

Regards,

Idan


Idan Kentor
Tech Staff Engineering Technologist - Data Protection


idan.kentor@dell.com
@IdanKentor
0 Kudos
sd_storage
2 Bronze

Re: Replication Paused by System

Hi Idan, 

This error is also in RP version 5.1 SP2 P1 .

0 Kudos
Idan
DellEMC

Re: Replication Paused by System

It also depends on the XIOS version. Please raise an SR with support for us to further investigate.

 

Regards,

Idan


Idan Kentor
Tech Staff Engineering Technologist - Data Protection


idan.kentor@dell.com
@IdanKentor
sd_storage
2 Bronze

Re: Replication Paused by System

Thanks Idan , yes its due to old XIO version .

0 Kudos