July 11th, 2016 12:00

Replication Paused by System

I observed an issue when doing test failovers. The CG status displayed "Paused by System" and the following error was shown: ERROR: One or more links of group CG_TEST are set to replicate snaps and an error occurred in the snap-based replication process. The following errors were received from the storage: Link = TEST_PROD>TEST_Copy, error = [RPC failed at server.  snapset_not_found]

I had to disable and re-enable the CG before replication resumed normally.

What could be the reason? The system is RP with XIO.

43 Posts

July 12th, 2016 02:00

Hello,

Please check KB463933 (EMC Article Number 000463933):

RecoverPoint/XtremIO: ERROR: [RPC failed at server. snapset_not_found]

Article Content

Replication will stop and go into Error state causing a DRU (Data Replication Unavailable).

Consistency Group replication will go into an Error state.

The RecoverPoint GUI and CLI (events from get_events_log and the get_system_status -n command) will show the following:
ERROR: One or more links of group are set to replicate snaps and an error occurred in the snap-based replication process. The following errors were received from the storage: Link = prod -> copy , error = [RPC failed at server.  snapset_not_found]

Due to a code bug in RecoverPoint 4.1 SP2 Patch 2 (4.1.2.2), RecoverPoint's snapshot clean-up tool may delete the current working snap, causing replication to go into an error state.

Upgrade to RecoverPoint 4.1 SP2 Patch 2 (4.1.2.2)

Resolution:

EMC engineering is currently investigating this issue in RecoverPoint 4.1.2.2. A permanent fix is still in progress. Contact the EMC Customer Support Center or your service representative for assistance and reference this solution ID.

Note:
For more information, consult Bugzilla defect numbers 125418 and 122249. Bugzilla access is only available to authorized Customer Service Representatives.
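
For anyone who wants to check an exported events log for this error before opening an SR, here is a minimal Python sketch. It assumes the log is plain text with one event per line, and the pattern is inferred only from the two error strings quoted in this thread, not from any documented RecoverPoint log format. The function name `find_snapset_errors` is made up for illustration.

```python
import re

# Pattern inferred from the two error strings quoted in this thread;
# this is an assumption, not a documented RecoverPoint log format.
LINK_ERROR = re.compile(
    r"Link\s*=\s*(?P<link>.+?)\s*,\s*error\s*=\s*\[(?P<error>[^\]]*)\]"
)


def find_snapset_errors(lines):
    """Return (link, error) pairs for lines reporting snapset_not_found."""
    hits = []
    for line in lines:
        match = LINK_ERROR.search(line)
        if match and "snapset_not_found" in match.group("error"):
            # Collapse repeated whitespace inside the error text.
            error = " ".join(match.group("error").split())
            hits.append((match.group("link"), error))
    return hits


sample = [
    "ERROR: One or more links of group CG_TEST are set to replicate snaps "
    "and an error occurred in the snap-based replication process. The "
    "following errors were received from the storage: "
    "Link = TEST_PROD>TEST_Copy, error = [RPC failed at server.  snapset_not_found]",
    "INFO: unrelated event",
]
print(find_snapset_errors(sample))
```

This only tells you which links reported the error; it does not say which of the underlying defects is responsible.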

28 Posts

July 12th, 2016 07:00

Idan,

We are at RP 4.1 SP2 P3. How come we are still observing this issue? XIO is at 4.0.2-80.

675 Posts

July 12th, 2016 07:00

Hi,

Indeed, this is a known issue that has been fixed.

A couple of comments:

1) It was fixed in 4.1.2.3 and 4.4

2) If an upgrade is planned, it's highly recommended to go to RP 4.4.1.1 and XtremIO 4.0.4-41 (XtremIO 4.0.2-80 is currently recommended if RP isn't active on that XtremIO array)

Hope that helps,

Idan Kentor

RecoverPoint Corporate Systems Engineering

idan.kentor@emc.com
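
As a rough aid for comparing an installed release against the first fixed release mentioned above, here is a small Python sketch. It assumes that "4.1 SP2 P3" maps to 4.1.2.3 in the same way the KB writes "4.1 SP2 Patch 2 (4.1.2.2)". The helper names are hypothetical. It only checks the 4.1.2.3 fix and says nothing about the XtremIO version or any other similar defect.

```python
import re

# First release reported in this thread as containing the fix.
FIRST_FIX = (4, 1, 2, 3)


def parse_rp_version(text):
    """Turn '4.1 SP2 P3' or '4.1.2.3' into a tuple such as (4, 1, 2, 3)."""
    text = text.strip()
    # Assumed "major.minor SPx Py" style, e.g. "4.1 SP2 P3".
    m = re.fullmatch(r"(\d+)\.(\d+)(?:\s+SP(\d+))?(?:\s+P(\d+))?", text)
    if m:
        return tuple(int(g) if g else 0 for g in m.groups())
    # Otherwise treat it as a dotted version and pad to four fields.
    parts = [int(p) for p in text.split(".")]
    return tuple(parts + [0] * (4 - len(parts)))


def has_first_fix(version_text):
    """True if the release is at or above 4.1.2.3."""
    return parse_rp_version(version_text) >= FIRST_FIX


for v in ["4.1.2.2", "4.1 SP2 P3", "4.4.1.1", "5.1 SP2 P1"]:
    print(v, has_first_fix(v))
```

A True result here does not rule out the error, so a log review through an SR is still the way to confirm the cause.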

675 Posts

July 14th, 2016 00:00

Hi there,

Well, there are two similar errors, so it's possible that you're facing the one fixed in 4.4.

To confirm, we'll need to review the logs, so I recommend opening an SR.

Take a look at my note above on the versions to upgrade to if an upgrade is planned.

Regards,

Idan

November 3rd, 2019 20:00

Hi Idan, 

This error also occurs in RP version 5.1 SP2 P1.

675 Posts

November 4th, 2019 01:00

It also depends on the XIOS version. Please raise an SR with support for us to further investigate.

Regards,

Idan

November 27th, 2019 05:00

Thanks Idan, yes, it's due to an old XIO version.
