dbreveard1
1 Copper

Replication Failing With Errors

Does anyone know what causes the error below, or a command I can run on my SAN to display more information? The job ran for about 20 hours before it died. The replication runs over a site-to-site VPN connection (5 Mbps). The source is an NX4 and the destination is an NS-120.

Slot 2: Primary Scheduler=fs45_T1_LUN6_SL7E9092800009_0000_fs70_T8_LUN25_APM00085002054_0000(alias=Valor_Replication), copying snapshot failed:DpRequest_SchedulerInactive - Message id=13160415296. Inactive.
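For scale, here is a rough ceiling on how much data a link like this can move in that time. It is only a sketch: it assumes decimal megabits and a link running at full line rate the whole time, and it ignores VPN and protocol overhead, so the real figure would be lower. The function name is made up for illustration.

```python
def max_transfer_bytes(link_mbps: float, hours: float) -> float:
    """Upper bound on bytes moved over a link running at full line rate."""
    bits_per_second = link_mbps * 1_000_000  # assumes decimal megabits
    seconds = hours * 3600
    return bits_per_second * seconds / 8


# Figures from the post: 5 Mbps VPN, job ran for about 20 hours.
ceiling = max_transfer_bytes(5, 20)
print(f"{ceiling / 1e9:.1f} GB at most")
```

Comparing that ceiling with the size of the LUN being replicated gives a quick sense of whether an initial copy could have finished in the time the job ran.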

2 Replies
Peter_EMC
3 Zinc

Re: Replication Failing With Errors

From the message id:

$ nas_message -i 13160415296
MessageID = 13160415296
BaseID    = 64
Severity  = ERROR
Component = DART
Facility  = REP
Type      = STATUS

Brief_Description  = The replication scheduler is not active.

Full_Description   = The replication internal scheduler becomes inactive. It cannot function properly.

Recommended_Action = Stop and then restart the replication session. Then, perform a modify operation after the session is restarted.
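If you need to look up several message ids, the `key = value` layout above is easy to turn into a dictionary. This is a minimal sketch that only assumes the output looks like the text pasted in this post; the function name is hypothetical and it does not call `nas_message` itself.

```python
def parse_key_value_output(text: str) -> dict[str, str]:
    """Turn 'Key = value' lines into a dict, skipping blank or other lines."""
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.partition("=")
        if sep and key.strip():
            fields[key.strip()] = value.strip()
    return fields


# Sample copied from the nas_message output in this post.
sample = """\
MessageID = 13160415296
BaseID    = 64
Severity  = ERROR
Component = DART
Facility  = REP
Type      = STATUS

Brief_Description  = The replication scheduler is not active.
"""

info = parse_key_value_output(sample)
print(info["Severity"], "-", info["Brief_Description"])
```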

dbreveard1
1 Copper

Re: Replication Failing With Errors

I agree that this is the procedure EMC recommends to resolve the issue, but I still get the same error after following it. Please be aware that I don't have a base file: replication on this LUN has never completed. Here is some output from the log files.


Feb  4 20:01:24 2010:CS_PLATFORM:NASDB:INFO:306::::1265338884:nasdb_backup: Cele
rra database backup done.
Feb  4 20:24:16 2010:CS_PLATFORM:JServer:WARNING:808::::1265340256:CLARIION SL7E
9092800009 polling interval is too short, skipping polls
Feb  4 20:29:16 2010:CS_PLATFORM:JServer:INFO:850::::1265340556:CLARIION SL7E909
2800009 - new configuration received
Feb  4 21:01:21 2010:CS_PLATFORM:NASDB:INFO:300::::1265342481:nasdb_backup: NAS_
DB checkpoint is in progress.
Feb  4 21:01:22 2010:CS_PLATFORM:NASDB:INFO:305::::1265342482:nasdb_backup: NAS_
DB checkpoint done.
Feb  4 21:01:24 2010:CS_PLATFORM:NASDB:INFO:306::::1265342484:nasdb_backup: Cele
rra database backup done.
Feb  4 22:01:21 2010:CS_PLATFORM:NASDB:INFO:300::::1265346081:nasdb_backup: NAS_
DB checkpoint is in progress.
Feb  4 22:01:22 2010:CS_PLATFORM:NASDB:INFO:305::::1265346082:nasdb_backup: NAS_
DB checkpoint done.
Feb  4 22:01:24 2010:CS_PLATFORM:NASDB:INFO:306::::1265346084:nasdb_backup: Cele
rra database backup done.
Feb  4 22:12:32 2010:DART:REP:WARNING:25:Slot 2:::1265346752:Primary=fs45_T1_LUN
6_SL7E9092800009_0000_fs70_T8_LUN25_APM00085002054_0000(alias=Valor_Replication)
, doing post-event. Secondary no space. Retry.
Feb  4 22:12:52 2010:DART:REP:INFO:38:Slot 2:::1265346772:Primary=fs45_T1_LUN6_S
L7E9092800009_0000_fs70_T8_LUN25_APM00085002054_0000(alias=Valor_Replication), S
pace on the destination is available.
Feb  4 22:12:52 2010:DART:REP:ERROR:18:Slot 2:::1265346772:Primary=fs45_T1_LUN6_
SL7E9092800009_0000_fs70_T8_LUN25_APM00085002054_0000(alias=Valor_Replication),
doing post-event failed:DpRequest_SchedulerInactive. Session inactive.
Feb  4 22:12:52 2010:DART:DPSVC:ERROR:13:Slot 2:::1265346772:The replication ses
sion:fs45_T1_LUN6_SL7E9092800009_0000_fs70_T8_LUN25_APM00085002054_0000 internal
policy has become inactive:DpRequest_SchedulerInactive.
Feb  4 22:12:52 2010:DART:REP:ERROR:50:Slot 2:::1265346772:Primary Scheduler=fs4
5_T1_LUN6_SL7E9092800009_0000_fs70_T8_LUN25_APM00085002054_0000(alias=Valor_Repl
ication), copying snapshot failed:DpRequest_SchedulerInactive - Message id=13160
415296. Inactive.
Feb  4 23:01:19 2010:CS_PLATFORM:NASDB:INFO:300::::1265349679:nasdb_backup: NAS_
DB checkpoint is in progress.
Feb  4 23:01:20 2010:CS_PLATFORM:NASDB:INFO:305::::1265349680:nasdb_backup: NAS_
DB checkpoint done.
Feb  4 23:01:22 2010:CS_PLATFORM:NASDB:INFO:306::::1265349682:nasdb_backup: Cele
rra database backup done.
Feb  5 00:01:21 2010:CS_PLATFORM:NASDB:INFO:300::::1265353281:nasdb_backup: NAS_
DB checkpoint is in progress.
Feb  5 00:01:22 2010:CS_PLATFORM:NASDB:INFO:305::::1265353282:nasdb_backup: NAS_
DB checkpoint done.
Feb  5 00:01:24 2010:CS_PLATFORM:NASDB:INFO:306::::1265353284:nasdb_backup: Cele
rra database backup done.
Feb  5 01:01:19 2010:CS_PLATFORM:NASDB:INFO:300::::1265356879:nasdb_backup: NAS_
DB checkpoint is in progress.
Feb  5 01:01:20 2010:CS_PLATFORM:NASDB:INFO:305::::1265356880:nasdb_backup: NAS_
DB checkpoint done.
Feb  5 01:01:22 2010:CS_PLATFORM:NASDB:INFO:306::::1265356882:nasdb_backup: Cele
rra database backup done.
Feb  5 01:44:16 2010:CS_PLATFORM:JServer:WARNING:808::::1265359456:CLARIION SL7E
9092800009 polling interval is too short, skipping polls
Feb  5 01:49:16 2010:CS_PLATFORM:JServer:INFO:850::::1265359756:CLARIION SL7E909
2800009 - new configuration received
Feb  5 02:01:30 2010:CS_PLATFORM:NASDB:INFO:300::::1265360490:nasdb_backup: NAS_
DB checkpoint is in progress.
Feb  5 02:01:31 2010:CS_PLATFORM:NASDB:INFO:305::::1265360491:nasdb_backup: NAS_
DB checkpoint done.
Feb  5 02:01:33 2010:CS_PLATFORM:NASDB:INFO:306::::1265360493:nasdb_backup: Cele
rra database backup done.
Feb  5 03:01:21 2010:CS_PLATFORM:NASDB:INFO:300::::1265364081:nasdb_backup: NAS_
DB checkpoint is in progress.
Feb  5 03:01:21 2010:CS_PLATFORM:NASDB:INFO:305::::1265364081:nasdb_backup: NAS_
DB checkpoint done.
Feb  5 03:01:23 2010:CS_PLATFORM:NASDB:INFO:306::::1265364083:nasdb_backup: Cele
rra database backup done.
Feb  5 04:01:21 2010:CS_PLATFORM:NASDB:INFO:300::::1265367681:nasdb_backup: NAS_
DB checkpoint is in progress.
Feb  5 04:01:22 2010:CS_PLATFORM:NASDB:INFO:305::::1265367682:nasdb_backup: NAS_
DB checkpoint done.
Feb  5 04:01:25 2010:CS_PLATFORM:NASDB:INFO:306::::1265367685:nasdb_backup: Cele
rra database backup done.
Feb  5 05:01:19 2010:CS_PLATFORM:NASDB:INFO:300::::1265371279:nasdb_backup: NAS_
DB checkpoint is in progress.
Feb  5 05:01:19 2010:CS_PLATFORM:NASDB:INFO:305::::1265371279:nasdb_backup: NAS_
DB checkpoint done.
Feb  5 05:01:22 2010:CS_PLATFORM:NASDB:INFO:306::::1265371282:nasdb_backup: Cele
rra database backup done.
Feb  5 06:01:20 2010:CS_PLATFORM:NASDB:INFO:300::::1265374880:nasdb_backup: NAS_
DB checkpoint is in progress.
Feb  5 06:01:21 2010:CS_PLATFORM:NASDB:INFO:305::::1265374881:nasdb_backup: NAS_
DB checkpoint done.
Feb  5 06:01:23 2010:CS_PLATFORM:NASDB:INFO:306::::1265374883:nasdb_backup: Cele
rra database backup done.
Feb  5 07:01:20 2010:CS_PLATFORM:NASDB:INFO:300::::1265378480:nasdb_backup: NAS_
DB checkpoint is in progress.
Feb  5 07:01:21 2010:CS_PLATFORM:NASDB:INFO:305::::1265378481:nasdb_backup: NAS_
DB checkpoint done.
Feb  5 07:01:23 2010:CS_PLATFORM:NASDB:INFO:306::::1265378483:nasdb_backup: Cele
rra database backup done.
Feb  5 07:04:16 2010:CS_PLATFORM:JServer:WARNING:808::::1265378656:CLARIION SL7E
9092800009 polling interval is too short, skipping polls
Feb  5 07:09:16 2010:CS_PLATFORM:JServer:INFO:850::::1265378956:CLARIION SL7E909
2800009 - new configuration received
Feb  5 08:01:21 2010:CS_PLATFORM:NASDB:INFO:300::::1265382081:nasdb_backup: NAS_
DB checkpoint is in progress.
Feb  5 08:01:22 2010:CS_PLATFORM:NASDB:INFO:305::::1265382082:nasdb_backup: NAS_
DB checkpoint done.
Feb  5 08:01:24 2010:CS_PLATFORM:NASDB:INFO:306::::1265382084:nasdb_backup: Cele
rra database backup done.
[nasadmin@emcnas1 ~]$ nas_logviewer /nas/log/sys_log | more

[nasadmin@emcnas1 ~]$ server_log server_2

[nasadmin@emcnas1 ~]$ nas_task -list -all
Error 2100: Usage:

nas_task
    -list [ -remote_system { <remoteSystemName> | id=<id> } ]
[nasadmin@emcnas1 ~]$ nas_task -list
ID    Task State Originator    Start Time                   Description                    Schedule                  Remote System
19418 Failed     nasadmin@172+ Thu Feb 04 09:16:47 MST 2010 Refresh Replication Valor_Rep+                           0
18333 Failed     nasadmin@172+ Wed Feb 03 08:52:24 MST 2010 Refresh Replication Valor_Rep+                           0
16964 Failed     nasadmin@172+ Tue Feb 02 07:44:00 MST 2010 Refresh Replication Valor_Rep+                           0

[nasadmin@emcnas1 ~]$ nas_task -info
Error 2100: Usage:
nas_task
    -info { -all | <taskId> }
          [ -remote_system { <remoteSystemName> | id=<id> } ]
[nasadmin@emcnas1 ~]$ nas_task -info 19418
Task Id                = 19418
Celerra Network Server = 0
Task State             = Failed
Movers                 =
Description            = Refresh Replication Valor_Replication [ id=fs45_T1_LUN6_SL7E9092800009_0000_fs70_T8_LUN25_APM00085002054_0000].
Originator             = nasadmin@172.16.8.40
Start Time             = Thu Feb 04 09:16:47 MST 2010
End Time               = Thu Feb 04 22:12:53 MST 2010
Schedule               = n/a
Response Statuses      = Error 13160415296: The replication scheduler is not active.

[nasadmin@emcnas1 ~]$ exit
logout
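Most of the log excerpt is hourly nasdb_backup noise, so it may help to filter it down to the DART warnings and errors. The sketch below assumes the record layout seen in this excerpt (date, component, facility, severity, event id, slot, then what appears to be a Unix epoch timestamp and the message) and that a record was wrapped at the terminal width without a separator. The names `START`, `RECORD` and `parse_records` are made up for illustration, and the field names are my own labels, not EMC's.

```python
import re
from datetime import datetime, timezone

# A record starts with a date such as "Feb  4 22:12:52 2010:".
START = re.compile(r"^\w{3}\s+\d+ \d\d:\d\d:\d\d \d{4}:")
# Assumed layout: date:component:facility:severity:event:slot:::epoch:message
RECORD = re.compile(
    r"^(\w{3}\s+\d+ \d\d:\d\d:\d\d \d{4}):(\w+):(\w+):(\w+):(\d+):([^:]*):::(\d+):(.*)$"
)


def parse_records(lines):
    """Rejoin wrapped lines, then yield one dict per recognised log record."""
    merged = []
    for line in lines:
        line = line.rstrip("\n")
        if not line.strip() or "--More--" in line:
            continue  # skip blank lines and any pager markers
        if START.match(line) or not merged:
            merged.append(line)
        else:
            merged[-1] += line  # continuation of a wrapped record
    for text in merged:
        m = RECORD.match(text)
        if m:
            date, comp, fac, sev, event, slot, epoch, msg = m.groups()
            yield {
                "date": date, "component": comp, "facility": fac,
                "severity": sev, "event": int(event), "slot": slot,
                "utc": datetime.fromtimestamp(int(epoch), tz=timezone.utc),
                "message": msg,
            }


# Two records copied from the excerpt in this post.
sample = [
    "Feb  4 21:01:24 2010:CS_PLATFORM:NASDB:INFO:306::::1265342484:nasdb_backup: Cele",
    "rra database backup done.",
    "Feb  4 22:12:52 2010:DART:REP:ERROR:18:Slot 2:::1265346772:Primary=fs45_T1_LUN6_",
    "SL7E9092800009_0000_fs70_T8_LUN25_APM00085002054_0000(alias=Valor_Replication),",
    "doing post-event failed:DpRequest_SchedulerInactive. Session inactive.",
]

records = list(parse_records(sample))
problems = [r for r in records
            if r["component"] == "DART" and r["severity"] in ("WARNING", "ERROR")]
for r in problems:
    print(r["utc"].isoformat(), r["facility"], r["severity"], r["message"])
```

Run against the full excerpt, this should leave only the handful of DART lines around 22:12 on Feb 4, which is where the "Secondary no space" warning and the scheduler errors appear.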
