Isilon: PowerScale: SyncIQ policy fails during failback with the error at target "Local Error: Unable to Link LIN Numerical argument out of domain"

Summary: SyncIQ fails to synchronize data in a reverse replication through mirror policy from the target cluster to the source cluster during failback.

This article applies to This article does not apply to This article is not tied to any specific product. Not all product versions are identified in this article.

Symptoms

While running a job on the SyncIQ mirror policy during failback from DR to Production cluster, it may fail with the below error message on DR cluster.
 
SyncIQ encountered a filesystem error on target cluster. Error at target cluster on node [TNode-2]:  Unable to link /ifs/accesszone/clustername/shared/vol/share/0006/filename_L_bkp (1:2f4f:a28b::9672106) in directory /ifs/accesszone/clustername/shared/vol/share/0006/ (1:00e4:b84b::9672106), Local error : Unable to link lin 174045d4f: Numerical argument out of domain: Numerical argument out of domain
                                   from link_lin (/b/mnt/src/isilon/bin/isi_migrate/sworker/stf_transfer.c:1410)
                                   from link_callback (/b/mnt/src/isilon/bin/isi_migrate/sworker/stf_transfer.c:3820) , (policy name: Actualpolicyname_mirror target ip : 10.*.*.*

Cause

When acceleration failback is set for the original SyncIQ policy, a DomainMark system job is triggered for the source path which creates the SyncIQ domain upon a SyncIQ job run for the first time.

Reason for the failure of a mirror SyncIQ job in this case is due to the fact that multiple SyncIQ domains exist for same source path of SyncIQ policy at the actual production cluster where one source path is associated with the read-only SyncIQ domain while its subdirectory has read-write SyncIQ domain assigned.

Please refer to the below example.
 
ActualSourceCluster-1# isi_classic domain list
/ifs/accesszone/clustername/shared/  | SyncIQ
/ifs/accesszone/clustername/shared/vol/share/0006/ | SyncIQ,Writable



SyncIQ domain for the path /ifs/accesszone/clustername/shared/ is read-only while SyncIQ domain for its subdirectory /ifs/accesszone/clustername/shared/vol/share/0006/ is read-write as per the above command output.

Replication is failing when SyncIQ updates files under the path /ifs/accesszone/clustername/shared/vol/share/0006/ as SyncIQ domain for this path is read-write on the actual Production cluster.

Resolution

On the actual Production cluster delete the SyncIQ domain for the subdirectory which is writable and keep only the SyncIQ domain for the parent path.

1. Verify the SyncIQ domain on the actual Production cluster.
 
ActualSourceCluster-1# isi_classic domain list
/ifs/accesszone/clustername/shared/  | SyncIQ <--- Parent path
/ifs/accesszone/clustername/shared/vol/share/0006/ | SyncIQ,Writable <--- Subdirectory


2. Perform the following command on the actual Production cluster to delete the additional writable SyncIQ domain. 
 
ActualSourceCluster-1# domain_mark /ifs/accesszone/clustername/shared/vol/share/0006 synciq -d


3. Verify that only one SyncIQ domain is listed for the parent path.
 
ActualSourceCluster-1# isi_classic domain list
/ifs/accesszone/clustername/shared/  | SyncIQ


4. Start the mirror SyncIQ job again on actual DR cluster.
 
ActualTargetCluster-1# isi sync job start Policy_mirror

Affected Products

Isilon, PowerScale OneFS, Isilon SyncIQ
Article Properties
Article Number: 000200105
Article Type: Solution
Last Modified: 29 Jun 2023
Version:  4
Find answers to your questions from other Dell users
Support Services
Check if your device is covered by Support Services.