May 17th, 2010 10:00

savefs: nothing to save

Hello Gurus,

We have two physical HP-UX 11.31 servers in an MC Serviceguard cluster. This is my situation:

Last week all backups were working properly. Then I had the cluster file systems fail over from ServerB to ServerA. I can still back up ServerB and the cluster file systems now on ServerA, but I can't back up the local file systems on ServerA.

Any help would be appreciated.

736 Posts

May 18th, 2010 02:00

I have seen 'savefs: nothing to save' caused by the host's name not being included in the /etc/hosts file, or by DNS not resolving the hostname correctly. You might want to check these.

-Bobby
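
A minimal sketch of the hosts-file half of that check, for anyone following along. The function name host_in_hosts_file is hypothetical (it is not a NetWorker or HP-UX tool), and it only looks at a hosts-format file; it does not query DNS, which still needs a separate nslookup of the short and long names.

```shell
# Hypothetical helper: print the address of every entry in a hosts-format
# file whose hostname or alias matches NAME (case-insensitive).
# Returns 0 if at least one entry matched, 1 otherwise.
host_in_hosts_file() (
    name=$1
    file=${2:-/etc/hosts}        # default path; pass another file to test
    awk -v n="$name" '
        { sub(/#.*/, "") }       # strip trailing comments
        NF >= 2 {
            for (i = 2; i <= NF; i++)
                if (tolower($i) == tolower(n)) { found = 1; print $1 }
        }
        END { exit !found }
    ' "$file"
)
```

Typical use on the client would be something like: host_in_hosts_file "$(hostname)" || echo "not in /etc/hosts". Repeat it for the fully qualified name and for the cluster package name.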

11 Posts

May 18th, 2010 05:00

Hello Bobby and thanks for your help,

I checked the hosts files on both ends and found no resolution issues. Both the short and long names are in the hosts file, and both resolve correctly.

This all started after I mounted the clustered file systems on the server. I can back up the clustered file systems by their cluster name, but I can't back up the physical server.

736 Posts

May 18th, 2010 06:00

Run savefs -p -D9, capture the output to a file, and then search it:

savefs -p -D9 > savefs_out 2>&1

grep matchbyname savefs_out

This will tell you what is being compared to what. If a line shows 'localhost' being matched to your hostname, then it looks like a resolution issue. If not, then it's something else. Also check whether the lcmap file is being invoked.

-Bobby
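
The check described above can be sketched as a small function. The name check_matchbyname and its messages are hypothetical, and the only assumption about the debug log is the one stated in this post: that comparison lines contain the string matchbyname and name both hosts on the same line.

```shell
# Hypothetical helper: scan saved 'savefs -p -D9' output for matchbyname
# lines that mention localhost, which would point at name resolution.
check_matchbyname() (
    file=${1:-savefs_out}        # file name used in the post above
    if grep matchbyname "$file" | grep -i localhost >/dev/null; then
        echo "localhost appears in a matchbyname comparison: check name resolution"
    else
        echo "no localhost comparison found"
    fi
)
```

Run it as check_matchbyname savefs_out after capturing the debug output. A "no localhost comparison found" result only rules out this one symptom; it does not prove that resolution is healthy.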

11 Posts

May 18th, 2010 08:00

Bobby,

Thanks again for your help. When I grep for matchbyname, I get the following:

clu_hosts_matchbyname() comparing CLUSTERNAME and PHYSICALNAME
clu_hosts_matchbyname:ENTRY
clu_hosts_matchbyname:EXIT: b_ret=0

I also saw this:

host1=CLUSTERNAME, host2=PHYSICALNAME

How can I tell if the lcmap is being invoked?

Thanks again.

736 Posts

May 20th, 2010 03:00

Hi,

Your matchbyname output looks OK, although I don't have a cluster here to test and confirm that. The lcmap will query the cluster nodes for information on what to back up. You can search for the word 'lcmap' in the savefs output you have. If it looked for this command and didn't find it, it will say so, and this means that the cluster is not being recognised by NetWorker. You could also check your /etc/fstab, perhaps by comparing it to the same file on a working system.

If this doesn't get you anywhere, you should probably open a Service Request so that someone in support can look at your configuration in detail.

-Bobby

5 Posts

June 2nd, 2010 05:00

Hi,

taken from the cluster install guide 7.5:

The NetWorker client software must determine an owning host for any paths that it saves. The NetWorker software determines which mount points an MC/ServiceGuard or MC/LockManager package owns by the entries in the .nsr_cluster file, located in the /etc/cmcluster/ directory. The .nsr_cluster file should have an entry for the NetWorker shared mount point, which is owned by the NetWorker package.

To configure the .nsr_cluster file:

1. Add the name and path of each mount point to the file in the following format:

pkgname:published_ip_address:owned_path[:...]

where published_ip_address is the address assigned to the package owning a shared disk. IPv6 addresses must be enclosed in square brackets, as in the following example:

networker:[3ffe:80c0:22c:74:230:6eff:fe4c:2128]:/share/nw

2. Ensure that the ownership and access permissions for the .nsr_cluster file are "read" for World.

3. Additional paths, preceded by colons, can be added as required. The following is an example of a typical /etc/cmcluster/.nsr_cluster file:

networker:192.168.109.41:/vg011
oracle:192.168.109.10:/vg021:/ora_data1:/ora_data2

Have you configured this? If not, NetWorker will detect an MC/ServiceGuard cluster and determine that there are no save sets to be backed up.
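
For a quick sanity check of an existing file against the format quoted above, something like the following sketch might help. The function check_nsr_cluster is hypothetical and not part of NetWorker; it only tests the shape of each line (package name, address, one or more absolute paths) and assumes IPv6 addresses are bracketed as the guide requires. It does not verify that the addresses or mount points actually exist.

```shell
# Hypothetical helper: flag lines in a .nsr_cluster file that do not look
# like pkgname:published_ip_address:owned_path[:owned_path...].
# Returns 0 if every non-blank line passes, 1 otherwise.
check_nsr_cluster() (
    file=${1:-/etc/cmcluster/.nsr_cluster}
    awk '
        /^[ \t]*$/ { next }      # skip blank lines
        {
            line = $0
            # Hide the colons inside a bracketed IPv6 address before splitting.
            if (match(line, /\[[^]]*\]/)) {
                addr = substr(line, RSTART, RLENGTH)
                gsub(/:/, "-", addr)
                line = substr(line, 1, RSTART - 1) addr substr(line, RSTART + RLENGTH)
            }
            n = split(line, f, ":")
            ok = (n >= 3 && f[1] != "" && f[2] != "")
            for (i = 3; i <= n; i++)
                if (substr(f[i], 1, 1) != "/") ok = 0   # paths must be absolute
            if (!ok) { printf "line %d looks malformed: %s\n", NR, $0; bad = 1 }
        }
        END { exit bad + 0 }
    ' "$file"
)
```

Running check_nsr_cluster with no argument checks the default path. The world-read permission from step 2 is a separate matter; ls -l /etc/cmcluster/.nsr_cluster shows it, and chmod o+r on the file would add it.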

11 Posts

June 3rd, 2010 04:00

Hello and thanks,

I am still having the issue. My .nsr_cluster works like a charm on my other 3 clustered nodes, but this one is being a pain. The one thing I did change was to give the .nsr_cluster file world-read permissions. It still didn't work, though.

14.3K Posts

June 3rd, 2010 04:00

I agree with what has been said above. I have 20+ boxes with exactly the same setup, and usually the only reasons it fails like that are that .nsr_cluster is not set correctly (or the cluster check file has not been touched), or that the machine has a network interface that is not listed in the alias list for the given client.

14.3K Posts

June 3rd, 2010 05:00

Go to that box and post the following outputs:

- cmviewcl

- cat /etc/hosts

- cat /etc/resolv.conf

- cat /etc/cmcluster/.nsr_cluster

- ll /opt/networker/bin/NetWorker.clustersvr

- savefs -s -vpn

- hostname

From the backup server, run the following:

echo print | nsradmin -p 390113 -i - -s

Please post the outputs from the above and we can check it out.
