We recently upgrded the networker server from 184.108.40.206 to 220.127.116.11. After the upgrade server has gon to hung state many times, usually when many jobs are started in the evening and restarting the service is the only solution we have.
Our Networker server is in HPUX 2 node cluster with 2000+ clients.
OS : HP-UX B.11.23 U ia64
Storage nodes : 4 (3 HPUX + 1 Redhat Linux)
Previous version : 18.104.22.168
Current Version : 22.214.171.124
Summary of the issue
Networker server is in HPUX 2 node cluster. On 15th Nov 2014, after the upgrade we did a
failover test from node A to second node A. Issue started after we switched back the package to node1, where it was running before the
NW162230 - NetWorker hangs intermittently after upgrading from 7.6.x to 126.96.36.199 is created to EMC engineering team to check the issue.
Could someone please help me with this issue... Has anyone already come across such issue with version 188.8.131.52?
I was told that 184.108.40.206 has fixed many issues, but it doesn't seem to be true in our case.
I have the exact same issue on the same OS for the backups server. I however have only around 650 clients on my affected backup server. I see this mostly during my monthly backup schedules and also sometimes on the weekly schedule. The thing is the server does not actually get hung, the nsrd crashes with the rest of the services still running. I had a case with EMC on this they gave me a patched binary which seemed to be working until it crashed again, but again the crashes are not as frequent as they were before EMC gave me the binary.
I am blaming this on my very old HP-UX server and am planning on getting migrating it to a Linux. things are not as fast as you expect in the service industry especially when there is cost involved so is taking a while for me to reach there.
I hope you backup server is patched with the latest patches or at least with the one's EMC has provided as pre-requisites. Also, the OpenSSL should be of the latest version. Try this and see if this help you.
Whether the Backup Server is going to Hung state or the Backup Jobs are getting Hanged?? We faced same kind of issue after the Networker upgrade to 220.127.116.11. As there was some bug identified in 18.104.22.168, we upgraded to 22.214.171.124 and applied some patches suggested by engineering team and now it is in stable condition.
We have two different data zones. One is having around 450 clients and the other one is having less than 100 clients. Currently all the backups are stable. Earlier we used to get some errors like no tape label found even if the volumes are available in pools. The tape labels will automatically get unlabeled (No data lose, after inventory, it will come up, manual backup will get complete successful, but schedule will fail/ hang).
I believe for 11v2 there are some (or at least one) HPUX patch required. What has been mentioned for ssl was fixed in 126.96.36.199 (but there is still recommendation to get ssl up to 0.9xy or something like that (I think xz is latest one). I run 11v3 without issue with 188.8.131.52 (well, I did hit the bug with ssids not removed which is fixed in 184.108.40.206) and thought of jumping to 220.127.116.11, but I plan to move this to Linux as well so most likely I won't be updating (due to some dependencies). Do you use dynamic nsrmmds? Do you have core dumps (check /nsr/core)? If yes, for which process?