Unsolved
This post is more than 5 years old
1 Rookie
•
77 Posts
0
1547
nsrstage command has retried times
Hi,
I run a stage process ( Start NOW ) to move data on a storage node from its adv_file to the tapes it owns in DDS with other stnodes.
The command started 1 hour ago and on the storage node ( w2K ) I see someting like it is starting "nsrmmd#13 .....Automated staging as determined the need to migrate 0KB"; this is typical when I run it using "Start now". On the server side the process is up but nothing happen, apart the message:
nsrstage: nsrstage command has retried 102 times.
nsrstage: nsrstage command has retried 122 times.
repeated until now and continuing.
The problem seems that the stnode is not able to reply to the server; I have stopped/started networker on the storage node side but nothing happen.
The storage node is able to ping the server and to run rpcinfo -p servername also.
What I can do to fix it ? It's enough to try to kill nsrstage and redo from start ? Is a signal of something bad ? Or have I to wait the filesystem check interval ( 6 hours for me ).
Please, a suggestion .....
Thanks a lot in advance !!!
Pierpa
I run a stage process ( Start NOW ) to move data on a storage node from its adv_file to the tapes it owns in DDS with other stnodes.
The command started 1 hour ago and on the storage node ( w2K ) I see someting like it is starting "nsrmmd#13 .....Automated staging as determined the need to migrate 0KB"; this is typical when I run it using "Start now". On the server side the process is up but nothing happen, apart the message:
nsrstage: nsrstage command has retried 102 times.
nsrstage: nsrstage command has retried 122 times.
repeated until now and continuing.
The problem seems that the stnode is not able to reply to the server; I have stopped/started networker on the storage node side but nothing happen.
The storage node is able to ping the server and to run rpcinfo -p servername also.
What I can do to fix it ? It's enough to try to kill nsrstage and redo from start ? Is a signal of something bad ? Or have I to wait the filesystem check interval ( 6 hours for me ).
Please, a suggestion .....
Thanks a lot in advance !!!
Pierpa
ble1
2 Intern
2 Intern
•
14.3K Posts
0
July 25th, 2007 06:00
pfrassino
1 Rookie
1 Rookie
•
77 Posts
0
July 25th, 2007 07:00
new run by command line:
unlb-legato-01:/home/solveit # nsrstage -v -b "Tape 3M" -m -S 4037486709
Obtaining media database information on server unlb-legato-01.dpko.un.org
Parsing save set id(s)
Migrating the following save sets (ids):
4037486709
Automatically copying save sets(s) to other volume(s)
Starting migration operation...
NSR server unlb-legato-01.dpko.un.org: busy
waiting 30 seconds then retrying
NSR server unlb-legato-01.dpko.un.org: busy
waiting 30 seconds then retrying
nsrstage: nsrstage command has retried 2 times.
Boh .... why ...?????!!!!!!!!!!!
Pierpa
pfrassino
1 Rookie
1 Rookie
•
77 Posts
0
July 25th, 2007 07:00
I run it with "start now" asking to stage everything, so nno highwatermarks are checked.
Pierpa
ble1
2 Intern
2 Intern
•
14.3K Posts
0
July 25th, 2007 07:00
pfrassino
1 Rookie
1 Rookie
•
77 Posts
0
July 25th, 2007 08:00
I'VE FOUND !!!
I stage from an adv_file to a tape. Apparently the advfile was mounted correctly. I dismounted the NON-RO advfile, mounted again and voilà, the stage now is running quickly via commandline and via GUI.
But I cannot figure out why the advfile had to be unmounted and mounted again.
I'm using 7.3.2. Jumbo Update 1 ( build 386 ). Are you aware of any issue like this.
I've seen sometimes as well that after Networker reboot the advfiles need to be mounted by hand. Any comment on this as well ??
Thanks Hrvoje for your infinite patience in replying! !!!!!!
Pierpa
pfrassino
1 Rookie
1 Rookie
•
77 Posts
0
July 25th, 2007 08:00
this is not the case. I've all the tapes free.
Thanks for any suggestion. I'm close to be crazy...
Pierpa
ble1
2 Intern
2 Intern
•
14.3K Posts
0
July 25th, 2007 09:00
lee6
68 Posts
1
July 25th, 2007 10:00
Yes, I have had this with build 386 and it still is in 399. it is hit and miss not all the time. I have a couple of hot fix's running on my 399 build as so far it has not happened again. I can not say for sure that the hot fix has taken care of this but it has been est 4 reboots and I still have them mounted after reboot.
ble1
2 Intern
2 Intern
•
14.3K Posts
1
July 25th, 2007 13:00
pfrassino
1 Rookie
1 Rookie
•
77 Posts
0
July 25th, 2007 21:00
therefore something is not managing these adv_files correctly at startup.
I'll try to do the same in my environment; now, at least, I know that I'm not into a dream but the same happens to someone else ...
Thanks again for your suggestions!!
Pierpa