Yes I've looked into the NSF feature on these switches. The feature looks like it's turned on by default on these switches and I have confirmed we have it on.
I could be wrong but Ultimately it does look like the LACP port-channel has to reset or renegotiate once the stack MAC address changes (master fails/reboots). Once this happens traffic does not pass through the LAG until STP has gone through its calculations. What I think I need is something stopping the LACP LAGs connected to other switches from restarting and causing STP to kick in.
I tried out changing both sides of the stack to short timeouts but this does not appear to have solved the problem. When the master dies the LACP port-channel interface still dies and then starts up again, causing an STP reconvergence.
The stack firmware is the newest at 6.2.7.2. I did try testing the LAG to static and when the master dies the port-channel interface still has a short 1-2 second blip still causing STP calculations. When I disabled spanning-tree completely on both stacks there is a much better behavior. The 1-2 second blip of the port-channel interface going down is the only down time you see. It comes right back up and everything is in a forwarding state instead of listening, learning, etc.
I have tested this with different spanning-tree versions as well. Obviously I can run RSTP or Rapid-PVST and the LAG outage connecting both switches shortens. Ultimately though I want to know if when the stack master/management plane dies if there is a way on these switches to continue passing traffic on the remaining port-channel member without it going down for even the 1-2 seconds. The non-stop forwarding (NSF) feature does not look to be doing this.
MTSIA5988
8 Posts
0
January 11th, 2016 11:00
Hi Daniel--I appreciate the response.
Yes I've looked into the NSF feature on these switches. The feature looks like it's turned on by default on these switches and I have confirmed we have it on.
I could be wrong but Ultimately it does look like the LACP port-channel has to reset or renegotiate once the stack MAC address changes (master fails/reboots). Once this happens traffic does not pass through the LAG until STP has gone through its calculations. What I think I need is something stopping the LACP LAGs connected to other switches from restarting and causing STP to kick in.
MTSIA5988
8 Posts
0
January 12th, 2016 06:00
Hi Daniel,
I tried out changing both sides of the stack to short timeouts but this does not appear to have solved the problem. When the master dies the LACP port-channel interface still dies and then starts up again, causing an STP reconvergence.
The stack firmware is the newest at 6.2.7.2. I did try testing the LAG to static and when the master dies the port-channel interface still has a short 1-2 second blip still causing STP calculations. When I disabled spanning-tree completely on both stacks there is a much better behavior. The 1-2 second blip of the port-channel interface going down is the only down time you see. It comes right back up and everything is in a forwarding state instead of listening, learning, etc.
MTSIA5988
8 Posts
0
January 12th, 2016 07:00
I have tested this with different spanning-tree versions as well. Obviously I can run RSTP or Rapid-PVST and the LAG outage connecting both switches shortens. Ultimately though I want to know if when the stack master/management plane dies if there is a way on these switches to continue passing traffic on the remaining port-channel member without it going down for even the 1-2 seconds. The non-stop forwarding (NSF) feature does not look to be doing this.