After a crash reboot, on 1 switch, we decide to attempt an UPGRADE to resolv this issue.
The upgrade failed, because :
- When 1 switch is on 10.5 version, the VLT is not functionning correctly, all VLAN are incorrect, and all LAG are failed (probably due to VLT mismatch on all LAG).
- When we upgrade all switches to 10.5 the VLT is ok, but we cannot use our QSFP28. So no inter-dc links up.
The GBIC are not "powered". No laser, but we have "power reading -0.2 db" (approax).
So we downgrade the switches, and remain in the same version.
Question 1 : Why VLT between 10.4 and 10.5 do not correspond / not compatible
Question 2: Why some GBIC (AKA our QSFP28) are not working.
DELL-Josh Cr
Moderator
•
9.5K Posts
0
October 30th, 2019 07:00
Hi,
Can you move to the latest version? You could also try rebooting each switch and see if that helps.
remy-ch
26 Posts
0
October 31st, 2019 01:00
Well, as you know it's not so easy to upgrade Core switch in prod environement..
No other idea ?
DELL-Josh Cr
Moderator
•
9.5K Posts
0
November 1st, 2019 06:00
You could try stopping the peer link and reconnecting, it seems that the switches are getting out of sync.
remy-ch
26 Posts
0
November 4th, 2019 07:00
Hello,
What do you mean "getting out of sync" ?
We have find a workaround : we have shutdown one link (under the LAG ) between the 2 sites, and the switch are became stable since 5 days.
So, can it be a LACP bug ?
remy-ch
26 Posts
0
November 7th, 2019 03:00
Hi,
After a crash reboot, on 1 switch, we decide to attempt an UPGRADE to resolv this issue.
The upgrade failed, because :
- When 1 switch is on 10.5 version, the VLT is not functionning correctly, all VLAN are incorrect, and all LAG are failed (probably due to VLT mismatch on all LAG).
- When we upgrade all switches to 10.5 the VLT is ok, but we cannot use our QSFP28. So no inter-dc links up.
The GBIC are not "powered". No laser, but we have "power reading -0.2 db" (approax).
So we downgrade the switches, and remain in the same version.
Question 1 : Why VLT between 10.4 and 10.5 do not correspond / not compatible
Question 2: Why some GBIC (AKA our QSFP28) are not working.
remy-ch
26 Posts
0
December 12th, 2019 04:00
All switches have restarted.
We have also add new S5232F for interconnection between site.
But the problem still occured. Packet lost are increased when S4148 are using LAG with VLT.
On the Linux sub-system we have this errors (example):
S4148-1
2019-12-11 13:11:23.569 S4148-1 dn_infra_afs[err]: [INFRA_AFS:INFRA-AFS-MAC], Error when fetching MAC entry (MAC:00:50:56:9b:6e:e5, VLAN:1003) to verify ageout from VLT peer-unit-id:1 name:VLT_NODE_1
2019-12-11 13:11:23.574 S4148-1 dn_infra_afs[err]: [INFRA_AFS:INFRA-AFS-CMN], Datastore Get failed
2019-12-11 13:11:23.575 S4148-1 dn_infra_afs[err]: [INFRA_AFS:INFRA-AFS-MAC], Error when fetching MAC entry (MAC:00:e0:20:15:0e:3c, VLAN:1003) to verify ageout from VLT peer-unit-id:1 name:VLT_NODE_1
2019-12-11 13:11:23.688 S4148-1 dn_infra_afs[err]: [INFRA_AFS:INFRA-AFS-CMN], Datastore Get failed
2019-12-11 13:11:23.689 S4148-1 dn_infra_afs[err]: [INFRA_AFS:INFRA-AFS-MAC], Error when fetching MAC entry (MAC:00:e0:20:15:0e:4f, VLAN:1003) to verify ageout from VLT peer-unit-id:1 name:VLT_NODE_1
2019-12-11 13:11:23.700 S4148-1 dn_infra_afs[err]: [INFRA_AFS:INFRA-AFS-CMN], Datastore Get failed
2019-12-11 13:11:23.702 S4148-1 dn_infra_afs[err]: [INFRA_AFS:INFRA-AFS-MAC], Error when fetching MAC entry (MAC:00:e0:20:15:0e:4f, VLAN:1002) to verify ageout from VLT peer-unit-id:1 name:VLT_NODE_1
2019-12-11 13:11:23.748 S4148-1 dn_infra_afs[err]: [INFRA_AFS:INFRA-AFS-CMN], Datastore Get failed
2019-12-11 13:11:23.749 S4148-1 dn_infra_afs[err]: [INFRA_AFS:INFRA-AFS-MAC], Error when fetching MAC entry (MAC:00:e0:20:11:0a:36, VLAN:1003) to verify ageout from VLT peer-unit-id:1 name:VLT_NODE_1
S4148-2
2019-12-11 12:27:46.900 S4148-2 dn_infra_afs[err]: [INFRA_AFS:INFRA-AFS-CMN], Datastore Get failed
2019-12-11 12:27:46.901 S4148-2 dn_infra_afs[err]: [INFRA_AFS:INFRA-AFS-MAC], Error when fetching MAC entry (MAC:b4:b6:86:2f:24:9b, VLAN:1003) to verify ageout from VLT peer-unit-id:2 name:VLT_NODE_2
2019-12-11 12:27:46.946 S4148-2 dn_infra_afs[err]: [INFRA_AFS:INFRA-AFS-CMN], Datastore Get failed
2019-12-11 12:27:46.947 S4148-2 dn_infra_afs[err]: [INFRA_AFS:INFRA-AFS-MAC], Error when fetching MAC entry (MAC:00:e0:20:15:0e:4f, VLAN:1003) to verify ageout from VLT peer-unit-id:2 name:VLT_NODE_2
2019-12-11 12:27:46.977 S4148-2 dn_infra_afs[err]: [INFRA_AFS:INFRA-AFS-CMN], Datastore Get failed
2019-12-11 12:27:46.978 S4148-2 dn_infra_afs[err]: [INFRA_AFS:INFRA-AFS-MAC], Error when fetching MAC entry (MAC:00:e0:20:15:0e:4f, VLAN:1002) to verify ageout from VLT peer-unit-id:2 name:VLT_NODE_2
2019-12-11 12:27:47.102 S4148-2 dn_infra_afs[err]: [INFRA_AFS:INFRA-AFS-CMN], Datastore Get failed
2019-12-11 12:27:47.103 S4148-2 dn_infra_afs[err]: [INFRA_AFS:INFRA-AFS-MAC], Error when fetching MAC entry (MAC:00:e0:20:11:0a:36, VLAN:1003) to verify ageout from VLT peer-unit-id:2 name:VLT_NODE_2
2019-12-11 12:57:46.120 S4148-2 dn_infra_afs[err]: [INFRA_AFS:INFRA-AFS-CMN], Datastore Get failed
2019-12-11 12:57:46.121 S4148-2 dn_infra_afs[err]: [INFRA_AFS:INFRA-AFS-MAC], Error when fetching MAC entry (MAC:14:b3:1f:1f:66:b8, VLAN:1003) to verify ageout from VLT peer-unit-id:2 name:VLT_NODE_2
We also see another anormal thing.
Some MAC address must stay in VLAN, and not visible in all VLANS.
remy-ch
26 Posts
0
December 16th, 2019 08:00
Thanks for your message.
We have open a ticket last week and upgrade all S4148 to 10.5.0.3.600.
Its an incredible change of ressources consummation.
We still have an issue with one LAG over links betwwen site. All other seems to be fixed.
There is some DELL Gbic that didn't works with the new version, and we have found a workaound with another switch.
Will keep you updated, and send you TAG in private.
DELL-Josh Cr
Moderator
•
9.5K Posts
0
December 16th, 2019 08:00
Hi,
Can you private message me the service tags?