-
Hi, I'm peering with a VyOS that provides me full BGP tables. For simplicity we're only going to consider IPv6 but this happens on IPv4 as well. It's worth to mention that this is a peering session over vlan subinterface inside a VRF (not netns). After getting the first full tables update from VyOS I will see FRR disconnect from the peer with error: Full log:
After issuing Investigating the actual next-hop attribute, since the error mentions martians, shows not a single martian. All next-hops are the correct IPv6LL of the VyOS node and I've also thought that maybe a handful of routes might be having martians and FRR is throwing away the entire table so I recorded a pcap file and inspected every single BGP Update and the Next-Hop Attribute was always the correct GUA and LL. Next I've changed the MAC of the subinterface thinking that it might collide with the IPv6LL of a different subinterface which also has a full-table peer connected to it. After changing the MAC and cycling the interface down/up I verified a new IPv6LL was assigned to me but the error persists. At this point I'm out of ideas. This feels specific to VyOS as my other sessions (Juniper, Cisco, Arista, FRR, Bird) are all fine with full-tables and tagged interfaces. However the VyOS partner also has fully working sessions to at least Juniper and Arista. The setup is running on Ubuntu 22.04 LTS with FRRouting 8.1-1ubuntu1.9 (from ubuntu repositories). Anyone got any ideas what I could try next to debug this? |
Beta Was this translation helpful? Give feedback.
Replies: 4 comments 10 replies
-
Please use 9.x or 10.x releases, 8.1 is too old to be supported. |
Beta Was this translation helpful? Give feedback.
-
#15543 you mean like I did here? if not then please be more specific of what you want me to execute/log/upload |
Beta Was this translation helpful? Give feedback.
-
Ok so good news: 10.x does solve it bad news:
So 10.x has this oddity:
The /22 is inserted in the VRF and added to the kernel, it's also matching the correct outgoing route-map. But it's not advertised. One Step forward, One Step backward.... |
Beta Was this translation helpful? Give feedback.
-
To summarize:
This was a very bumpy upgrade and I hope nobody else has to go through it. Or if you do, rather start with a blank config as you will end up rewriting or correcting a lot anyway. |
Beta Was this translation helpful? Give feedback.
To summarize:
bgp suppress-fib-pending
which leads to prefixes not being advertised ever (even when peer was shut/noshut)no neighbor xyz enforce-first-as
and in the process breaks other parts of the config (name rt import/export statements missing or incomplete randomly) leading to unusable stateThis was a very bumpy upgrade and I hope nobody else has to go through it. Or if you do, rather start with a blank config as you will end up rewriting or correcting a lot anyway.