Why is the route configuration dropping on my vNet?

William Willis 0 Reputation points
2023-06-22T16:25:07.8566667+00:00

I'm having an issue that I can't figure out, and after 45 minutes with Azure support still couldn't find the issue. I'll try to give the best details I can here.

Short description: I have a VM that hosts 2 nested servers; I followed guides to get them set up - and everything works great. For a while, then it suddenly stops, and the only way I can bring it back is removing the route from 3 route tables, and then adding it back.

Long description:

Azure VM with 2 NICs: 172.20.0.5 / 172.20.1.5; the 0.5 NIC is used for NAT connectivity, 1.5 NIC with IP forwarding, firewall set at 172.20.3.4; nested server at 172.20.2.5. vNet set with address spaces for 172.20.0.0/24, 172.20.1.0/24, 172.20.3.0/24 (firewall), route table for server set with 172.20.2.0/24 -> 172.20.1.5; second route table for FW set with the same route.

Second vNet setup with the virtual Gateway and peered with the vNet above; address spaces of 172.20.4.0/24 (gateway) and 172.20.5.0, route table set for 172.20.2.0/24 -> 172.20.1.5.

Gateway connects with local FW, S2S IPSec, both sides with traffic set for 172.20.1.0/24 and 172.20.2.0/24, no errors on either side.

Now the issue: I get everything set up, and it works - I can RDP to the nested servers on the 172.20.2.0 subnet, ping, etc. Those servers can also reach the internet through the firewall, everything works as expected. After a while, I can no longer reach the 172.20.2.0 subnet from outside the server. I can always reach the 172.20.1.5 address to connect to the host and if I try to connect to the 172.20.2.5 VM I can see the traffic going across the tunnel, so I know the gateway is working.

If I connect to the host server on the 172.20.1.5 I can connect to the 2.5 VM. Connecting on the 2.5 VM I can ping the host and the firewall, but not the gateway.

The only way I can get restore access is dropping the routing of 172.20.2.0/24 -> 172.20.1.5 from all 3 route tables and then adding them back and it's restored immediately.

Working with the support they said everything routes fine, but at that time it was working. What could possibly be happening here?

Azure Virtual Network
Azure Virtual Network
An Azure networking service that is used to provision private networks and optionally to connect to on-premises datacenters.
2,775 questions
{count} votes

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.