This article describes common ways to deploy a set of network virtual appliances (NVAs) for high availability in Azure. An NVA typically controls the flow of traffic between network segments that have different security levels. For example, you might use an NVA between a perimeter network virtual network and the public internet, or to connect external locations to Azure via virtual private network (VPN) or software-defined WAN (SD-WAN) appliances.
This article assumes that you have a basic understanding of Azure networking, Azure load balancers, virtual network traffic routing, and user-defined routes (UDRs).
Many design patterns use NVAs to inspect traffic between different security zones. These patterns might use NVAs for the following purposes:
- To inspect egress traffic from virtual machines to the internet and prevent data exfiltration.
- To inspect ingress traffic from the internet to virtual machines and prevent attacks.
- To filter traffic between virtual machines in Azure to prevent lateral movement from compromised systems.
- To filter traffic between on-premises systems and Azure virtual machines, especially if they belong to different security levels. For example, Azure hosts the perimeter network, while the on-premises environment hosts the internal applications.
- To terminate VPN or SD-WAN tunnels from external locations, such as on-premises networks or other public clouds.
You can add the following NVAs to an Azure design by using the patterns in this article:
- Network firewalls
- Layer-4 reverse proxies
- Internet Protocol Security (IPsec) VPN endpoints
- SD-WAN appliances
- Web-based reverse proxies that have Web Application Firewall functionality
- Internet proxies to restrict which internet pages can be accessed from Azure
- Layer-7 load balancers
Azure-native NVAs, such as Azure Firewall and Azure Application Gateway, use the designs explained later in this article. You should understand these options from a design perspective and for network troubleshooting purposes.
NVAs often require high availability because they control the communication between network segments. If NVAs become unavailable, network traffic can't flow and applications stop working. Scheduled and unscheduled outages occasionally shut down NVA instances, similar to other virtual machines in Azure or in other clouds. The NVA instances might shut down even if you configure them with Azure Premium SSDs, which provide a single-instance service-level agreement in Azure. Highly available applications require at least two NVAs to help ensure connectivity.
When you choose the best option to deploy an NVA into an Azure virtual network, the most important aspect is whether the NVA vendor has evaluated and validated their design. The vendor must also provide the required NVA configuration to integrate the NVA into Azure. If the NVA vendor provides multiple supported design options, consider the following factors to make your decision:
- Convergence time: The time that it takes each design to reroute traffic away from a failed NVA instance
- Topology support: The NVA configurations that each design option supports, such as active/active, active/standby, or scale-out NVA clusters that have an extra unit for redundancy
- Traffic symmetry: Whether a particular design forces the NVA to perform Source Network Address Translation (SNAT) on the packets to avoid asymmetric routing, or whether the design enforces traffic symmetry by other means
Note
This article focuses on hub-and-spoke designs. This article doesn't cover Azure Virtual WAN because it has more prescriptive guidelines for deploying NVAs, depending on whether a Virtual WAN hub supports a specific NVA. For more information, see NVAs in a Virtual WAN hub.
The following sections describe common architectures that you can use to integrate NVAs into a hub-and-spoke network.
High availability architectures overview
| Solution | Benefits | Considerations |
|---|---|---|
| Azure Load Balancer | This solution supports active/active and active/standby configurations, and scale-out NVAs with good convergence time. | The NVA needs to provide a port for the health probes, especially for active/standby deployments. For stateful appliances, such as firewalls that require traffic symmetry, flows to and from the internet require SNAT. |
| Azure Route Server | This solution supports active/active, active/standby, and scale-out NVAs. | The NVA must support Border Gateway Protocol (BGP). Traffic symmetry requires SNAT in this solution. |
| Azure Gateway Load Balancer | Traffic symmetry is guaranteed without SNAT. NVAs can be shared across tenants. This solution has a good convergence time and supports active/active, active/standby, and scale-out NVAs. | This solution supports flows to and from the internet and doesn't support East-West flows. |
| Dynamic public IP address and UDR management | The NVA doesn't require special features. This solution guarantees symmetric traffic. | This solution is only for active/standby designs. It has a high convergence time of one to two minutes. |
Load Balancer
The Load Balancer design uses two Azure load balancers to expose a cluster of NVAs to the rest of the network. The approach suits both stateful and stateless NVAs.
An internal load balancer redirects internal traffic from Azure and on-premises to the NVAs. This internal load balancer is configured with high availability ports rules so that every Transmission Control Protocol (TCP) and User Datagram Protocol (UDP) port is redirected to the NVA instances.
A public load balancer exposes the NVAs to the internet. Because high availability ports rules aren't available on public load balancers, each individual TCP or UDP port must be opened in a dedicated load-balancing rule.
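The following sketch shows what the internal load balancer side of this design might look like when created with the azure-mgmt-network Python SDK. It's a minimal illustration, not a production template: the subscription ID, resource group, virtual network, subnet, region, and the TCP port that the NVA exposes for health probes (8443) are all assumptions, and the sketch assumes the track-2 SDK's support for dictionary payloads with model field names.

```python
# Minimal sketch: internal Standard Load Balancer with a high availability ports rule
# in front of an NVA cluster. All names, IDs, and the probe port are placeholders.
from azure.identity import DefaultAzureCredential
from azure.mgmt.network import NetworkManagementClient

SUB = "<subscription-id>"
RG = "rg-hub"                      # assumed resource group
LB = "lb-nva-internal"             # assumed internal load balancer name
SUBNET_ID = (f"/subscriptions/{SUB}/resourceGroups/{RG}/providers/"
             "Microsoft.Network/virtualNetworks/vnet-hub/subnets/snet-nva")
LB_ID = (f"/subscriptions/{SUB}/resourceGroups/{RG}/providers/"
         f"Microsoft.Network/loadBalancers/{LB}")

client = NetworkManagementClient(DefaultAzureCredential(), SUB)

client.load_balancers.begin_create_or_update(RG, LB, {
    "location": "eastus",
    "sku": {"name": "Standard"},        # high availability ports require the Standard SKU
    "frontend_ip_configurations": [{
        "name": "fe-internal",
        "subnet": {"id": SUBNET_ID},    # frontend in the NVA subnet makes this LB internal
        "private_ip_allocation_method": "Dynamic",
    }],
    "backend_address_pools": [{"name": "nva-pool"}],
    "probes": [{
        "name": "nva-health",
        "protocol": "Tcp",
        "port": 8443,                   # assumed port that the NVA exposes for health probes
        "interval_in_seconds": 5,       # probe every 5 seconds
        "number_of_probes": 3,          # three failures mark the instance as down
    }],
    "load_balancing_rules": [{
        "name": "ha-ports",
        "protocol": "All",              # protocol All + ports 0 = high availability ports rule
        "frontend_port": 0,
        "backend_port": 0,
        "frontend_ip_configuration": {"id": f"{LB_ID}/frontendIPConfigurations/fe-internal"},
        "backend_address_pool": {"id": f"{LB_ID}/backendAddressPools/nva-pool"},
        "probe": {"id": f"{LB_ID}/probes/nva-health"},
    }],
}).result()
```

The NVA instances' NICs still need to be added to the backend pool, and the public load balancer for the internet-facing side is created the same way, except with per-port load-balancing rules instead of the high availability ports rule.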
The following diagram shows the sequence of hops that packets take from the internet to an application server in a spoke virtual network. These packets traverse a firewall NVA to control traffic to and from the public internet, also called North-South traffic.
Download a Visio file of this architecture.
To send traffic from spokes to the public internet through the NVAs, this design uses a UDR for `0.0.0.0/0` with the internal load balancer's IP address as the next hop.
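As a rough illustration of that UDR, the following sketch creates a spoke route table with a default route pointing at the internal load balancer's frontend IP. The resource group, route table name, region, and the frontend IP (10.0.1.100) are placeholders for illustration.

```python
# Minimal sketch of the spoke UDR: send 0.0.0.0/0 to the internal load balancer
# that fronts the NVA cluster. Names, region, and IPs are placeholders.
from azure.identity import DefaultAzureCredential
from azure.mgmt.network import NetworkManagementClient

subscription_id = "<subscription-id>"
client = NetworkManagementClient(DefaultAzureCredential(), subscription_id)

client.route_tables.begin_create_or_update(
    "rg-spoke1",              # assumed resource group of the spoke
    "rt-spoke1-workload",     # assumed route table name
    {
        "location": "eastus",
        "routes": [{
            "name": "default-via-nva",
            "address_prefix": "0.0.0.0/0",        # everything not matched by a more specific route...
            "next_hop_type": "VirtualAppliance",  # ...goes to a virtual appliance next hop
            "next_hop_ip_address": "10.0.1.100",  # internal load balancer frontend IP (assumed)
        }],
    },
).result()
# The route table still has to be associated with the spoke workload subnet,
# for example with client.subnets.begin_create_or_update(...).
```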
For traffic between Azure and the public internet, each direction of the traffic flow crosses a different Azure load balancer: the ingress packet arrives through the public load balancer, and the egress packet leaves through the internal load balancer. This behavior applies even if the firewall NVA has a single network interface card (NIC) for both the public and internal networks. Because the two directions of a flow traverse different load balancers, the NVA instances must perform SNAT to attract the return traffic and ensure traffic symmetry, which most firewalls require.
The following diagram shows how to use the same load balancer design to inspect traffic between Azure and on-premises networks, or East-West traffic, which involves only an internal load balancer. You can also use this method to send traffic between spokes through the NVAs.
In the previous diagrams, spoke1 doesn't know about spoke2's range. Therefore, the `0.0.0.0/0` UDR sends traffic that's intended for spoke2 to the NVA's internal Azure load balancer.
For traffic between on-premises networks and Azure, or between Azure virtual machines, the internal Azure load balancer guarantees traffic symmetry for single-NIC NVAs: when both directions of a traffic flow traverse the same Azure load balancer, the load balancer selects the same NVA instance for both directions. If a dual-NIC NVA design uses a separate internal load balancer for each direction of communication, SNAT is required to ensure traffic symmetry, as in the North-South design shown earlier.
In this design, dual-NIC NVAs must determine where to send replies to the load balancer's health checks. Load Balancer always uses the same source IP address for the health checks, `168.63.129.16`, and the NVA must send its responses back through the same interface that received the probe. This typically requires multiple routing tables in the operating system, because destination-based routing alone would send the replies to `168.63.129.16` through the same NIC every time, regardless of which NIC received the probe.
The Azure load balancer converges quickly when an individual NVA instance fails. You can send health probes every five seconds, and it takes three failed probes to declare a back-end instance out of service, so it usually takes 10 to 15 seconds for the Azure load balancer to shift traffic to a different NVA instance.
This setup supports both active/active and active/standby configurations. For active/standby configurations, the NVA instances need to provide either a TCP or UDP port or an HTTP endpoint that responds only to the load balancer health probes for the instance currently in the active role.
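To illustrate that active/standby behavior, the following toy responder answers the health probe with 200 only while the instance holds the active role, so the load balancer sends traffic only to the active NVA. The `is_currently_active` check and the probe port (8080) are hypothetical placeholders; real NVAs implement this through their own high-availability mechanisms.

```python
# Toy sketch of an HTTP health endpoint that responds successfully only while this
# NVA instance is active. The active-role check is a hypothetical placeholder.
from http.server import BaseHTTPRequestHandler, HTTPServer

def is_currently_active() -> bool:
    """Hypothetical placeholder for the vendor-specific query of the NVA's HA role."""
    return True  # replace with the real active/standby check

class ProbeHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        if is_currently_active():
            self.send_response(200)   # healthy: the load balancer keeps sending traffic here
        else:
            self.send_response(503)   # standby: the probe fails, traffic shifts to the active node
        self.end_headers()

    def log_message(self, *args):     # keep probe traffic out of the console log
        pass

if __name__ == "__main__":
    # Listen on the port that the load balancer health probe targets (8080 is an assumption).
    HTTPServer(("0.0.0.0", 8080), ProbeHandler).serve_forever()
```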
Layer-7 load balancers
A specific design for security appliances replaces the Azure public load balancer with a Layer-7 load balancer, such as Azure Application Gateway, which can be considered an NVA itself.
In this scenario, the NVAs only require an internal load balancer for traffic from the workload systems. Dual-NIC devices sometimes use this approach to avoid routing problems with the Azure load balancer's health checks. This design supports only the Layer-7 protocols that the Layer-7 load balancer supports, which is typically HTTP and HTTPS.
The NVA should handle inbound traffic for protocols that the Layer-7 load balancer doesn't support. The NVA might also handle egress traffic. For more information, see Firewall and Application Gateway for virtual networks.
Route Server
Route Server is a service that enables an NVA to interact with Azure software-defined networking via BGP. NVAs learn which IP address prefixes exist in Azure virtual networks, and they can inject routes into the effective route tables of virtual machines in Azure.
In the previous diagram, each NVA instance connects to Route Server via BGP. This design doesn't require a route table in the spoke subnets because Route Server programs the routes that the NVAs advertise. If two or more routes are programmed in the Azure virtual machines, they use equal-cost multi-path routing to choose one of the NVA instances for every traffic flow. Therefore, you must include SNAT in this design if you require traffic symmetry.
This insertion method supports both active/active and active/standby configurations. In an active/active configuration, all NVAs advertise the same routes to Route Server. In an active/standby configuration, one NVA advertises routes with a shorter Autonomous System (AS) path than the other. Route Server supports a maximum of eight BGP adjacencies. So if you use a scale-out cluster of active NVAs, this design supports a maximum of eight NVA instances.
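The following sketch shows roughly how the BGP peerings between Route Server and two NVA instances might be created with the Python SDK. It assumes the `virtual_hub_bgp_connection` operations group of azure-mgmt-network (Route Server is modeled as a virtual hub resource); the resource names, NVA IP addresses, and the ASN (65001) are placeholders. The BGP configuration on the NVA side is vendor specific and isn't shown.

```python
# Hedged sketch: create one Route Server BGP connection per NVA instance.
# Names, IPs, and the ASN are placeholders for illustration.
from azure.identity import DefaultAzureCredential
from azure.mgmt.network import NetworkManagementClient

subscription_id = "<subscription-id>"
client = NetworkManagementClient(DefaultAzureCredential(), subscription_id)

RESOURCE_GROUP = "rg-hub"      # assumed resource group
ROUTE_SERVER = "rs-hub"        # assumed Route Server name (a virtual hub resource)

# One BGP connection per NVA instance; Route Server allows up to eight of them.
nva_peers = {
    "bgp-nva1": "10.0.1.4",    # assumed private IP of NVA instance 1
    "bgp-nva2": "10.0.1.5",    # assumed private IP of NVA instance 2
}

for name, peer_ip in nva_peers.items():
    client.virtual_hub_bgp_connection.begin_create_or_update(
        RESOURCE_GROUP,
        ROUTE_SERVER,
        name,
        {"peer_asn": 65001, "peer_ip": peer_ip},   # the NVAs' ASN is an assumption
    ).result()
```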
This setup has a fast convergence time. The keepalive and holdtime timers of the BGP adjacency influence the convergence time. Route Server has default keepalive timers set to 60 seconds and holdtime timers set to 180 seconds. But the NVAs can negotiate lower timers during the BGP adjacency establishment. Setting these timers too low could lead to BGP instabilities.
This design suits NVAs that need to interact with Azure routing. Examples include SD-WAN or IPsec NVAs, which typically have good BGP support. These NVAs need to learn the prefixes configured in Azure virtual networks or advertise certain routes over ExpressRoute private peerings. These types of appliances are usually stateless, so traffic symmetry isn't a problem and SNAT isn't required.
Gateway Load Balancer
Gateway Load Balancer provides a way of inserting NVAs in the data path without the need to route traffic by using UDRs. For virtual machines that expose their workloads via an Azure load balancer or a public IP address, you can redirect inbound and outbound traffic transparently to a cluster of NVAs located in a different virtual network. The following diagram shows the path that packets follow for inbound traffic from the public internet if the workloads expose the application via an Azure load balancer.
This NVA injection method provides the following benefits:
- This method doesn't require SNAT to guarantee traffic symmetry.
- You can use the same NVAs to inspect traffic to and from different virtual networks, which provides multitenancy from the NVA perspective.
- Virtual network peering isn't required between the NVA virtual network and the workload virtual networks, which simplifies configuration.
- UDRs aren't required in the workload virtual network, which also simplifies configuration.
You can use service injection via Gateway Load Balancer for inbound traffic to an Azure public load balancer, its return traffic, and outbound traffic from Azure. East-West traffic between Azure virtual machines can't use Gateway Load Balancer for NVA injection.
In the NVA cluster, Azure load balancer health check probes detect failures in individual NVA instances, which provides a quick convergence time of 10 to 15 seconds.
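On the workload side, the chaining described earlier in this section comes down to one property: the workload load balancer's frontend references the Gateway Load Balancer's frontend. The following sketch assumes that both load balancers already exist, uses placeholder resource group and load balancer names, and assumes that the `gateway_load_balancer` field of the frontend IP configuration is the SDK property that implements the chaining.

```python
# Hedged sketch: chain an existing workload (public) load balancer to a Gateway
# Load Balancer so that its traffic is steered through the NVA cluster.
from azure.identity import DefaultAzureCredential
from azure.mgmt.network import NetworkManagementClient
from azure.mgmt.network.models import SubResource

subscription_id = "<subscription-id>"
client = NetworkManagementClient(DefaultAzureCredential(), subscription_id)

# The Gateway Load Balancer that fronts the NVA cluster (provider side) -- assumed names.
gwlb = client.load_balancers.get("rg-nva", "lb-nva-gateway")

# The workload's public load balancer (consumer side) -- assumed names.
app_lb = client.load_balancers.get("rg-app", "lb-app-public")

# Point the workload frontend at the Gateway Load Balancer frontend. Inbound traffic
# and its return traffic are then transparently redirected through the NVAs.
app_lb.frontend_ip_configurations[0].gateway_load_balancer = SubResource(
    id=gwlb.frontend_ip_configurations[0].id
)
client.load_balancers.begin_create_or_update("rg-app", "lb-app-public", app_lb).result()
```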
Dynamic public IP address and UDR management
The goal of this design is to have a setup that functions without NVA redundancy and can be modified if the NVA experiences downtime. The following diagram shows how an Azure public IP address associates with an active NVA (NVA1 in the diagram). The UDRs in the spokes use the active NVA's IP address (`10.0.0.37`) as the next hop.
If the active NVA becomes unavailable, the standby NVA calls the Azure API to remap the public IP address and the spoke UDRs to itself, or take over the private IP address. These API calls can take several minutes to be effective. This design provides the worst convergence time among the options in this article.
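The following sketch illustrates the kind of failover routine that the standby NVA, or an external monitor watching the active instance, might run. The resource names, route name, and the standby instance's private IP address are placeholders for illustration; only the UDR update is shown in full.

```python
# Hedged sketch of the failover routine: repoint the spoke UDR's default route from the
# failed NVA to the standby NVA. Names and IPs are placeholders for illustration.
from azure.identity import DefaultAzureCredential
from azure.mgmt.network import NetworkManagementClient

subscription_id = "<subscription-id>"
client = NetworkManagementClient(DefaultAzureCredential(), subscription_id)

STANDBY_NVA_IP = "10.0.0.38"   # assumed private IP of the standby instance (NVA2)

client.routes.begin_create_or_update(
    "rg-spoke1",               # assumed resource group of the spoke route table
    "rt-spoke1-workload",      # assumed route table name
    "default-via-nva",         # assumed route name
    {
        "address_prefix": "0.0.0.0/0",
        "next_hop_type": "VirtualAppliance",
        "next_hop_ip_address": STANDBY_NVA_IP,
    },
).result()
# Moving the public IP address works the same way: dissociate it from the failed NVA's NIC
# and associate it with the standby NVA's NIC by updating the NIC IP configurations with
# client.network_interfaces.begin_create_or_update(...). Each of these calls can take
# minutes to complete, which is why this design converges the slowest.
```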
This design supports only active/standby configurations, which can lead to scalability problems. If you need to increase the bandwidth that your NVAs support, you must scale up both instances.
This design doesn't require SNAT to guarantee traffic symmetry because only one NVA is active at any given time.
Contributors
Microsoft maintains this article. The following contributors originally wrote it.
Principal authors:
- Keith Mayer | Principal Cloud Solution Architect
- Telmo Sampaio | Principal Service Engineering Manager
- Jose Moreno | Principal Engineer