Skip to main content

Get Service Mesh Certified with Buoyant.

Enroll now!
close
Cost optimization · Kubernetes

Cut cross-zone networking costs without sacrificing reliability

High Availability Zonal Load Balancing (HAZL) keeps Kubernetes traffic in-zone to cut cross-zone data transfer charges, and sends it across zones the moment reliability needs it. The cost win without the outage risk.

Provider
AWS
$0.02/GB
GCP
$0.01/GB
No. of Kubernetes clusters
Availability zones per cluster
Cross-zone traffic volume
select a provider

Select a provider to see your savings.

Without HAZL
per year
With HAZL
60% less
Savings
per year

Why your cross-zone bill is so high?

Cloud providers bill for traffic that crosses an availability-zone boundary, charged in each direction. Kubernetes spreads traffic as evenly as it can across pods, so in a typical 3-zone cluster about two-thirds of your traffic crosses a zone boundary and gets billed for it.

$100K–$1M+

Annual cross-zone charges at 1 GB/s on AWS (high-traffic cluster)

≈ 0

Cross-zone cost on Azure — in-zone routing highest-value on AWS & GCP

Why your cross-zone bill is so high?

Cloud providers bill for traffic that crosses an availability-zone boundary, charged in each direction. Kubernetes spreads traffic as evenly as it can across pods, so in a typical 3-zone cluster about two-thirds of your traffic crosses a zone boundary and gets billed for it.

~$300/yr

$100K–$1M+/yr

Every byte that crosses a zone boundary gets billed twice, once leaving, once arriving.

AWS $0.02/GB | GCP $0.01/GB | Azure≈ $0/GB

Up to 60% less egress

~$400K/yr saved

HAZL keeps traffic zone-local, eliminating most cross-zone hops before they're billed.

Based on published AWS/GCP rates. 60% reduction reflects observed HAZL deployments. Read the full analysis ↗

What you get with HAZL

HAZL (High Availability Zonal Load Balancing) is a request-level load balancer in Buoyant Enterprise for Linkerd. It balances HTTP and gRPC traffic in environments with multiple availability zones.

In-zone by default

HAZL keeps traffic in-zone under normal load, cutting the cross-zone bytes you're billed for.

Reliability preserved

It spills cross-zone the moment load or errors demand it, then pulls back, so cost never costs you uptime.

No tuning in most cases

Works with fewer than 3 pods per zone, imbalanced traffic, and autoscaling.

0% to ~100% on the same failure

In Buoyant's demo, Topology Aware Routing dropped a zone to 0% success; HAZL held near 100%.

How does HAZL work?

Topology Aware Routing (used by open-source Linkerd and Istio) keeps traffic in-zone with a static allocation that ignores live load, latency, and health, and is binary on or off. HAZL is a request-level load balancer in Buoyant Enterprise for Linkerd (BEL) that keeps traffic local when it safely can and adds cross-zone endpoints only when local load climbs, then removes them as load drops.

Zone-aware load balancing with HAZL
Without HAZLWith HAZL
Availability zone AAvailability zone Bapplicationspill cross-zone under load / on HTTP 429applicationpodmicroproxybilled in both directionscross-zone only when neededin-zone (preferred, no cross-zone charge)
podmicroproxyapplicationAvailability zone Across-zone only when neededspill cross-zone under load / on HTTP 429applicationAvailability zone Bin-zone (preferred, no cross-zone charge)billed in both directions!
in zone traffic
cross-zone traffic

Without HAZL, pods route traffic to the nearest available endpoint — regardless of availability zone. Cloud providers charge for cross-zone traffic in both directions, and these charges add up fast at scale.

Zone-aware load balancing with HAZL

Under normal load HAZL keeps requests in-zone; when the in-zone endpoint is overloaded or returning errors, it spills to another zone, then returns in-zone as load recovers.

Stop paying for traffic you don't need to

Most teams turn HAZL on with no tuning and watch steady-state cross-zone traffic drop while reliability holds. Run the demo, then point it at a real cluster and measure against your own bill.

What it doesn't

✗  Drops to 0% in-zone under failure

✗  Requires ≥3 balanced pods per zone

✗  No health-based spill logic

✗  Struggles with autoscaling

✓  Free, no license needed

✓  Built into Kubernetes

What it covers

✓  Never sacrifices reliability

✓  Works with <3 pods per zone

✓  In-band health checking (HTTP/gRPC)

✓  Reads HTTP 429 as a spill signal

✓  ~1 min recovery after overload

✓  No tuning required in most cases

Why HAZL

1. Cost

2. Reliability

3. Simplicity

Cuts cost and protects reliability

HAZL is a "request-level load balancer in Buoyant Enterprise for Linkerd that balances HTTP and gRPC traffic in environments with multiple availability zones," and unlike Topology Aware Routing "never sacrifices reliability to achieve this cost reduction."

Read the docs ↗

It reacts to real load

HAZL balances on outstanding requests per endpoint and prefers local endpoints, adding cross-zone only when local load climbs. It uses in-band health checking, and reads rate-limit responses: an in-zone endpoint returning HTTP 429 is a reason to spill rather than a fast success (a BEL feature) In the same failure that dropped Topology Aware Routing to 0%, HAZL held near 100%.

Read the docs ↗

It works where TAR struggles

Fewer than 3 pods per zone, imbalanced traffic, autoscaling, and "requires no tuning or configuration" in most cases. It also preserves zone affinity across cluster boundaries.

Read the docs ↗

1. Cost

Cuts cost and protects reliability

HAZL is a "request-level load balancer in Buoyant Enterprise for Linkerd that balances HTTP and gRPC traffic in environments with multiple availability zones," and unlike Topology Aware Routing "never sacrifices reliability to achieve this cost reduction."

Read the docs ↗

2. Reliability

It reacts to real load

HAZL balances on outstanding requests per endpoint and prefers local endpoints, adding cross-zone only when local load climbs. It uses in-band health checking, and reads rate-limit responses: an in-zone endpoint returning HTTP 429 is a reason to spill rather than a fast success (a BEL feature) In the same failure that dropped Topology Aware Routing to 0%, HAZL held near 100%.

Read the docs ↗

3. Simplicity

It works where TAR struggles

Fewer than 3 pods per zone, imbalanced traffic, autoscaling, and "requires no tuning or configuration" in most cases. It also preserves zone affinity across cluster boundaries.

Read the docs ↗

Show me the evidence

Every claim is backed by a reproducible demo, a published cost model, and a CNCF track record.

Reproducible demo

Run it on a 3-zone local cluster and watch HAZL hold success rate where Topology Aware Routing drops it to 0%.

See the demo ↗

Lorem ipsum Published cost model

The cost figures above are from Buoyant's published AWS model — open for review.

Read the analysis ↗

CNCF-graduated

Buoyant created Linkerd, coined the term "service mesh," and shipped the service mesh in July 28, 2021.

Frequently asked questions

What does HAZL do?

It balances HTTP and gRPC requests to keep traffic in-zone for cost savings, while sending it cross-zone when reliability requires it.

How is it different from Topology Aware Routing?

TAR allocates zones statically and ignores live load, latency, and health, and is binary. HAZL balances on real load with in-band health checks and spills only when needed.

Is HAZL in open-source Linkerd?

No. OSS uses Kubernetes-native TAR; HAZL is a BEL feature.

How much will it save me?

It depends on cloud provider, traffic volume, and zone topology. Model it against your own cluster and your BEL cost.