Hyperscale secondary replicas

Article
03/13/2024

As described in Distributed functions architecture, Azure SQL Database Hyperscale has two different types of compute nodes, also referred to as replicas:

Primary: serves read and write operations
Secondary: provides read scale-out, high availability, and geo-replication

Secondary replicas are always read-only, and can be of three different types:

High Availability replica
Geo-replica
Named replica

Each type has a different architecture, feature set, purpose, and cost. Based on the features you need, you can use just one or even all of the three together. Secondary replicas are supported by both serverless and provisioned compute tiers.

For tutorials on configuring and managing Hyperscale named replicas, see:

High Availability replica

A High Availability (HA) replica uses the same page servers as the primary replica, so no data copy is required to add an HA replica. HA replicas are mainly used to increase database availability; they act as hot standby replicas for failover purposes. If the primary replica becomes unavailable, failover to one of the existing HA replicas is automatic and quick. The connection string doesn't need to change; during failover applications might experience minimal downtime due to active connections being dropped. As usual for this scenario, proper retry logic is recommended. Several drivers already provide some degree of automatic retry logic. If you are using .NET, the latest Microsoft.Data.SqlClient library provides native full support for configurable automatic retry logic.

HA replicas use the same server and database name as the primary replica. Their Service Level Objective is also always the same as for the primary replica. HA replicas are not visible or manageable as a stand-alone resource from the portal or from any API.

There can be zero to four HA replicas. Their number can be changed during the creation of a database or after the database has been created, via the common management endpoints and tools (for example: PowerShell, AZ CLI, Portal, REST API). Creating or removing HA replicas does not affect active connections on the primary replica.

Connect to an HA replica

In Hyperscale databases, the ApplicationIntent argument in the connection string used by the client dictates whether the connection is routed to the read-write primary replica or to a read-only HA replica. If ApplicationIntent is set to ReadOnly and the database doesn't have a secondary replica, connection will be routed to the primary replica and will default to the ReadWrite behavior.

-- Connection string with application intent
Server=tcp:<myserver>.database.windows.net;Database=<mydatabase>;ApplicationIntent=ReadOnly;User ID=<myLogin>;Password=<myPassword>;Trusted_Connection=False; Encrypt=True;

All HA replicas are identical in their resource capacity. If more than one HA replica is present, the read-intent workload is distributed arbitrarily across all available HA replicas. When there are multiple HA replicas, keep in mind that each one could have different data latency with respect to data changes made on the primary. Each HA replica uses the same data as the primary on the same set of page servers. However, local data caches on each HA replica reflect the changes made on the primary via the transaction log service, which forward log records from the primary replica to HA replicas. As the result, depending on the workload being processed by an HA replica, application of log records can happen at different speeds, and thus different replicas could have different data latency relative to the primary replica.

Named replica

A named replica, just like an HA replica, uses the same page servers as the primary replica. Similar to HA replicas, there is no data copy needed to add a named replica.

There are differences between HA replicas and named replicas:

Named replicas appear as regular (read-only) Azure SQL databases in the portal and in API (AZ CLI, PowerShell, T-SQL) calls.
Named replicas can have database name different from the primary replica, and optionally be located on a different logical server (as long as it is in the same region as the primary replica).
Named replicas have their own Service Level Objective that can be set and changed independently from the primary replica.
Named replicas support for up to 30 named replicas (for each primary replica).
Named replicas support different authentication for each named replica by creating different logins on logical servers hosting named replicas.

As a result, named replicas offer several benefits over HA replicas, for what concern read-only workloads:

Users connected to a named replica will suffer no disconnection if the primary replica is scaled up or down; at the same time, users connected to the primary replica will be unaffected by named replicas scaling up or down.
Workloads running on any replica, primary or named, will be unaffected by long-running queries running on other replicas.

The main goal of named replicas is to enable a broad variety of read scale-out scenarios, and to improve Hybrid Transactional and Analytical Processing (HTAP) workloads. Examples of how to create such solutions are available here:

OLTP scale-out sample

Aside from the main scenarios listed above, named replicas offer flexibility and elasticity to also satisfy many other use cases:

Access Isolation: you can grant access to a specific named replica, but not the primary replica or other named replicas.
Workload-dependent service level objective: as a named replica can have its own service level objective, it is possible to use different named replicas for different workloads and use cases. For example, one named replica could be used to serve Power BI requests, while another can be used to serve data to Apache Spark for Data Science tasks. Each one can have an independent service level objective and scale independently.
Workload-dependent routing: with up to 30 named replicas, it is possible to use named replicas in groups so that an application can be isolated from another. For example, a group of four named replicas could be used to serve requests coming from mobile applications, while another group two named replicas can be used to serve requests coming from a web application. This approach would allow a fine-grained tuning of performance and costs for each group.

Note

For frequently asked questions on Hyperscale named replicas, see Azure SQL Database Hyperscale named replicas FAQ.

Zone redundancy for Hyperscale named replicas

Note

Zone redundancy for Azure SQL Database Hyperscale named replicas is currently in preview.

Zone redundancy for Azure SQL Database Hyperscale named replicas uses Azure Availability Zones to distribute named replicas compute nodes across different physical locations within an Azure region. By choosing zone redundancy for named replicas, you can enhance the resilience of all layers of your Hyperscale databases to a wider range of failures, including datacenter outages, without any modifications of the application logic. For more information, see Hyperscale zone redundant availability.

For a tutorial to create a zone redundant Hyperscale named replica, see Create a Hyperscale named replica.

For troubleshooting and testing application fault resiliency, see Test application fault resiliency.

Geo-replica

With active geo-replication, you can create a readable secondary replica of the primary Hyperscale database in the same or in a different Azure region. Geo-replicas must be created on a different logical server. The database name of a geo-replica always matches the database name of the primary.

When creating a geo-replica, all data is copied from the primary to a different set of page servers. A geo-replica does not share page servers with the primary, even if they are in the same region. This architecture provides the necessary redundancy for geo-failovers.

Geo-replicas are used to maintain a transactionally consistent copy of the database via asynchronous replication. If a geo-replica is in a different Azure region, it can be used for disaster recovery in case of a disaster or outage in the primary region. Geo-replicas can also be used for geographic read scale-out scenarios. As of October 2022, database copy from a Hyperscale geo secondary replica is supported.

Geo-replication for Hyperscale database has following current limitations:

Only one geo-replica can be created (in the same or different region).
Point in time restore of the geo-replica is not supported.
Creating geo-replica of a geo-replica (also known as "geo-replica chaining") is not supported.

Troubleshoot

Troubleshoot zone redundant Hyperscale named replicas

For troubleshooting and testing application fault resiliency, see Test application fault resiliency.
Ensure at least one high availability replica is specified when creating a zone redundant named replica, in PowerShell and CLI. For an example, see Create a Hyperscale named replica.
- In Azure CLI, you must specify both the parameters "ha-replicas" and "redundant".
- In PowerShell, you must specify the parameter "HighAvailabilityReplicaCount" and "ZoneRedundant".
- If omitted, you receive the error message: (ProvisioningDisabled) There is an insufficient number of high availability replicas to enable zone redundancy for a Hyperscale database.
The Hyperscale database should have zone redundancy already enabled as a prerequisite to enable this feature for named replicas.
- It is optional to enable zone redundancy for named replicas, even if primary database has zone redundancy enabled.
- If not enabled, you receive the error message: (DatabaseNamedReplicaSourceDatabaseNotZoneRedundant) Zone Redundancy cannot be enabled on this Named Replica since the primary Hyperscale Database is not zone redundant.

Known issues

Partially incorrect data returned from sys.databases

Row values returned from sys.databases, for named replicas, in columns other than name and database_id, might be inconsistent and incorrect. For example, the compatibility_level column for a named replica could be reported as 140 even if the primary database from which the named replica has been created is set to 150. A workaround, when possible, is to get the same data using the DATABASEPROPERTYEX() function, which will return correct data.

For tutorials on configuring and managing Hyperscale named replicas, see:

For more information, see:

Hyperscale secondary replicas

High Availability replica

Connect to an HA replica

Named replica

Zone redundancy for Hyperscale named replicas

Geo-replica

Troubleshoot

Troubleshoot zone redundant Hyperscale named replicas

Known issues

Partially incorrect data returned from sys.databases

Feedback

Feedback

Additional resources

Hyperscale secondary replicas

High Availability replica

Connect to an HA replica

Named replica

Zone redundancy for Hyperscale named replicas

Geo-replica

Troubleshoot

Troubleshoot zone redundant Hyperscale named replicas

Known issues

Partially incorrect data returned from sys.databases

Related content

Feedback

Feedback

Additional resources