Question 1

What is a Hyperscale database?

Accepted Answer

A Hyperscale database is a database in Azure SQL Database that is backed by the Hyperscale scale-out storage technology. A Hyperscale database supports up to 128 TB of data and provides high throughput and performance, as well as rapid scaling to adapt to the workload requirements. Connectivity, query processing, database engine features, and so on, work like in any other database in Azure SQL Database.

Question 2

What compute tiers and purchasing models support Hyperscale?

Accepted Answer

The Hyperscale service tier is available for single databases (both provisioned and serverless) and for elastic pools using the vCore-based purchasing model. It is not available in the DTU-based purchasing model.

Question 3

How does the Hyperscale service tier differ from the General Purpose and Business Critical service tiers?

Accepted Answer

The vCore-based service tiers are differentiated based on database availability and storage type, performance, and maximum storage size as described in resource limit comparison.

Question 4

Who should use the Hyperscale service tier?

Accepted Answer

The Hyperscale service tier is for all customers looking for higher performance and availability, fast backup and restores, fast storage, and compute scalability. This includes customers who are starting out small and growing, those running large mission-critical databases, those who are moving to the cloud to modernize their applications and customers who are already using other service tiers in Azure SQL Database.

With Hyperscale, you get:

Database size that can grow from 10 GB up to 128 TB, irrespective of the compute size.
Compute vCore resources from 2 vCores up to 128 vCores.
Fast database backups regardless of database size (backups are based on storage snapshots).
Fast database restores regardless of database size (restores are from storage snapshots).
Higher transaction log throughput regardless of database size and the number of vCores.
Read Scale-out using one or more read-only replicas, used for offloading read-only workloads or as hot standby databases.
Rapid scaling up of compute, in constant time, to be more powerful to accommodate the heavy workload and then scale down, in constant time. Scaling operations take single-digit minutes for provisioned compute, and less than a second for serverless compute, regardless of database size.
The option to pay for what you use with serverless compute, where compute is billed based on usage.

Question 5

What regions currently support Hyperscale?

Accepted Answer

The Hyperscale service tier is available in all regions where Azure SQL Database is available.

Question 6

Can I create multiple Hyperscale databases per server?

Accepted Answer

Yes. For more information and limits on the number of databases per server, see SQL Database resource limits for single and pooled databases on a server.

Question 7

What are the performance characteristics of a Hyperscale database?

Accepted Answer

The Hyperscale architecture provides high performance and throughput while supporting large database sizes.

Question 8

What is the scalability of a Hyperscale database?

Accepted Answer

Hyperscale provides rapid scalability based on your workload demand.

Scaling Up/Down

With Hyperscale, you can scale up the primary compute size in terms of resources like CPU and memory, and then scale down, in constant time. Because the storage is remote, scaling up and scaling down isn't a size-of-data operation.

Support for serverless compute provides automatic scale-up and scale-down and compute billing based on usage.
Scaling In/Out

With Hyperscale, you can use three kinds of secondary replicas to cater to read scale-out, high availability, and geo-replication requirements. This includes:
- Up to four high-availability replicas having the same compute size as primary. These serve as hot standby replicas to quickly fail over from the primary. You can also use them to offload read workloads from the primary.
- Up to 30 named replicas having the same or different compute size than the primary, to cater to various read scale-out scenarios.
- A geo-replica in a different Azure region to protect against regional outages and to enable geographic read scale-out.

Question 9

Can I mix Hyperscale and non-Hyperscale databases on the same SQL logical server?

Accepted Answer

Yes, you can.

Question 10

Does Hyperscale require my application programming model to change?

Accepted Answer

No, your application programming model stays the same as for any other MSSQL database. You use your connection string as usual and the other regular ways to interact with your Hyperscale database. Once your application is using the Hyperscale database, your application can take advantage of features such as secondary replicas.

Question 11

What transaction isolation level is the default in a Hyperscale database?

Accepted Answer

On the primary replica, the default transaction isolation level is READ COMMITTED with the READ_COMMITTED_SNAPSHOT database option (RCSI) enabled. On the secondary replicas, the isolation level is SNAPSHOT. This is the same as in any other Azure SQL database.

Question 12

Can I bring my on-premises or IaaS SQL Server license to Hyperscale?

Accepted Answer

With the new, simplified pricing in effect since December 15, 2023, the price of compute has been reduced for newly created Hyperscale databases, all serverless Hyperscale databases, and all Hyperscale elastic pools. With the new, simplified pricing, there is no need to apply Azure Hybrid Benefit (AHB) to obtain equivalent savings. Azure Hybrid Benefit (AHB) can only be applied to older (created before December 15, 2023) Hyperscale single databases with provisioned compute. For those older databases, AHB is only applicable until December 2026, after which those databases will also be billed as per the new, simplified pricing.

For more information, see Hyperscale pricing blog and Azure SQL Database Hyperscale - lower, simplified pricing.

Question 13

What kind of workloads is Hyperscale designed for?

Accepted Answer

Hyperscale works well for all workload types, including OLTP, Hybrid (HTAP), and Analytical (data mart) workloads.

Question 14

How can I choose between Azure Synapse Analytics and Azure SQL Database Hyperscale?

Accepted Answer

If you're currently running interactive analytics queries using SQL Server as a data warehouse, Hyperscale is a great option because you can host small and mid-size data warehouses (such as a few TB up to 128 TB) at a lower cost, and you can migrate your SQL Server data warehouse workloads to Hyperscale with minimal T-SQL code changes.

If you're running data analytics on a large scale with complex queries and sustained ingestion rates higher than 100 MiB/s or using Parallel Data Warehouse (PDW), Teradata, or other Massively Parallel Processing (MPP) data warehouses such as Azure Synapse Analytics, then Microsoft Fabric could be the best choice.

Ingestion or log generation rate of 150 MiB/s is available as an opt-in preview feature for premium-series and premium-series memory optimized. For more information and to opt in to 150 MiB/s, see Blog: November 2024 Hyperscale enhancements.

Question 15

Can I pause my compute at any time?

Accepted Answer

Not at this time. However you can scale your compute and the number of replicas down to reduce cost during nonpeak times, or use serverless to automatically scale compute based on usage.

Question 16

Can I provision a compute replica with extra RAM for my memory-intensive workload?

Accepted Answer

For read workloads, you can create a named replica with a higher compute size (more cores and memory) than the primary. For more information on available compute sizes, see Hyperscale storage and compute sizes.

Question 17

Can I provision multiple compute replicas of different sizes?

Accepted Answer

For read workloads, this can be achieved using named replicas.

Question 18

How many Read Scale-out replicas are supported?

Accepted Answer

You can scale the number of HA secondary replicas between 0 and 4 using Azure portal or REST API. Additionally, you can create up to 30 named replicas for many read scale-out scenarios. Each named replica can have up to 4 HA secondary replicas.

Question 19

For high availability, do I need to provision additional compute replicas?

Accepted Answer

In Hyperscale databases, data resiliency is provided at the storage level. You only need one replica (the primary) to provide resiliency. If the compute replica fails or is under maintenance, a new replica is created automatically with no data loss.

However, if there's only the primary replica, it can take a minute or two to create a new replica, vs. seconds in the case when an HA secondary replica is available. The new replica will have cold caches initially, which can result in higher storage latency and reduced query performance immediately after failover.

For mission-critical applications that require high availability with minimal failover impact, you should provision at least one HA secondary replica to ensure a hot standby replica is available to serve as a failover target.

Question 20

What is the maximum database size supported with Hyperscale?

Accepted Answer

The maximum size of a single Hyperscale database is currently 128 TB, irrespective of compute size. The maximum size of a database in a Hyperscale elastic pool is currently 100 TB.

Question 21

What is the size of the transaction log with Hyperscale?

Accepted Answer

In Hyperscale, the transaction log is practically infinite, with a restriction that the active portion of the log cannot exceed 1 TB. The active portion of the log can grow because of long-running transactions, or because of Change Data Capture processing not keeping up with the rate of data change. Avoid unnecessarily long and large transactions to stay below this limit. Other than this restriction, you don't need to worry about running out of log space on a system that has high log throughput. However, log generation rate might be reduced for continuous aggressively writing workloads. The peak sustained log generation rate is 100 MiB/s.

Log generation rate of 150 MiB/s is available as an opt-in preview feature for premium-series and premium-series memory optimized. For more information and to opt in to 150 MiB/s, see Blog: November 2024 Hyperscale enhancements.

Question 22

Does my tempdb scale as my database grows?

Accepted Answer

Your tempdb database is located on local SSD storage and is sized proportionally to the compute size (the number of cores) that you provision. The size of tempdb is not configurable and is managed for you. To determine maximum tempdb size for your database, see Hyperscale storage and compute sizes.

Question 23

Does my database size automatically grow, or do I have to manage the size of data files?

Accepted Answer

Your database size automatically grows as you insert/ingest more data.

Question 24

What is the smallest database size that Hyperscale supports?

Accepted Answer

10 GB. A Hyperscale database is created with a starting size of 10 GB and grows as needed.

Question 25

In what increments does my database size grow?

Accepted Answer

Each data file grows by 10 GB. Multiple data files can grow at the same time.

Question 26

Is the storage in Hyperscale local or remote?

Accepted Answer

In Hyperscale, data files are stored in Azure standard storage. Data is fully cached on local SSD storage, on page servers that are remote to compute replicas. In addition, compute replicas have data caches on local SSD and in memory, to reduce the frequency of fetching data from remote page servers.

Question 27

Can I manage or define files or filegroups with Hyperscale?

Accepted Answer

No. Data files are added automatically to the PRIMARY filegroup. The common reasons for creating additional filegroups do not apply in the Hyperscale storage architecture, or in Azure SQL Database more broadly.

Question 28

Can I provision a hard cap on the data growth for my database?

Accepted Answer

No.

Question 29

Is database shrink supported?

Accepted Answer

Yes, database and file shrink operations are supported in Azure SQL Database Hyperscale.

Question 30

Is data compression supported?

Accepted Answer

Yes, just like in any other Azure SQL DB database. This includes row, page, and columnstore compression.

Question 31

If I have a huge table, is table data spread out across multiple data files?

Accepted Answer

Yes. The data pages associated with a given table can end up in multiple data files, which are all part of the same filegroup. The MSSQL database engine uses proportional fill strategy to distribute data over data files.

Question 32

Can I move my existing databases in Azure SQL Database to the Hyperscale service tier?

Accepted Answer

Yes. For proofs of concept (POCs), we recommend you make a copy of your database and migrate the copy to Hyperscale.

The time required to move an existing database to Hyperscale consists of the time to copy data, and the time to replay the changes made in the source database while copying data. The data copy time is proportional to data size. The time to replay changes is shorter if the move is done during a period of low write activity.

You can convert an existing Azure SQL Database to Hyperscale in the Azure portal, Azure CLI, PowerShell, and Transact-SQL. For more information, see Convert an existing database to Hyperscale.

Reverse migration to the General Purpose service tier allows customers who have recently migrated an existing database in Azure SQL Database to the Hyperscale service tier to move back, should Hyperscale not meet their needs. While reverse migration is initiated by a service tier change, it's essentially a size-of-data operation between different architectures. Similarly to migration to Hyperscale, reverse migration is faster if done during a period of low write activity. For more information, see Reverse migrate from Hyperscale.

Question 33

Can I move my Hyperscale databases to other service tiers?

Accepted Answer

If you have previously migrated an existing Azure SQL Database to the Hyperscale service tier, you can reverse migrate it to the General Purpose service tier within 45 days of the original migration to Hyperscale. If you wish to migrate the database to another service tier, such as Business Critical, first reverse migrate to the General Purpose service tier, then modify the service tier. Reverse migration is a size-of-data operation.

Databases created in the Hyperscale service tier can't be moved to other service tiers.

Learn how to reverse migrate from Hyperscale, including the limitations for reverse migration and impacted backup policies.

Databases created in the Hyperscale service tier can't be moved to other service tiers.

Learn how to reverse migrate from Hyperscale, including the limitations for reverse migration and impacted backup policies.

Question 34

Do I lose any functionality or capabilities after migration to the Hyperscale service tier?

Accepted Answer

Yes. Some Azure SQL Database features are not supported in Hyperscale. If some of these features are enabled for your database, migration to Hyperscale could be blocked, or these features stop working after migration. For details, see Known limitations.

Question 35

Can I move my on-premises SQL Server database, or my SQL Server database in a cloud virtual machine to Hyperscale?

Accepted Answer

Yes. You can use many existing migration technologies to migrate to Hyperscale, including transactional replication, and any other data movement technologies (Bulk Copy, Azure Data Factory, Azure Databricks, SSIS). See also the Azure Database Migration Service, which supports many migration scenarios.

Question 36

What is my downtime during migration from an on-premises or virtual machine environment to Hyperscale, and how can I minimize it?

Accepted Answer

Downtime for migration to Hyperscale is the same as the downtime when you migrate your databases to other Azure SQL Database service tiers. You can use transactional replication to minimize downtime migration for databases up to a few TB in size. For very large databases (10+ TB), you can consider implementing the migration process using ADF, Spark, or other bulk data movement technologies.

Question 37

How much time would it take to bring in X amount of data to Hyperscale?

Accepted Answer

Hyperscale is capable of consuming 100 MiB/s of new/changed data, but the time needed to move data into databases in Azure SQL Database is also affected by available network throughput, source read speed, the type of load (bulk vs row-by-row), and the target database service level objective. Log generation rate of 150 MiB/s is available as an opt-in preview feature for premium-series and premium-series memory optimized. For more information and to opt in to 150 MiB/s, see Blog: November 2024 Hyperscale enhancements.

Question 38

Can I read data from blob storage and do a fast load (like Polybase in Azure Synapse Analytics)?

Accepted Answer

You can have a client application read data from Azure Storage and load data load into a Hyperscale database (just like you can with any other database in Azure SQL Database). Polybase is currently not supported in Azure SQL Database. As an alternative to provide fast load, you can use Azure Data Factory, or use a Spark job in Azure Databricks with the Spark connector for SQL. The Spark connector to SQL supports bulk insert.

It is also possible to bulk read data from Azure Blob store using BULK INSERT or OPENROWSET: Examples of Bulk Access to Data in Azure Blob Storage.

Simple or bulk logged recovery models are not supported in Hyperscale. Full recovery model is required to provide high availability and point-in-time recovery. However, Hyperscale log architecture provides better data ingest rate compared to other Azure SQL Database service tiers.

Question 39

Does Hyperscale allow provisioning multiple nodes for parallel ingesting of large amounts of data?

Accepted Answer

No. Hyperscale is a symmetric multi-processing (SMP) architecture and is not a massively parallel processing (MPP) or a multi-master architecture. You can create multiple replicas to scale out read-only workloads.

Question 40

Does Hyperscale support migration from other data sources such as Amazon Aurora, MySQL, PostgreSQL, Oracle, DB2, and other database platforms?

Accepted Answer

Yes. Azure Database Migration Service supports many migration scenarios.

Question 41

When I convert a database to Hyperscale, when does Hyperscale billing begin?

Accepted Answer

Hyperscale billing after a conversion begins only after the cutover is complete.

Question 42

When I convert to Hyperscale, can I control the disruption to my database?

Accepted Answer

Yes. Currently a preview feature, you have the ability to manually initiate the cutover when you convert a database to Hyperscale via the Azure portal, PowerShell, Azure CLI, or T-SQL. You'll only experience a short period of downtime, generally less than a minute, during the final cutover to Hyperscale.

Question 43

What SLAs are provided for a Hyperscale database?

Accepted Answer

See SLA for Azure SQL Database. We recommend adding HA secondary replicas for critical workloads. This provides faster failover, and reduces potential performance impact immediately after failover.

Question 44

Are the database backups managed for me by Azure SQL Database?

Accepted Answer

Yes.

Question 45

Does Hyperscale support Availability Zones?

Accepted Answer

Yes, Hyperscale supports zone redundant configuration. At least one HA secondary replica and the use of zone-redundant or geo-zone-redundant storage is required for enabling the zone redundant configuration for Hyperscale.

Question 46

Does Hyperscale support elastic pools?

Accepted Answer

Yes. For more information, see Hyperscale elastic pools and Blog: Hyperscale Elastic Pools are now generally available.

Question 47

How often are database backups taken?

Accepted Answer

There are no traditional full, differential, and transaction log backups for Hyperscale databases. Instead, there are regular storage snapshots of data files, with a separate snapshot cadence for each file. The generated transaction log is retained as-is for the configured retention period. At restore time, relevant transaction log records are applied to restored storage snapshots. Regardless of snapshot cadence, this results in a transactionally consistent database as of the specified point in time within the retention period, without any data loss. In effect, database backup in Hyperscale is continuous.

Question 48

Does Hyperscale support point-in-time restore?

Accepted Answer

Yes.

Question 49

What is the Recovery Point Objective (RPO)/Recovery Time Objective (RTO) for database restore in Hyperscale?

Accepted Answer

The RPO for point-in-time restore is 0 min. Most point-in-time restore operations complete within 60 minutes regardless of database size. Restore time can be longer for larger databases, and if the database experienced significant write activity before and up to the restore point in time. Issuing a restore after a recent change of storage redundancy might result in longer restore times because the restore can be a size-of-data operation in that case, and the restore time might be proportional to the database size.

Question 50

Does database backup affect compute performance on my primary or secondary replicas?

Accepted Answer

No. Backups are managed by the storage subsystem, and use storage snapshots. They do not impact user workloads.

Question 51

Can I perform geo-restore with a Hyperscale database?

Accepted Answer

Yes. Geo-restore is fully supported if geo-redundant or geo-zone-redundant storage is used. Geo-redundant storage is the default for new databases. Unlike point-in-time restore, geo-restore requires a size-of-data operation. Data files are copied in parallel, so the duration of this operation depends primarily on the size of the largest file in the database, rather than on total database size. Geo-restore time will be significantly shorter if the database is restored in the Azure region that is paired with the region of the source database. For more information, see Geo-restore for Azure SQL Database.

Question 52

Can I set up geo-replication or use failover groups with a Hyperscale database?

Accepted Answer

Yes. Geo-replication and failover groups can be set up for Hyperscale databases.

Question 53

Can I take a Hyperscale database backup and restore it to my on-premises server, or on SQL Server in a VM?

Accepted Answer

No. The storage format for Hyperscale databases is different from any released version of SQL Server, and you don't control backups or have access to them. To take your data out of a Hyperscale database, you can extract data using any data movement technologies such as Azure Data Factory, Azure Databricks, SSIS, etc.

Question 54

Will I be charged for backup storage costs in Hyperscale?

Accepted Answer

Yes. Effective May 4, 2022, backups for all new databases are charged based on the backup storage consumed and selected storage redundancy at rates captured in Azure SQL Database pricing page. For Hyperscale databases created before May 4, 2022, backups will be charged only if backup retention is set to be greater than seven days. To learn more, see Hyperscale backups and storage redundancy.

Question 55

How can I measure backup storage size in my Hyperscale database?

Accepted Answer

For details on how to measure backup storage size, see Automated Backups.

Question 56

How do I know what my backup bill will be?

Accepted Answer

To determine your backup storage bill, backup storage size is calculated periodically, and multiplied by the backup storage rate and the number of hours since the last calculation. To estimate your backup bill for a time period, multiply the billable backup storage size for every hour of the period by the backup storage rate, and add up all hourly amounts. To query relevant Azure Monitor metrics for multiple hourly intervals programmatically, use Azure Monitor REST API. Backup billing in the serverless compute tier is the same as in the provisioned compute tier.

Question 57

How will my workload influence my backup storage costs?

Accepted Answer

Backup costs will be higher for workloads that add, modify, or delete large volumes of data in the database. Conversely, workloads that are mostly read-only might have smaller backup costs.

Question 58

How can I minimize backup storage costs?

Accepted Answer

For details on how to minimize the backup storage costs, see Automated Backups.

Question 59

Can I geo-restore my Hyperscale database to another service tier, or vice-versa?

Accepted Answer

Currently, non-Hyperscale service tiers (Basic/Standard/Premium/General Purpose/Business Critical) backups cannot be geo-restored into a Hyperscale service tier and vice-versa. To convert a non-Hyperscale database to a Hyperscale database, change the service tier after a restore.

Question 60

How much write throughput can I push in a Hyperscale database?

Accepted Answer

Transaction log throughput limit is 100 MiB/s for any Hyperscale compute size. The ability to achieve this rate depends on multiple factors, including but not limited to workload type, client configuration and performance, and having sufficient compute capacity on the primary compute replica to produce log records at this rate. Log generation rate of 150 MiB/s is available as an opt-in preview feature for premium-series and premium-series memory optimized. For more information and to opt in to 150 MiB/s, see Blog: November 2024 Hyperscale enhancements.

Question 61

How many IOPS do I get on the largest compute?

Accepted Answer

IOPS and IO latency will vary depending on the workload patterns. If the data being accessed is cached in local SSD storage on the compute replica, you will see similar IO performance as in Business Critical or Premium service tiers.

Question 62

Does my throughput get affected by backups?

Accepted Answer

No. Compute is decoupled from the storage layer. This eliminates the performance impact of backup.

Question 63

Does my throughput get affected as I provision additional compute replicas?

Accepted Answer

Because the storage is shared and there is no direct physical replication happening between primary and secondary compute replicas, the throughput on the primary replica isn't directly affected by adding secondary replicas. However, the log rate for continuous and aggressive write workloads might be limited on the primary to allow log apply on secondary replicas and page servers to catch up. This avoids poor read performance on secondary replicas and long recovery after failover to an HA secondary replica.

Question 64

Is Hyperscale well suited for resource-intensive, long-running queries and transactions?

Accepted Answer

Yes. However, just like in other Azure SQL databases, connections might be terminated by very infrequent transient errors, which can abort long-running queries and roll back transactions. One cause of transient errors is when the system quickly shifts the database to a different compute node to ensure continued compute and storage resource availability, or to perform planned maintenance. Most of these reconfiguration events finish in less than 10 seconds. Applications that connect to your database should be built to expect and tolerate these infrequent transient errors by implementing retry logic. Additionally, consider configuring a maintenance window that matches your workload schedule to avoid transient errors due to planned maintenance.

Jaa

Azure SQL Database Hyperscale FAQ

General questions