Share via

Azure SQL Hyperscale – Compute/Storage Decoupling and Archive Use Case

Anonymous
2025-08-22T10:48:41.4233333+00:00

We are evaluating Azure SQL Hyperscale for storing approximately 25 TB of archive tables. The primary considerations are:

  1. Can Hyperscale (serverless) truly decouple compute and storage, such that storage is always billed but compute charges drop to near-zero when the database is idle?
  2. Are there any limitations on separating archive tables into a dedicated Hyperscale database to optimize cost?
  3. What is the typical cold-start latency when resuming compute from idle/paused state in Hyperscale serverless?
  4. Can we separate hot data and cold data in Hyperscale?
  5. For archive workloads that are queried only occasionally (e.g., auditor reports a few times a month), is Hyperscale more cost-effective than keeping archive data in ADLS + querying via Databricks/Synapse?
  6. Are there recommended best practices for balancing Hyperscale vs ADLS in long-term healthcare data retention scenarios?

do we need to know something more on this ?

Azure SQL Database

1 answer

Sort by: Most helpful
  1. Saraswathi Devadula 15,940 Reputation points Microsoft External Staff Moderator
    2025-08-22T11:39:25.66+00:00

    Hello Janice Chi
    Kindly please review the below information,

    1. Can Hyperscale (serverless) truly decouple compute and storage, such that storage is always billed but compute charges drop to near-zero when the database is idle?
    1. Are there any limitations on separating archive tables into a dedicated Hyperscale database to optimize cost?
    • You can absolutely isolate archive tables into their own Hyperscale database. This would let you scale and configure compute bounds (even using serverless) specifically for the archive workload.
    • Migration between tiers or adjusting settings (like backup redundancy) may require redeployment. Changing long-term retention or redundancy (LRS/GRS) in Hyperscale isn't possible after provisioning.
    1. What is the typical cold-start latency when resuming compute from idle/paused state in Hyperscale serverless?
    1. Can we separate hot data and cold data in Hyperscale?
    • Yes, technically you can partition workloads:
    • Keep “hot” data in provisioned compute (more responsive).
    • Spin off cold/archive tables into a Hyperscale database with lower compute tiers—or serverless in the future.
    • This enables better resource allocation and cost control.

    5.For archive workloads that are queried only occasionally (e.g., auditor reports a few times a month), is Hyperscale more cost-effective than keeping archive data in ADLS + querying via Databricks/Synapse?

    • Hyperscale serverless (when active) could make sense for intermittent SQL queries—compute is billed per second, and you avoid always-on compute costs. But without auto-pause today, compute still runs even when idle (unless you manually scale it to minimal provisioned).
    • Meanwhile, ADLS + Databricks/Synapse offers a very cost-effective solution for rarely accessed, large-scale data. Storage in ADLS is extremely cheap (especially cold tiers), and compute (like Databricks serverless SQL or Synapse SQL Serverless) can be fired up only when needed.

    6.Are there recommended best practices for balancing Hyperscale vs ADLS in long-term healthcare data retention scenarios?

    1. Use ADLS (cold or archive-tier) for long-term, infrequently accessed healthcare data. It’s compliant, cost-effective, and scalable.
    2. Provision an interactive compute layer:
      • If you need SQL-like access, use Azure Synapse serverless SQL Pools or Databricks SQL endpoints: spin up compute on demand, pay only per query/job.
    3. Separate hot vs. cold:
      • Hot data lives in provisioned SQL (Hyperscale or provisioned tier).
      • Cold archive data stays in ADLS.
    4. Hybrid for rare real‑time archiving needs:
      • If occasional SQL queries are required against archives, consider loading on-demand subsets into a temporary Hyperscale (or regular SQL) instance for those sessions.
    1 person found this answer helpful.
    0 comments No comments

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.