Deployment templates for cloud-scale analytics

Cloud-scale analytics has the following core starter templates:

| Repository | Content | Required | Deployment model |
|---|---|---|---|
| Data management template | Central data management services and shared data services like data catalog and self-hosted integration runtime | Yes | One per cloud-scale analytics |
| Data landing zone template | Data landing zone shared services, including ingestion, management, and data storage services | Yes | One per data landing zone |
| Data integration template - batch processing | Additional services necessary for batch data processing | No | One or more per data landing zone |
| Data integration template - stream processing | Additional services necessary for data stream processing | No | One or more per data landing zone |
| Data product template - analytics and data science | Additional services necessary for data analytics and AI | No | One or more per data landing zone |
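
The deployment model listed for each template can be read as a small set of cardinality rules. The following Python sketch is one possible way to check a planned deployment against those rules before anything is deployed. The template keys (`data_management`, `integration_batch`, and so on), the `TemplateRule` class, and the `validate_plan` function are hypothetical names chosen for illustration and aren't part of the templates themselves.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class TemplateRule:
    required: bool  # "Required" column
    scope: str      # "platform" (once per cloud-scale analytics) or "landing_zone"
    many: bool      # True when the deployment model allows "one or more"


# Hypothetical keys that mirror the rows of the template overview.
RULES = {
    "data_management": TemplateRule(required=True, scope="platform", many=False),
    "data_landing_zone": TemplateRule(required=True, scope="landing_zone", many=False),
    "integration_batch": TemplateRule(required=False, scope="landing_zone", many=True),
    "integration_stream": TemplateRule(required=False, scope="landing_zone", many=True),
    "product_analytics": TemplateRule(required=False, scope="landing_zone", many=True),
}


def validate_plan(platform_templates, landing_zones):
    """Return a list of rule violations (empty when the plan is consistent).

    platform_templates: list of template keys deployed once for the platform.
    landing_zones: dict mapping a landing zone name to its list of template keys.
    """
    errors = []
    scopes = [("platform", "platform", platform_templates)]
    scopes += [("landing_zone", name, keys) for name, keys in landing_zones.items()]

    for scope, name, keys in scopes:
        counts = Counter(keys)
        # Flag templates that are unknown or placed in the wrong scope.
        for key in counts:
            if key not in RULES:
                errors.append(f"{name}: unknown template '{key}'")
            elif RULES[key].scope != scope:
                errors.append(f"{name}: '{key}' does not belong in this scope")
        # Check the required and cardinality rules for this scope.
        for key, rule in RULES.items():
            if rule.scope != scope:
                continue
            n = counts.get(key, 0)
            if rule.required and n == 0:
                errors.append(f"{name}: missing required '{key}'")
            if not rule.many and n > 1:
                errors.append(f"{name}: '{key}' deployed {n} times, expected one")
    return errors


if __name__ == "__main__":
    plan = {"lz1": ["data_landing_zone", "integration_batch", "integration_batch"]}
    print(validate_plan(["data_management"], plan))
```

A plan with one data management template and one data landing zone template per landing zone passes; optional integration and product templates can appear any number of times within a landing zone.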

These templates contain Azure Resource Manager templates, the templates' parameter files, and CI/CD pipeline definitions for resource deployment.

Templates can change over time due to new Azure services and requirements. Secure each repository's main branch so it remains error-free and ready for consumption and deployment. Use a development subscription to test template configuration changes before you merge feature enhancements back into your main branch.
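
One check that can run in a pull request before a change is merged is to confirm that a parameter file still matches its template. The sketch below assumes the standard Azure Resource Manager JSON layout, in which both the template and the parameter file have a top-level `parameters` object and a template parameter can declare a `defaultValue`. The function names and file names are hypothetical, and this check doesn't replace a test deployment in a development subscription.

```python
import json


def load_json(path):
    """Read a template or parameter file from disk."""
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)


def missing_parameters(template, parameter_file):
    """Template parameters that have no default and no supplied value."""
    declared = template.get("parameters", {})
    supplied = parameter_file.get("parameters", {})
    return sorted(
        name
        for name, spec in declared.items()
        if "defaultValue" not in spec and name not in supplied
    )


def unknown_parameters(template, parameter_file):
    """Supplied parameters that the template doesn't declare."""
    declared = template.get("parameters", {})
    supplied = parameter_file.get("parameters", {})
    return sorted(name for name in supplied if name not in declared)


if __name__ == "__main__":
    # Inline stand-ins for load_json("azuredeploy.json") and
    # load_json("azuredeploy.parameters.json"); the file names are assumptions.
    template = {
        "parameters": {
            "location": {"type": "string", "defaultValue": "westeurope"},
            "prefix": {"type": "string"},
        }
    }
    parameter_file = {"parameters": {"environment": {"value": "dev"}}}
    print(missing_parameters(template, parameter_file))
    print(unknown_parameters(template, parameter_file))
```

A CI step could fail the build when either list is non-empty, which catches renamed or removed parameters before they reach the main branch.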

Cost estimates

The cost estimates link to the official pricing calculator, where you can adjust the values to match your expected data volume and throughput. Each calculation shows the base cost of cloud-scale analytics for Cloud Adoption Framework for Azure. The base cost is the cost that is incurred when no data workloads run inside the respective subscriptions.
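
The distinction between base cost and workload-dependent cost can be sketched as follows. This is an illustrative model only: the `LineItem` class, the service names, and all amounts are placeholders and not real Azure prices. Take actual figures from the pricing calculator.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LineItem:
    service: str
    fixed_monthly: float      # incurred even when no data workload runs
    per_workload_hour: float  # scales with workload compute hours


def base_cost(items):
    """Monthly cost with zero workload hours, i.e. the base cost."""
    return sum(item.fixed_monthly for item in items)


def estimated_cost(items, workload_hours):
    """Base cost plus the usage-dependent share for the given hours per service."""
    usage = sum(
        item.per_workload_hour * workload_hours.get(item.service, 0.0)
        for item in items
    )
    return base_cost(items) + usage


if __name__ == "__main__":
    # Placeholder services and amounts, for illustration only.
    items = [
        LineItem("service-a", fixed_monthly=100.0, per_workload_hour=0.0),
        LineItem("service-b", fixed_monthly=20.0, per_workload_hour=0.5),
    ]
    print(base_cost(items))
    print(estimated_cost(items, {"service-b": 10}))
```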

Data management landing zone costs

In a production scenario, we recommend that you rely on the Azure Firewall and private DNS zones that are hosted in the connectivity hub of Azure landing zones. For a minimum viable product (MVP), you can instead rely on the Azure Firewall and private DNS zones that are bundled with Cloud Adoption Framework cloud-scale analytics. Therefore, we provide the following two cost calculations:

Keep in mind that the following services aren't listed in the overview, because they're either unavailable in the official pricing calculator or the pricing calculator doesn't allow you to set the compute hours to 0:

  • Azure Purview
  • Synapse Private Link hub
  • Route table
  • Network security group

Data landing zone costs

For data landing zones, refer to the following cost calculation:

You can further reduce the cost by hosting the self-hosted integration runtime (SHIR) on-premises instead of hosting it on a virtual machine scale set.

Keep in mind that the following services aren't listed in the overview, because they're either unavailable in the official pricing calculator or the pricing calculator doesn't allow you to set the compute hours to 0:

  • Azure Databricks
  • Route table
  • Network security group

Next steps

Solution accelerators for cloud-scale analytics