Alternative to using Azure SQL Managed Instance for Complex calculations

Kman 41 Reputation points
2020-09-02T16:31:09.39+00:00

I have inherited an Architecture which carries out complex calculation in Azure SQL Managed Instance via Databricks. The Databricks is connected via Apache Spark connector. The Databricks does the calculations in SQL and the end results are in Tables and Views in SQL. This process takes hours.

I just wanted to know if there is another efficient way to do this?

I would have preferred to get this done in Databricks but is this the best way in terms of cost? I have heard carrying out complex calculations in .NET is more optimal but is it?

I am happy to use any other Azure Resources to carry out the calculation providing its is faster.

Any help or ideas would be helpful.

Azure SQL Database
Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,630 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,047 questions
{count} votes

1 answer

Sort by: Most helpful
  1. MartinJaffer-MSFT 26,051 Reputation points
    2020-09-04T02:12:28.973+00:00

    Part of the pain is all the data movement. I see your data comes from ADLS originally. Does the Managed Instance add value for you?

    There are a number of services to consider. Chief among them is Azure Synapse (Workspace). Everything big data seems to be converging in Azure Synapse. This includes Spark and Machine Learning. The linked document says it better than I can. It may be possible to replace both your Managed SQL and Databricks.

    Azure Synapse is still new, and there are so many options, such as on-demand SQL and pausing pools, it is hard to say what the optimal solution is. However the more I think about it, the more I am convinced Azure Synapse is something you should take a close look at.

    If your ADLS is Gen2, as opposed to Gen1, then you can ingest directly from the Storage Account instead of going thru the Managed Instance.