How to execute Hive queries in Synapse spark

Dharmesh Rathod 0 Reputation points
2023-02-03T02:56:53.8133333+00:00

Hello!

I am replacing a HDI cluster with Azure Synapse. My current HDI spark cluster executes some HIVE queries for data transformation. Is it possible to execute the same HIVE queries into Azure Synapse spark pool?

Thanks,

DR

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,422 questions
Azure HDInsight
Azure HDInsight
An Azure managed cluster service for open-source analytics.
199 questions
{count} votes

1 answer

Sort by: Most helpful
  1. PRADEEPCHEEKATLA-MSFT 78,331 Reputation points Microsoft Employee
    2023-02-03T06:10:35.65+00:00

    Hello @Dharmesh Rathod

    Thanks for the question and using MS Q&A platform.

    You can execute Hive queries in Synapse Spark by connecting to an external Hive Metastore. To do this, you need to set up a linked service to the external Hive Metastore in your Synapse workspace and configure your Spark pool to use the linked external Hive Metastore. You can then use the HiveContext in your Spark code to execute Hive queries.

    For more details, refer to Use external Hive Metastore for Synapse Spark Pool

    Hope this helps. Do let us know if you any further queries.


    Please don’t forget to Accept Answer wherever the information provided helps you, this can be beneficial to other community members.