2.1.8 Big Data Clusters

2024-10-30

Microsoft SQL Server Big Data Clusters allow the deployment of scalable cluster containers that enable users to read, write, and process big data from Transact-SQL. In this way, users can easily combine and analyze high-value relational data with high-volume big data in a managed cluster environment.

The Control Plane REST API [MS-CPREST] provides methods to create a big data cluster in which the user manages the lifecycles of resources deployed in a cluster. The Hive Metastore HTTP protocol [MS-HMSHTTP] uses a light-weight interface that defines data services to store and read metadata from Hive Metastore databases that are inside a big data cluster and are exposed out of the big data cluster through the Apache Knox [ApacheKnox] access point.

The communication between these client applications and SQL Server Big Data Clusters is shown in the following figure.

SQL Server Big Data Cluster architecture

Figure 10: SQL Server Big Data Cluster architecture

Share via

2.1.8 Big Data Clusters

Additional resources