2.1.8 Big Data Clusters

Microsoft SQL Server Big Data Clusters allow the deployment of scalable cluster containers that enable users to read, write, and process big data from Transact-SQL. In this way, users can easily combine and analyze high-value relational data with high-volume big data in a managed cluster environment.

The Control Plane REST API [MS-CPREST] provides methods to create a big data cluster, in which the user manages the lifecycles of the deployed resources. The Hive Metastore HTTP protocol [MS-HMSHTTP] defines a lightweight interface of data services for storing and reading metadata in Hive Metastore databases. These databases reside inside a big data cluster and are exposed outside it through the Apache Knox [ApacheKnox] access point.
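As a rough illustration of how a client application might address the control plane over HTTPS, the following Python sketch builds, but does not send, a request for a cluster status resource. The resource path `/api/v1/bdc/status`, the port `30080`, the use of Basic authentication, the credentials, and the helper name `build_status_request` are all assumptions made for this example and are not taken from this section. The authoritative resource paths and authentication requirements are defined in [MS-CPREST].

```python
import base64
import urllib.request


def build_status_request(cluster_ip: str, controller_port: int,
                         user: str, password: str) -> urllib.request.Request:
    """Build (without sending) a GET request for a cluster status resource.

    The path "/api/v1/bdc/status" and the Basic authentication scheme are
    assumptions for illustration; see [MS-CPREST] for the actual definitions.
    """
    url = f"https://{cluster_ip}:{controller_port}/api/v1/bdc/status"

    # Encode "user:password" as a Basic authentication token.
    token = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")

    req = urllib.request.Request(url, method="GET")
    req.add_header("Authorization", f"Basic {token}")
    req.add_header("Accept", "application/json")
    return req


# Hypothetical address and credentials; 203.0.113.10 is a documentation-only IP.
request = build_status_request("203.0.113.10", 30080, "admin", "example-password")
print(request.get_method(), request.full_url)
```

The request object is only constructed here, so the sketch runs without a cluster. To send it, a client would pass the object to `urllib.request.urlopen` with a TLS context suited to the deployment.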

The communication between client applications and SQL Server Big Data Clusters is shown in the following figure.

Figure 10: SQL Server Big Data Cluster architecture