Azure Cosmos DB Apache Spark 2 OLTP Connector for API for NoSQL: Release notes and resources
APPLIES TO: NoSQL
You can accelerate big data analytics by using the Azure Cosmos DB Apache Spark 2 OLTP Connector for NoSQL. The Spark Connector allows you to run Spark jobs on data stored in Azure Cosmos DB. Batch and stream processing are supported.
You can use the connector with Azure Databricks or Azure HDInsight, which provide managed Spark clusters on Azure. The following table shows supported versions:
Component | Version |
---|---|
Apache Spark | 2.4.x, 2.3.x, 2.2.x, and 2.1.x |
Scala | 2.11 |
Azure Databricks (runtime version) | Later than 3.4 |
Warning
This connector supports the API for NoSQL of Azure Cosmos DB. For the Azure Cosmos DB for MongoDB, use the MongoDB Connector for Spark. For the Azure Cosmos DB for Apache Cassandra, use the Cassandra Spark connector.
Resources
Link | |
---|---|
SDK download | Download latest .jar, Maven |
Contribute to the SDK | Azure Cosmos DB Connector for Apache Spark on GitHub |
Get started | Accelerate big data analytics by using the Apache Spark to Azure Cosmos DB connector - Use Apache Spark Structured Streaming with Apache Kafka and Azure Cosmos DB |
Related content
- Frequently asked questions
- Release notes
- Learn more about Azure Cosmos DB.
- Learn more about Apache Spark.