Sharing Tables Between Spark Pools and Serverless SQL Pools in Synapse Analytics

Gabriel25 525 Reputation points
2024-08-23T11:31:00.59+00:00

While the Azure Synapse Analytics documentation (https://learn.microsoft.com/en-us/azure/synapse-analytics/metadata/overview) states that Apache Spark pools can access databases and tables created in serverless SQL pools, it's unclear if the reverse is true. If I create a table or external table in a serverless SQL pool, will it be automatically available to a Spark pool?

Additionally, the documentation (https://learn.microsoft.com/en-us/azure/synapse-analytics/overview-what-is) mentions that both SQL and Spark can directly access Parquet, CSV, TSV, and JSON files in the data lake. If a table is created using a JSON file in a serverless SQL pool, will it be accessible to a Spark pool?

Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.

Accepted answer
  Smaran Thoomu 24,110 Reputation points Microsoft External Staff Moderator
    2024-08-23T12:44:45.2933333+00:00

    Hi @Gabriel25
    Thanks for the question and for using the MS Q&A platform.

    If I create a table or external table in a serverless SQL pool, will it be automatically available to a Spark pool?

    No. Tables created in a serverless SQL pool are not automatically available to a Spark pool, because the metadata synchronization works in only one direction. Synapse synchronizes Spark database objects to the serverless SQL pool, so they can be queried from serverless SQL even when the Spark pool is not running, but objects created in the serverless SQL pool are not synchronized back to Spark.
    For reference, Spark Table to Serverless Pool Metadata Sync
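    To illustrate the direction in which the sync does work: a Parquet-backed table created from a Spark pool is automatically exposed in a serverless SQL database of the same name. A minimal sketch in Spark SQL (the database and table names below are examples, not anything from your workspace):

```sql
-- Run from a Spark pool notebook (Spark SQL).
-- Database and table names are hypothetical examples.
CREATE DATABASE IF NOT EXISTS mytestdb;

-- Parquet-backed Spark tables are synchronized to the serverless
-- SQL pool, where they appear as external tables in a database
-- with the same name.
CREATE TABLE mytestdb.myparquettable (id INT, name STRING) USING Parquet;
```

    After the Spark statement completes, the table can be queried from the serverless SQL pool without the Spark pool being active. The reverse (creating a table in serverless SQL and reading it as a table from Spark) is not supported.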

    If a table is created using a JSON file in a serverless SQL pool, will it be accessible to a Spark pool?

    No. Tables created in the Synapse serverless SQL pool cannot be accessed directly from outside the pool, and objects created in the SQL engine are not visible to the Spark pool. To retrieve that data from Spark, use the SQL JDBC connector, as shown below.

    # Read a serverless SQL pool table over JDBC using the Apache Spark
    # connector for SQL Server (com.microsoft.sqlserver.jdbc.spark).
    # The connection values below are placeholders - substitute your own.
    url = "jdbc:sqlserver://<workspace-name>-ondemand.sql.azuresynapse.net:1433;database=<database-name>"
    dbtable = "<schema>.<table>"
    user = "<sql-user>"
    password = "<password>"

    print("read data from SQL")
    jdbcDF = spark.read \
            .format("com.microsoft.sqlserver.jdbc.spark") \
            .option("url", url) \
            .option("dbtable", dbtable) \
            .option("user", user) \
            .option("password", password) \
            .load()

    jdbcDF.show(5)


    Hope this helps. Do let us know if you have any further queries.


    If this answers your query, do click Accept Answer and Yes for "Was this answer helpful?".

    1 person found this answer helpful.

0 additional answers
