Does synapse serverless sql pool OPENROWSET support "parquest based structured stream" format?

Salander He 66 Reputation points Microsoft Employee
2024-05-08T03:27:38.5+00:00

I can read from a data lake structured stream using synapse serverless sql pool OPENROWSET, by specifying FORMAT = 'SSTREAM'. Does OPENROWSET support "parquest based structured stream" format? If yes, then what value should I specify to FORMAT argument?

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,376 questions
Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,486 questions
0 comments No comments
{count} votes

Accepted answer
  1. PRADEEPCHEEKATLA-MSFT 80,251 Reputation points Microsoft Employee
    2024-05-08T07:06:52.3866667+00:00

    @Salander He - Thanks for the question and using MS Q&A platform.

    Unfortuantely, we don’t support querying parquet based SStream. We do work on that feature, and we will support it in future and I will update once it's available.

    According to the documentation: How to use OPENROWSET using serverless SQL pool in Azure Synapse Analytics, OPENROWSET in Synapse serverless SQL pool supports CSV, Parquet, and Delta formats.

    You have three choices for input files that contain the target data for querying. Valid values are:

    • 'CSV' - Includes any delimited text file with row/column separators. Any character can be used as a field separator, such as TSV: FIELDTERMINATOR = tab.
    • 'PARQUET' - Binary file in Parquet format
    • 'DELTA' - A set of Parquet files organized in Delta Lake (preview) format

    Therefore, you can read from a Parquet-based structured stream using Synapse serverless SQL pool OPENROWSET by specifying FORMAT = 'PARQUET'.

    Here is an example query that reads from a Parquet file using OPENROWSET:

    SELECT * FROM OPENROWSET( BULK 'https://yourstorageaccount.blob.core.windows.net/yourcontainer/yourfile.parquet', FORMAT = 'PARQUET' ) AS rows
    

    Hope this helps. Do let us know if you any further queries.


    If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.

    1 person found this answer helpful.
    0 comments No comments

0 additional answers

Sort by: Most helpful