Best practices to manage a Synapse Serverless SQL pool database

Question

pmscorca 1,052

Hi,

To establish a good, repeatable practice for managing a Synapse Serverless SQL pool database that I can reuse across projects, I am thinking of these steps:

to create a database --> CREATE DATABASE myserverlesssqlpool;
to create a master key for the database --> CREATE MASTER KEY (without a password? Or with a password, and if so, where can it be stored securely?);
to create a database credential --> CREATE DATABASE SCOPED CREDENTIAL my_cred_synapse WITH IDENTITY = 'MANAGED IDENTITY' (I prefer this setting for IDENTITY);

and so on.

Any suggestions, please? Thanks

Smaran Thoomu 24,260 Reputation points Microsoft External Staff Moderator

2024-02-22T04:20:11.77+00:00

@pmscorca Just checking in to see if the below answer provided by Debarchan Sarkar - MSFT helped.

If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.

Answer 1

Creating a database in Synapse Serverless SQL pool is different from traditional SQL Server since the databases are actually stored in Azure storage (Data Lake or Blob Storage), and you don't use CREATE DATABASE syntax. In Synapse, you will directly query against those data stores. Here are some best practices for managing Synapse Serverless SQL pools:

Access Data: Use OPENROWSET to access your data stored in Azure storage services. You can create an external table as well, but with serverless SQL pool, it's not necessary.


SELECT *
FROM OPENROWSET(
    BULK 'https://myaccount.dfs.core.windows.net/myfilesystem/myfile.csv',
    FORMAT = 'CSV'
) AS result;
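As a slightly fuller sketch of the same query, the variant below assumes the CSV file has a header row and uses the newer CSV parser. The storage URL is the placeholder from the example above; whether your file actually has a header row is an assumption to check against your data.

```sql
-- Sketch only: same placeholder URL as above; adjust to your storage account.
-- HEADER_ROW = TRUE requires PARSER_VERSION = '2.0' and takes column names
-- from the first line of the file.
SELECT TOP 10 *
FROM OPENROWSET(
    BULK 'https://myaccount.dfs.core.windows.net/myfilesystem/myfile.csv',
    FORMAT = 'CSV',
    PARSER_VERSION = '2.0',
    HEADER_ROW = TRUE
) AS result;
```

Limiting the result with TOP while exploring keeps the amount of data processed, and therefore the cost, small.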

Authentication: For authenticating, you typically use Managed Identity. Managed identities provide an Azure Active Directory identity for your Synapse workspace, simplifying password management.


CREATE DATABASE SCOPED CREDENTIAL [MyManagedIdentity]
WITH IDENTITY = 'Managed Identity';
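To connect this back to the master key part of the question, here is a minimal sketch of how the credential is typically wired up in a user database. The credential name my_cred_synapse comes from the question; the data source name my_data_lake and the password placeholder are hypothetical. It also assumes the workspace managed identity has been granted read access on the storage account (for example the Storage Blob Data Reader role).

```sql
-- Sketch only: run in a user database, not in master.
-- The password is a placeholder; keep the real value in a secret store,
-- not in the project's source files.
CREATE MASTER KEY ENCRYPTION BY PASSWORD = '<strong-password-here>';

-- Credential that makes the pool access storage as the workspace managed identity.
CREATE DATABASE SCOPED CREDENTIAL my_cred_synapse
WITH IDENTITY = 'Managed Identity';

-- Hypothetical data source name; the location reuses the placeholder account.
CREATE EXTERNAL DATA SOURCE my_data_lake
WITH (
    LOCATION = 'https://myaccount.dfs.core.windows.net/myfilesystem',
    CREDENTIAL = my_cred_synapse
);

-- Queries can then use a path relative to the data source.
SELECT TOP 10 *
FROM OPENROWSET(
    BULK 'myfile.csv',
    DATA_SOURCE = 'my_data_lake',
    FORMAT = 'CSV',
    PARSER_VERSION = '2.0'
) AS result;
```

Referencing the data source instead of a full URL keeps the storage location in one place, which is convenient when the same scripts are deployed to several environments.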

Data Lake Firewall: If your Data Lake has firewall enabled, remember to add your Synapse workspace's managed private endpoint to the firewall allow list.

Parquet Format: When dealing with large datasets, consider using Parquet format which provides both a performance boost and cost savings.

Use Views: To simplify access to multiple files or to hide complexity of your data lake, use views. They are easy to manage and can be secured using ACLs.

Optimize File Size: When writing data back to the data lake, try to optimize the size of output files. Aim for larger files (between 256 MB and 1 GB) to improve read performance.

Securing Data: Sensitive data should be secured. Consider using static data masking or dynamic data masking to hide sensitive data. Always encrypt data at rest and in transit.

Cost Management: To manage costs, monitor and analyse your usage regularly. Also, set up alerts for when you reach certain budget thresholds.
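The Parquet and view recommendations can be combined. The sketch below wraps a folder of Parquet files in a view; the view name dbo.sales_view and the sales/ folder are hypothetical, and the storage URL reuses the placeholder account from the earlier example.

```sql
-- Sketch only: create the view in a user database (views cannot live in master).
-- The wildcard reads every Parquet file in the hypothetical sales/ folder.
CREATE VIEW dbo.sales_view AS
SELECT *
FROM OPENROWSET(
    BULK 'https://myaccount.dfs.core.windows.net/myfilesystem/sales/*.parquet',
    FORMAT = 'PARQUET'
) AS result;
```

Consumers can then run SELECT statements against dbo.sales_view without knowing the file layout. In a real view, listing only the needed columns instead of SELECT * reduces the data scanned.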

Remember that these are general guidelines and the specifics may vary depending on the details of your project. Let me know if you need more information on this!

Smaran Thoomu 24,260 Reputation points Microsoft External Staff Moderator

2024-02-23T05:15:03.0633333+00:00

Hi @pmscorca - Just following up to see if the above answer helped. Please do consider clicking Accept Answer, as accepted answers help the community as well. Thank you.

pmscorca 1,052 Reputation points

2024-03-03T06:58:53.08+00:00

Hi,

In the article "Quickstart: Use serverless SQL pool", a CREATE DATABASE statement is used.
In a serverless SQL pool, a database is a logical container, not a physical database that stores data.

A serverless SQL pool can have several "databases", that is, several logical containers.

The focus of this post is on best practices for managing a Synapse Serverless SQL pool (logical) database, with a Visual Studio project in mind. Thanks

Smaran Thoomu 24,260 Reputation points Microsoft External Staff Moderator

2024-03-04T12:27:53.34+00:00

Hi @pmscorca

Thank you for the clarification. You are correct that in Synapse Serverless SQL pool, a database is a logical container and not a physical database to save data. You can create multiple databases within a serverless SQL pool, and each database can contain multiple tables.
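As a minimal sketch of creating such a logical database, the statement below uses the database name from the original question. The UTF-8 collation is an assumption based on the common recommendation for reading UTF-8 encoded files; choose the collation that matches your data.

```sql
-- Sketch only: run while connected to the serverless SQL pool (Built-in) endpoint.
-- The database holds metadata only (views, external tables, credentials);
-- the data itself stays in the storage account.
CREATE DATABASE myserverlesssqlpool
    COLLATE Latin1_General_100_BIN2_UTF8;
```

Objects such as credentials, external data sources, and views are then created inside this database rather than in master.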

When working with Visual Studio projects, you can use the Azure Synapse Analytics extension for Visual Studio to create and manage your serverless SQL pool databases, tables, and views. You can also use Visual Studio to develop and deploy Azure Data Factory pipelines and other Azure resources.

I hope this helps! Let me know if you have any further questions.
