What type of Synapse architecture should be implemented for organisations that are large and have multiple departments working on the same data?

Parameswaran Serussery Narayanan 35 Reputation points
2025-06-25T14:28:17.1866667+00:00

Hello,

I am looking for inputs in relation to the synapse strategy that needs to be adopted. I have the below scenario,

Company XYZ has different departments (Dept 1, 2, 3 and 4) that are looking to leverage big data platform like Azure Synapse. The data and insight team within these departments are made of data architects and data scientists. Each of the departments have their own requirement to develop, test, train and productionise machine learning model and propensity model on the core data. There is also a need to have the ability to share some of the data products between departments.

Current setup of Synapse Workspace is as below,

Dev environment - 1 workspace in total for all the Departments to use

Prod environment - 1 workspace in total for all the Departments to use

Synapse SQL Datawarehouse - 1 SQL DW with 1000 DWUs reserved in Dev and 1 SQL DW with 5000 DWUs reserved in Prod workspace

Number of pipelines in Prod - 735+ (owned by Dept 1)

Number of entities/artifacts in Prod - 2500+ (owned by Dept 1)

Departments intended to use the workspace setup - Dept 1, 2, 3 and 4 which includes both data architect and data scientists

With each team having their own requirement, would it be a good approach to have just one workspace and have everyone use the same workspace or would it be good to have single data lake store with the core data and each team having their own synapse workspace to carry out their own requirement?

I understand that each of the approach has its own pros and cons. Using single workspace would mean there will be resource contentions and there are hard limits on the numbers of pipelines and entities allowed in a single workspace. We have one Dept that has close to 735+ pipelines and about 2500+ entities in this single workspace. If the remaining Dept are to use the same workspace, we will soon run out of limit and Microsoft have confirmed that these are hard limits and cannot be increased through quota request like soft limits. Using a multi workspace with single data lake topology means these workspaces need to be setup and secured via different VNet but this will reduce the issues related to resource contention, with better scalability, data governance, and different dept able to promote changes in much agile and sleek release process and able to use the same core data and are also able to share the data products in the single data lake enabling other Dept to make use of the common data products. Cross data query is possible with serverless sql pool which make data sharing feasible with multi workspace implementation.

I had gone through previous Microsoft link by the programme manager (JovanPop - https://techcommunity.microsoft.com/blog/azuresynapseanalyticsblog/the-best-practices-for-organizing-synapse-workspaces-and-lakehouses/3002506) and Microsoft's chief data officer (https://medium.com/data-science/best-practices-for-organizing-synapse-workspaces-977fe14b1fdb) leaning towards the multiple workspace - single data lake topology as the best approach for such large organisations but would be good to have more inputs while I try and understand which approach would be the best suited for the above mentioned scenario with some architectural inputs.

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
5,373 questions
{count} votes

Accepted answer
  1. Chandra Boorla 14,510 Reputation points Microsoft External Staff Moderator
    2025-06-25T16:42:26.1833333+00:00

    @Parameswaran Serussery Narayanan

    Thank you for the detailed context, your analysis and understanding of the scenario are spot on.

    Given the scale at which Dept 1 is already operating (735+ pipelines and 2500+ artifacts), and with multiple other departments planning to use the same workspace, continuing with a single Synapse workspace architecture will likely lead to resource contention and hard platform limits, as you've rightly pointed out.

    Based on your scenario and Microsoft's documented best practices (including those shared by Jovan Pop and others), the recommended approach would be:

    Multi-Workspace Architecture with a Shared Data Lake (ADLS Gen2)

    This model provides:

    • Scalability by isolating workloads per department,
    • Autonomy for individual Dev/Test/Prod cycles and CI/CD per team,
    • Governance and control via centralized security and Purview integration,
    • Data sharing and reuse through Serverless SQL Pool queries and shared curated zones in the data lake.

    While this does introduce more complexity in setup (VNet integration, RBAC, and lake ACLs), the benefits in terms of long-term agility, performance, and maintainability outweigh those challenges, especially for large enterprises like yours.

    I hope this information helps. Please do let us know if you have any further queries.

    Kindly consider upvoting the comment if the information provided is helpful. This can assist other community members in resolving similar issues.

    Thank you.

    2 people found this answer helpful.

2 additional answers

Sort by: Most helpful
  1. Deleted

    This answer has been deleted due to a violation of our Code of Conduct. The answer was manually reported or identified through automated detection before action was taken. Please refer to our Code of Conduct for more information.


    Comments have been turned off. Learn more

  2. Harish R 0 Reputation points
    2025-06-26T07:15:51.6433333+00:00

    Hi,

    I would suggest to implement multi workspace ( workspace for each department ) and a shared data lake

    this would be better for scalability and better governance

    Security can be done by ACLs and easy to manage

    Workspaces should be like

    syn-dept1-dev

    syn-dept1-prod

    syn-dept2-dev etc

    Since (assuming )all departments or team share same data lake they can use

    1. data product contracts
    2. shared curated data sets
    3. Shared tables etc

    Risk of single workspace as mentioned above would be

    1.unscalable when new team gets onboarded

    1. Cross team conflicts
    2. Slower releases
    3. Difficult to enforce fine-grained team-level RBAC and networking
    4. Shared pipelines, Spark/Synapse pools may lead to degraded performance
    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.