Hi @Xi Le
Welcome to the Microsoft Q&A platform, and thanks for posting your question here.
To address the issue of duplicated files and rows in Azure Synapse Analytics, here are some steps you can take:
- Check the export settings: Make sure the Export to Data Lake feature is configured correctly. Where possible, configure Azure Synapse Link for Dataverse to save data in Delta Lake (Parquet) format, which can improve query response times.
- Review data transactions: Verify that the Create, Update, and Delete (CUD) transactions are being handled correctly. Azure Synapse Link supports continuous replication of entity and table data, including CUD transactions.
- Update versions: Make sure you have the latest cumulative updates for your finance and operations application versions. There might be fixes in the latest updates that address the duplication issue.
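- In-place updates vs. append-only writes: Check which write mode your profile uses. In-place updates upsert each record in the destination, so a record should appear only once. Append-only writes add a new row for every create, update, and delete, so the same record ID can appear several times, and downstream queries must select the latest version of each record.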
- Event-based triggers: Set the storage event trigger to fire only on the model.json file, so that your pipeline runs once at the end of each Synapse Link cycle. This helps control when data is exported and processed.
- Pipeline creation: If necessary, create a pipeline that uses the Copy data activity to move your data without shortcuts. This can help ensure that your D365 F&O tables are linked correctly and that data is copied accurately.
- Monitor file generation: Observe the pattern of file generation, especially around the Monday reset period for sales line data. This can show which process is causing the duplication.
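If the exported data does contain several rows for the same record, the duplicates can be collapsed downstream. Here is a minimal Python sketch of that idea, not an official method: it keeps the highest-version row for each record and drops records whose latest row is flagged as deleted. The column names `Id`, `versionnumber`, and `IsDelete`, and the function name `latest_rows`, are assumptions for illustration, so please check them against the columns in your own exported tables.

```python
from typing import Dict, Iterable, List


def latest_rows(
    rows: Iterable[Dict],
    key: str = "Id",                 # assumed record identifier column
    version: str = "versionnumber",  # assumed version column
    deleted: str = "IsDelete",       # assumed soft-delete flag column
) -> List[Dict]:
    """Return one row per key: the row with the highest version.

    Records whose latest row is flagged as deleted are dropped.
    """
    latest: Dict = {}
    for row in rows:
        current = latest.get(row[key])
        # Keep this row if it is the first one seen for the key
        # or if it carries a higher version than the one kept so far.
        if current is None or row[version] > current[version]:
            latest[row[key]] = row
    return [row for row in latest.values() if not row.get(deleted)]


# Hypothetical sample: record "a" was updated once, record "b" was deleted.
sample = [
    {"Id": "a", "versionnumber": 1, "IsDelete": False},
    {"Id": "a", "versionnumber": 2, "IsDelete": False},
    {"Id": "b", "versionnumber": 1, "IsDelete": False},
    {"Id": "b", "versionnumber": 3, "IsDelete": True},
]
deduplicated = latest_rows(sample)
```

The same logic can be expressed in a Synapse SQL view or a Spark notebook. The point is only that the consumer picks the latest version per record ID rather than reading every exported row.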
Reference:
https://learn.microsoft.com/en-us/power-apps/maker/data-platform/azure-synapse-link-select-fno-data
I hope this information helps you. Let me know if you have any further questions or concerns.