Failed to convert the value in 'xxxx' property to 'System.String' type in Data Lake Connector
Environment: Azure Synapse I am developing a pipeline that has a lookup to SQL that is passing the values to GetMetaData. When using @dataset.name of parameter in the dataset, it fails with "Failed to convert the value in name of parameter…
How to ignore the records by applying an auditing filed column condition using ADF Data Flows
Hi All I am building a data transamination using mapping data flows ,I have a time stamp field Like TimeStampUpdated in the target table. I want to lockup historical data with incremental data transamination and ignore the records coming in the…
From Synapse notebook, trouble connecting to Azure SQL using system managed identity
I can create a Linked Service to my Database using my Synapse system-managed identity. This works fine, and the "test connection" shows success. But for the life of me, I can't figure out how to do it in a Python notebook. Here is the…
Assert error output not writing to blob
Hello. I have built a pipeline which gathers data from a .csv file and then goes through a couple of assert activities to check the validity of the data. Valid rows are supposed to be input into the table, while assert-failure rows are setup to be…
CI/CD Powershell Azure Synapse Workspace with Azure Devops
I am working on CI/CD between Azure Synapse Workspace with Azure Devops. I have source code from someone else who work on this project before and it works well and I try to understand it and create my own project on it. right now, I have issues of the…
Error publishing notebook.
I am trying to publish my notebook. I get the following error : CreateOrUpdateNotebook failed: BadRequest. The operation failed because the entity “notebookA” is being renamed to “notebookB”. A retry of rename from exact source to target may work”. It…
source files problem from internal server to Gen2 container
Hello, I am trying to connect my interal files system to the gen2 container sink, following this instructions https://arulmouzhi.wordpress.com/2021/04/12/bringing-folder-structure-via-azure-data-factory/ The first one in Allfiles set, I can load…
How can I find help validating a certificate?
Indeed, I am a student in cloud computing in Morocco and I do not have enough means to obtain the AZ 900 certification and my family also does not have enough financial means. Could someone help me validate this certificate please?
azure synapse RBAC not working
I have just created a synapse analytics workspace, on opening it I get an error message similar to the below. From my research, I have gr anted storage blob contributor role to the synapse application as well as the user assigned managed identity. Going…
Should Serverless Spark pool be able to process less than 5 million rows quickly?
We are downloading about 10K records daily and the trying to merge them into a Deltalake file and noticing that it takes about 35-45 minutes to merge. Is that expected?
Resolving FileNotFoundError When Reading Parquet Files in Synapse Notebook
In my Synapse Notebook, I aimed to read Parquet files. However, I encountered a 'FileNotFoundError' when attempting to use a wildcard. The folder structure I intend to access is as follows: 'test/year={yyyy}/month={MM}/day={dd}/*.parquet'. Here's the…
Proper Usage of Synapse Notebook References
I'm attempting to utilize the Azure Synapse notebook reference outlined in the documentation provided here: https://learn.microsoft.com/en-us/azure/synapse-analytics/spark/apache-spark-development-using-notebooks?tabs=preview#notebook-reference In my…
We are not getting data in azure data lake Service v2 storage's container.
Hello, We are getting data in the Azure event hub sink through the POSTGRES SQL database but we are not getting files in the container which we have configured in an ADLS GEN 2 and while configuring we have an encounter that we are using a free…
Running specific pipelines from an Orchestration pipeline
How can I run individual pipelines from an Orchestration pipeline that has multiple pipelines running in parallel? I want to be able to choose which pipeline to run separately from the rest.
Azure Synapse Analytics Failed to setup debug session
Right now I'm not able to run a data flow task on our Azure Synapse environment. When I try to start a debug session I get the message Failed to setup debug session. When I trigger the pipeline instead the task fails with "Operation on target…
ADF Data Flows Flatten nested json array values are being populated as null
Hi All I am building a data transamination with ADF data flow using a nested json array of objects , but after parse and flatten the json node itOffer.item.LeadOfer.zdeal.item[].dealNumber I am seeing that the column values are populated as null . I…
Access cosmos db through azure synapse analytics notebook using system assigned managed identity linked service
i have made a linked service for cosmos db no sql using system assigned managed identity as auth type and linked service is published as well. Now when i access this linked service from synapse analytics notebook using below code it gives me this…
Seeking Expertise in Spark SQL CTE Recursive Queries in Azure Databricks
I'm currently diving deep into Spark SQL and its capabilities, and I'm facing an interesting challenge. I'm eager to learn how to write CTE recursive queries in Spark SQL, but after thorough research, it seems that Spark doesn't natively support…
Data flow failing at Sink Step when pipeline is scheduled
I have Pipeline with Data flow debug where data is sinked into Data set. When run manually it runs fine and the parquet file is created. However when scheduled in Pipeline it gives following error. Pipeline runs fine only gives error on schedule…
Using SMI in Synapse Spark Job
Hello We have Synapse Pipeline with Spark Job Definition We currently use SPN to read data from ADLS2 and also to write to kusto with Spark Kusto Connector (Getting a token from SPN) We have saved SPN credentials into AKV. We have urgent requirement to…