2,070 questions with Azure Databricks tags

Sort by: Updated
0 answers

My python code uses AzureCliCredential() function, but it is giving error in running in Synapse/ADF/Databricks notebook .

My python code uses AzureCliCredential() function, but it is giving error in running in Synapse Workspace that your Azure Cli command not found on path . I have tried using other function like DefaultAzureCredentials(), ClientSecretCredentials () also…

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,667 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,120 questions
asked 2024-07-18T18:55:12.8266667+00:00
Utsav Mori 20 Reputation points
0 answers

Workflow that logs the completion of certain pipelines into a table

I'm having a lot of difficulty implementing the solutions I had in mind. Previously, I asked for help to create an architecture that would be efficient, easy to maintain, and cost-effective. I received various suggestions, but I can't decide which one to…

Azure Monitor
Azure Monitor
An Azure service that is used to collect, analyze, and act on telemetry data from Azure and on-premises environments.
3,012 questions
Azure Event Hubs
Azure Event Hubs
An Azure real-time data ingestion service.
597 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,120 questions
Azure Event Grid
Azure Event Grid
An Azure event routing service designed for high availability, consistent performance, and dynamic scale.
349 questions
asked 2024-07-18T18:08:14.79+00:00
Yohanna de Oliveira Cavalcanti 160 Reputation points
2 answers One of the answers was accepted by the question author.

Data Factory Logs --> Catolog Databricks

Good morning, I need assistance in creating a project that captures logs from Azure Data Factory and inserts them into a Delta Table in Databricks. The key requirements for this project are as follows: No Duplicate Logs: Ensuring that the logs are not…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,120 questions
asked 2024-07-17T14:53:09.4466667+00:00
Yohanna de Oliveira Cavalcanti 160 Reputation points
edited a comment 2024-07-18T15:56:37.4933333+00:00
phemanth 8,485 Reputation points Microsoft Vendor
2 answers

SAS token generation by Databricks to access CSV files from ADLS container folder

Hi Team, There are some csv files zips inside the ADLS container folder. These zip files need to be downloaded for data correction. Downloading the file requires SAS token embedded with zip file path. Databricks has been used to generate the token and…

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,424 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,120 questions
asked 2024-07-17T05:08:24.3033333+00:00
Subhadip Roy 21 Reputation points
commented 2024-07-18T15:47:55.41+00:00
Subhadip Roy 21 Reputation points
2 answers

Getting the size of parquet files from azure blob storage

I have a blob container abcd The folder structure is like below: abcd/Folder1/Folder a, Folder b…..Folder z Inside a particular Folder a/v1/full/20230505/part12344.parquet Similarly Folder b/v1/full/20230505/part9385795.parquet Scenario is I need to get…

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,424 questions
Azure Blob Storage
Azure Blob Storage
An Azure service that stores unstructured data in the cloud as blobs.
2,612 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,120 questions
asked 2024-07-09T11:57:50.94+00:00
KEERTHANA JAYADEVAN 66 Reputation points
commented 2024-07-18T12:13:50.3+00:00
Amrinder Singh 4,270 Reputation points Microsoft Employee
1 answer

How to add parent group for one specific group in databricks?

The question for this is in "databricks" -> "settings" -> "Identity and access" -> "Groups" Here has "admin" group with system managed. But we wonder if a new group can be assigned as a parent…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2024-07-18T09:02:33.91+00:00
answered 2024-07-18T09:12:19.5066667+00:00
Deepanshukatara-6769 7,830 Reputation points
1 answer One of the answers was accepted by the question author.

Pipeline Executing Databricks Notebook Successfully Despite Stopped Cluster

In Azure Data Factory (ADF), I have a pipeline that executes a notebook in Azure Databricks. I noticed that even when the Databricks cluster is stopped, the ADF pipeline still completes successfully, and the notebook runs without any issues. Is this…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2024-06-20T11:17:32.34+00:00
vikranth-0706 205 Reputation points
commented 2024-07-18T07:19:40.9566667+00:00
Smaran Thoomu 12,610 Reputation points Microsoft Vendor
1 answer

I am unable to create compute cluster in databricks from azure free trial subcription

I am unable to create compute cluster in databricks from azure free trial subcription. Failed to add 1 container to the compute. Will attempt retry: false. Reason: Azure Quota Exceeded Exception

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2024-07-10T12:10:18.31+00:00
Sangana 0 Reputation points
commented 2024-07-18T06:35:33.9+00:00
phemanth 8,485 Reputation points Microsoft Vendor
1 answer

Captures logs from Azure Data Factory and inserts them into a Delta Table in Databricks

Good morning, I need assistance in creating a project that captures logs from Azure Data Factory and inserts them into a Delta Table in Databricks. The key requirements for this project are as follows: No Duplicate Logs: Ensuring that the logs are not…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,120 questions
asked 2024-07-17T15:17:27.44+00:00
Yohanna de Oliveira Cavalcanti 160 Reputation points
commented 2024-07-18T05:42:00.2866667+00:00
Smaran Thoomu 12,610 Reputation points Microsoft Vendor
0 answers

Datafactory logs for dasboard on databricks

hello! need help! I need to solve a problem that consists of taking completed pipeline logs with specific names and inserting them into a deltatable within Databricks to create dashboards in it. this solution needs to be entirely using Azure tools. I'm…

Azure Monitor
Azure Monitor
An Azure service that is used to collect, analyze, and act on telemetry data from Azure and on-premises environments.
3,012 questions
Azure Logic Apps
Azure Logic Apps
An Azure service that automates the access and use of data across clouds without writing code.
2,982 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,120 questions
Azure Event Grid
Azure Event Grid
An Azure event routing service designed for high availability, consistent performance, and dynamic scale.
349 questions
asked 2024-07-18T01:53:11.08+00:00
Yohanna de Oliveira Cavalcanti 160 Reputation points
3 answers

Azure Data Bricks - User Doesn't have permission to perform this action while connecting to Azure Synapse Dedicate Pool

We are connecting Azure Synapse Analytics - Dedicated Pool using the PySpark Code that runs from Azure Data Bricks using SQL Authentication. While running, we are getting the below error when we use a user with db_datawriter and db_datareader…

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,667 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2024-06-07T11:26:00.4933333+00:00
Praveen Sreeram 1 Reputation point
answered 2024-07-18T00:39:31.7333333+00:00
Vinodh247 13,226 Reputation points
1 answer

Efficient Log Handling and Data Retention in Azure Data Factory and Databricks

I need to create a solution to send logs from Azure Data Factory to the Databricks Unity Catalog. I'm considering the following structure: Whenever an activity run results in either failure or success, the corresponding log will be sent to Azure Logic…

Azure Blob Storage
Azure Blob Storage
An Azure service that stores unstructured data in the cloud as blobs.
2,612 questions
Azure Logic Apps
Azure Logic Apps
An Azure service that automates the access and use of data across clouds without writing code.
2,982 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,120 questions
asked 2024-07-17T19:44:42.0666667+00:00
Yohanna de Oliveira Cavalcanti 160 Reputation points
commented 2024-07-17T22:12:15.19+00:00
Yohanna de Oliveira Cavalcanti 160 Reputation points
2 answers

How to create Synapse Serverless SQL Pool External Table using Databricks Notebook?

Hello, Can we create Synapse Serverless SQL Pool External Table using a Databricks Notebook? E.g., the script to create an Synapse Serverless SQL Pool External Table from within Synapse is as follows: CREATE EXTERNAL TABLE [SchemaName].[TableName]…

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,667 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2023-07-24T20:04:56.9766667+00:00
Azure Enthusiast 10 Reputation points
edited a comment 2024-07-17T20:21:32.17+00:00
Kafran 0 Reputation points
2 answers One of the answers was accepted by the question author.

Data Factory Logs to Databricks

I need to create a way to send logs from Data Factory to the Databricks Catalog. What is the most cost-effective and efficient method to achieve this?

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,120 questions
asked 2024-07-16T19:26:32.8633333+00:00
Yohanna de Oliveira Cavalcanti 160 Reputation points
commented 2024-07-17T18:39:54.08+00:00
Yohanna de Oliveira Cavalcanti 160 Reputation points
1 answer One of the answers was accepted by the question author.

How to Correctly Pass and Use Boolean Values from Datafactory to Databricks Notebook

How can I correctly pass a Boolean value from Datafactory to a Databricks notebook and use it in conditional logic? I configured a pipeline in Datafactory that calls a Databricks notebook. I attempted to pass a Boolean parameter from Datafactory as a…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,120 questions
asked 2024-07-11T14:38:26.1066667+00:00
Glasier 400 Reputation points
accepted 2024-07-17T15:10:13.22+00:00
Glasier 400 Reputation points
5 answers

how to fetch data from Azure Active Directory(AD) by using either ADF or databricks

To fetch data from Azure Active Directory (AD) using either Azure Data Factory (ADF) or Azure Databricks, Pleae let me know in detail. thanks

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,120 questions
Active Directory
Active Directory
A set of directory-based technologies included in Windows Server.
6,208 questions
asked 2024-07-04T10:41:44.7933333+00:00
Lakshmi Narayana Sarma Bhamidipati 30 Reputation points
commented 2024-07-17T14:10:25.3766667+00:00
Smaran Thoomu 12,610 Reputation points Microsoft Vendor
3 answers One of the answers was accepted by the question author.

Validate Task - Data Factory npm Build Fails bundle.manager.js:53 exit code 255

I have two devops accounts. In 1st account I configured a build pipeline for Azure Data Factory with the attached build.yaml132228-buildyaml.txt I then copied the repo and the build yaml to the 2nd devops account. In the 1st devops the build…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2021-09-15T07:22:51.877+00:00
Rob Bowman 221 Reputation points
answered 2024-07-17T11:56:03.2833333+00:00
JyotiranjanMangaraj-6157 0 Reputation points
1 answer One of the answers was accepted by the question author.

Best Practices for Automating Pipeline Execution Data Collection in Azure Data Factory

Hello everyone, I am looking for the best practices to create an automated workflow for collecting execution data from Azure Data Factory (ADF) pipelines, storing this data in Azure Data Lake Storage Gen2, and consolidating it into a single table for…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,120 questions
asked 2024-07-16T14:23:15.0633333+00:00
Yohanna de Oliveira Cavalcanti 160 Reputation points
accepted 2024-07-17T11:38:09.2833333+00:00
Yohanna de Oliveira Cavalcanti 160 Reputation points
1 answer

How does ADF Linked Service for Azure Databricks aligns with Job Compute Policy defined inside Azure Databricks

This is a two-part Question. First Part Context: I have ADF which contains several data pipelines. Some pipeline also includes a databricks notebook as an activity. I have created Linked Service Azure Data Factory to facilitate the pipeline which…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,120 questions
asked 2024-07-10T10:25:16.91+00:00
Ravineesh 20 Reputation points
commented 2024-07-16T23:14:23.99+00:00
Bhargava-MSFT 28,951 Reputation points Microsoft Employee
1 answer

Append in Liquid Cluster enabled table is not completing on DBR 15.3 version

I am trying do analysis with a Partition Table and Liquid Clustered table. As per Azure Databricks recommendation, I am using DBR 15.2 to execute the code. I have created a clustered table as and using an append operation which is specified below. Few…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2024-07-13T07:29:40.88+00:00
Sudipta Goswami 20 Reputation points
commented 2024-07-16T17:09:56.4566667+00:00
Sudipta Goswami 20 Reputation points