2,076 questions with Azure Databricks tags

0 answers

transformation changes in silver and bronze layer

Hi, what transformations take place between the silver and gold layers? I have loaded data into my bronze layer and transformed it there. What then happens between the silver and gold layers, apart from PK/FK joins and so on?

Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,688 questions
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,076 questions
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,180 questions
asked 2024-07-24T20:36:21.0466667+00:00
Vineet S 425 Reputation points
2 answers

What is the best way to access data in Databricks by using an Azure Function?

I just tried to load data from Databricks by using the Databricks Jobs API and an Azure Function. Is there another way to do the same thing that is based on Azure Functions?

Azure Functions
An Azure service that provides an event-driven serverless compute platform.
4,661 questions
Azure Databricks
asked 2024-07-12T09:08:56.7366667+00:00
Athula Chandrawansha 20 Reputation points
commented 2024-07-24T19:37:23.48+00:00
Azar 22,350 Reputation points MVP
0 answers

Azure Databricks Too Many Requests errors

We are getting many errors with loading notebooks and also now running jobs on clusters due to Databricks saying it has too many requests. For example, getting the below error message: run failed with error message Cluster '0724-103023-f2llqh3p' was…

Azure Databricks
asked 2024-07-24T10:51:26.46+00:00
Matthieu Marshall 6 Reputation points
commented 2024-07-24T19:06:45.7333333+00:00
Bhargava-MSFT 29,036 Reputation points Microsoft Employee
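
If the "too many requests" responses come from the caller's own API traffic (an assumption; the excerpt above is truncated and may describe a different cause), one common mitigation is to retry with exponential backoff. The sketch below is generic Python, not a Databricks API: `TooManyRequests`, `call_with_backoff`, and the `request` callable are hypothetical names used only for illustration.

```python
import random
import time


class TooManyRequests(Exception):
    """Hypothetical error raised by the caller's HTTP layer on a 429 response."""

    def __init__(self, retry_after=None):
        super().__init__("HTTP 429")
        self.retry_after = retry_after  # seconds suggested by the server, if any


def call_with_backoff(request, max_attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call request(); on TooManyRequests wait and retry, doubling the delay each time."""
    for attempt in range(max_attempts):
        try:
            return request()
        except TooManyRequests as exc:
            if attempt == max_attempts - 1:
                raise  # give up after the last allowed attempt
            if exc.retry_after is not None:
                delay = exc.retry_after
            else:
                delay = base_delay * 2 ** attempt
            # Add up to 10% jitter so parallel callers do not retry in lockstep.
            sleep(delay + random.uniform(0, 0.1 * delay))
```

Passing `sleep` as a parameter keeps the helper easy to exercise without real waiting.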
1 answer

load data from ADLS to Unity Catalog tables using Azure Data Factory?

What is the best way to load data from ADLS to Unity Catalog tables using Azure Data Factory?

Azure Databricks
Azure Data Factory
asked 2024-07-24T16:08:45.33+00:00
chandrasekhar munagala 21 Reputation points
answered 2024-07-24T17:26:20.44+00:00
Sina Salam 7,286 Reputation points
1 answer

Efficient Log Handling and Data Retention in Azure Data Factory and Databricks

I need to create a solution to send logs from Azure Data Factory to the Databricks Unity Catalog. I'm considering the following structure: Whenever an activity run results in either failure or success, the corresponding log will be sent to Azure Logic…

Azure Blob Storage
An Azure service that stores unstructured data in the cloud as blobs.
2,634 questions
Azure Logic Apps
An Azure service that automates the access and use of data across clouds without writing code.
2,994 questions
Azure Databricks
Azure Data Factory
asked 2024-07-17T19:44:42.0666667+00:00
Hanna 220 Reputation points
commented 2024-07-24T15:11:12.7533333+00:00
Chandra Boorla 400 Reputation points Microsoft Vendor
0 answers

Creating a Zip File from Blob Storage Using Python in Azure Databricks

Hello, I am working on a task where I need to create a zip file for multiple files stored in blob storage, without having to read the files again or using local storage. I am using Python in Azure Databricks and would like to leverage its capabilities…

Azure Blob Storage
Azure Databricks
asked 2024-07-24T06:12:46.7266667+00:00
Chahat Malik 0 Reputation points
commented 2024-07-24T12:43:20.9766667+00:00
Sina Salam 7,286 Reputation points
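
One way to avoid local storage when zipping is to build the archive in an in-memory buffer. This is a minimal sketch using only the Python standard library; it assumes the file contents are already available as bytes, and it leaves out downloading from and uploading to blob storage. The function name and the sample payloads are hypothetical.

```python
import io
import zipfile


def build_zip_in_memory(files):
    """Pack a mapping of {archive_name: bytes} into a zip and return the zip bytes."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, mode="w", compression=zipfile.ZIP_DEFLATED) as archive:
        for name, payload in files.items():
            archive.writestr(name, payload)  # written straight into the buffer, no temp file
    return buffer.getvalue()


# Hypothetical payloads standing in for blob contents already held in memory.
sample_files = {
    "reports/a.csv": b"id,value\n1,10\n",
    "reports/b.csv": b"id,value\n2,20\n",
}
zip_bytes = build_zip_in_memory(sample_files)
```

The resulting `zip_bytes` could then be written back to storage with whichever client the workspace already uses.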
2 answers

Getting the size of parquet files from azure blob storage

I have a blob container abcd. The folder structure is like below: abcd/Folder1/Folder a, Folder b … Folder z. Inside a particular folder: Folder a/v1/full/20230505/part12344.parquet. Similarly: Folder b/v1/full/20230505/part9385795.parquet. The scenario is that I need to get…

Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,426 questions
Azure Blob Storage
Azure Databricks
Azure Data Factory
asked 2024-07-09T11:57:50.94+00:00
KEERTHANA JAYADEVAN 66 Reputation points
commented 2024-07-24T08:31:27.1433333+00:00
Nehruji R 4,691 Reputation points Microsoft Vendor
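
Once a listing of blob paths and sizes is available (how it is obtained is left out here), totalling Parquet sizes per folder is a small aggregation. The sketch below assumes the listing is a sequence of `(path, size_in_bytes)` pairs relative to the parent folder, and that grouping by the first path segment is what is wanted; the byte sizes are made-up illustration values and the function name is hypothetical.

```python
from collections import defaultdict


def total_parquet_bytes_by_folder(blobs):
    """Sum the sizes of .parquet files, grouped by the first path segment."""
    totals = defaultdict(int)
    for path, size in blobs:
        if not path.endswith(".parquet"):
            continue  # skip marker files and anything that is not Parquet
        top_folder = path.split("/", 1)[0]
        totals[top_folder] += size
    return dict(totals)


# Hypothetical listing; paths follow the layout in the question, sizes are invented.
listing = [
    ("Folder a/v1/full/20230505/part12344.parquet", 1_048_576),
    ("Folder b/v1/full/20230505/part9385795.parquet", 2_097_152),
    ("Folder a/v1/full/20230505/_SUCCESS", 0),
]
sizes = total_parquet_bytes_by_folder(listing)
```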
2 answers One of the answers was accepted by the question author.

Azure Databricks Billing

I am confused about how the Databricks service is billed under Azure. According to the documentation, Databricks is fully integrated with Azure billing: one bill for both Azure infrastructure (VM, storage, network traffic, etc.) and Databricks…

Azure Cost Management
A Microsoft offering that enables tracking of cloud usage and expenditures for Azure and other cloud providers.
2,349 questions
Azure Databricks
asked 2024-07-08T21:20:19.8566667+00:00
P, John 140 Reputation points
accepted 2024-07-24T03:54:53.4933333+00:00
P, John 140 Reputation points
1 answer

Kafka Connector for Databricks

Can you point me to documentation on using the Kafka connector in an Apache Spark Streaming job to connect to Azure Event Hubs? I am looking for the Maven library version, etc.

Azure Databricks
asked 2024-07-22T14:43:43.1133333+00:00
pseudo p 0 Reputation points
answered 2024-07-22T21:35:24.21+00:00
Amira Bedhiafi 19,946 Reputation points
1 answer

Capture logs from Azure Data Factory and insert them into a Delta table in Databricks

Good morning, I need assistance in creating a project that captures logs from Azure Data Factory and inserts them into a Delta Table in Databricks. The key requirements for this project are as follows: No Duplicate Logs: Ensuring that the logs are not…

Azure Databricks
Azure Data Factory
asked 2024-07-17T15:17:27.44+00:00
Hanna 220 Reputation points
commented 2024-07-22T13:25:51.6466667+00:00
Smaran Thoomu 12,615 Reputation points Microsoft Vendor
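
For the "no duplicate logs" requirement, the core idea is to keep one record per unique identifier before inserting. The following is a plain-Python sketch of that idea only, not a Delta Lake or Data Factory API; it assumes each log record is a dict carrying a unique `run_id` field, which is a hypothetical name.

```python
def drop_duplicate_logs(records, key="run_id"):
    """Return records with later duplicates of the same key removed, order preserved."""
    seen = set()
    unique = []
    for record in records:
        identifier = record[key]
        if identifier in seen:
            continue  # this run was already captured
        seen.add(identifier)
        unique.append(record)
    return unique


# Hypothetical log records; the second "r1" entry is a repeat delivery.
records = [
    {"run_id": "r1", "status": "Succeeded"},
    {"run_id": "r2", "status": "Failed"},
    {"run_id": "r1", "status": "Succeeded"},
]
unique_logs = drop_duplicate_logs(records)
```

In a table-backed pipeline the same key would typically drive an upsert rather than an in-memory filter.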
5 answers

How to fetch data from Azure Active Directory (AD) by using either ADF or Databricks

How can I fetch data from Azure Active Directory (AD) using either Azure Data Factory (ADF) or Azure Databricks? Please let me know in detail. Thanks.

Azure Databricks
Azure Data Factory
Active Directory
A set of directory-based technologies included in Windows Server.
6,235 questions
asked 2024-07-04T10:41:44.7933333+00:00
Lakshmi Narayana Sarma Bhamidipati 30 Reputation points
commented 2024-07-22T13:18:19.4833333+00:00
Lakshmi Narayana Sarma Bhamidipati 30 Reputation points
1 answer One of the answers was accepted by the question author.

Data Factory monitoring by inserting data table

Hello, I would like to know the best way to insert Data Factory activity logs into my Databricks Delta table, so that I can use dashboards and set up monitoring in Databricks itself. Can you help me? I would like every 5 minutes for all activity logs in…

Azure Monitor
An Azure service that is used to collect, analyze, and act on telemetry data from Azure and on-premises environments.
3,033 questions
Azure Storage Accounts
Globally unique resources that provide access to data management services and serve as the parent namespace for the services.
2,935 questions
Azure Databricks
Azure Data Factory
asked 2024-07-21T22:18:21.9966667+00:00
Hanna 220 Reputation points
accepted 2024-07-22T11:55:11.7766667+00:00
Hanna 220 Reputation points
2 answers

How to use Databricks AI to auto-generate data definitions for all the tables in my database?

I know we can go to the catalog in Databricks and generate data definitions for columns inside our database using AI, but is there a way of automatically generating these definitions without having to manually generate them and click accept on every…

Azure SQL Database
Azure Databricks
Azure Data Catalog
An Azure service that serves as a system of registration and system of discovery for enterprise data assets.
100 questions
asked 2024-03-06T21:31:37.0066667+00:00
Bradley, Zack 0 Reputation points
commented 2024-07-22T09:50:08.5766667+00:00
DUCOBU Delphine 0 Reputation points
2 answers

SAS token generation by Databricks to access CSV files from ADLS container folder

Hi Team, there are some zipped CSV files inside the ADLS container folder. These zip files need to be downloaded for data correction. Downloading a file requires a SAS token embedded with the zip file path. Databricks has been used to generate the token and…

Azure Data Lake Storage
Azure Databricks
Azure Data Factory
asked 2024-07-17T05:08:24.3033333+00:00
Subhadip Roy 21 Reputation points
commented 2024-07-22T08:11:32.9266667+00:00
Subhadip Roy 21 Reputation points
3 answers

Azure Databricks - User doesn't have permission to perform this action while connecting to Azure Synapse Dedicated Pool

We are connecting to an Azure Synapse Analytics dedicated pool using PySpark code that runs from Azure Databricks with SQL authentication. While running, we get the below error when we use a user with db_datawriter and db_datareader…

Azure Synapse Analytics
Azure Databricks
asked 2024-06-07T11:26:00.4933333+00:00
Praveen Sreeram 1 Reputation point
commented 2024-07-22T05:45:51.19+00:00
Smaran Thoomu 12,615 Reputation points Microsoft Vendor
1 answer

Azure Databricks - Timeout error after 60 minutes when launching an Azure Databricks cluster

When I attempt to start a cluster through the Azure Databricks portal/UI, after 30 minutes I receive the following error in the event log: Failed to add 3 containers to the compute. Will attempt retry: true. Reason: Cloud provider launch failure Azure…

Azure Databricks
asked 2024-07-12T05:04:06.92+00:00
Michael Pugliese 0 Reputation points
commented 2024-07-22T05:02:46.2933333+00:00
PRADEEPCHEEKATLA-MSFT 85,511 Reputation points Microsoft Employee
2 answers

Append in Liquid Cluster enabled table is not completing on DBR 15.3 version

I am trying to do an analysis with a partitioned table and a liquid clustered table. As per the Azure Databricks recommendation, I am using DBR 15.2 to execute the code. I have created a clustered table and am using an append operation, which is specified below. Few…

Azure Databricks
asked 2024-07-13T07:29:40.88+00:00
Sudipta Goswami 20 Reputation points
edited an answer 2024-07-22T04:45:21.4733333+00:00
PRADEEPCHEEKATLA-MSFT 85,511 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

How to add parent group for one specific group in databricks?

This question concerns "databricks" -> "settings" -> "Identity and access" -> "Groups". There is a system-managed "admin" group here, but we wonder if a new group can be assigned as a parent…

Azure Databricks
asked 2024-07-18T09:02:33.91+00:00
accepted 2024-07-22T01:40:09.68+00:00
0 answers

My Python code uses the AzureCliCredential() function, but it gives an error when running in a Synapse/ADF/Databricks notebook.

My Python code uses the AzureCliCredential() function, but when running in a Synapse workspace it gives an error saying that the Azure CLI command was not found on the path. I have tried using other functions like DefaultAzureCredential() and ClientSecretCredential() also…

Azure Synapse Analytics
Azure Databricks
Azure Data Factory
asked 2024-07-18T18:55:12.8266667+00:00
Utsav Mori 20 Reputation points
edited a comment 2024-07-20T19:36:18.43+00:00
Utsav Mori 20 Reputation points
1 answer

Workflow that logs the completion of certain pipelines into a table

I'm having a lot of difficulty implementing the solutions I had in mind. Previously, I asked for help to create an architecture that would be efficient, easy to maintain, and cost-effective. I received various suggestions, but I can't decide which one to…

Azure Monitor
Azure Event Hubs
An Azure real-time data ingestion service.
600 questions
Azure Databricks
Azure Data Factory
Azure Event Grid
An Azure event routing service designed for high availability, consistent performance, and dynamic scale.
353 questions
asked 2024-07-18T18:08:14.79+00:00
Hanna 220 Reputation points
answered 2024-07-19T19:35:43.0366667+00:00
Amira Bedhiafi 19,946 Reputation points