2,037 questions with Azure Databricks tags

Sort by: Updated
0 answers

[INTERNAL_ERROR] The Spark SQL phase optimization failed with an internal error. You hit a bug in Spark or the Spark plugins you use.

"I am trying to extract data from Azure Cosmos DB using PySpark and I am getting the following error: Py4JJavaError: An error occurred while calling o700.save.: org.apache.spark.SparkException: [INTERNAL_ERROR] The Spark SQL phase optimization…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,037 questions
Azure Cosmos DB
Azure Cosmos DB
An Azure NoSQL database service for app development.
1,510 questions
asked 2024-03-08T08:35:10.85+00:00
Nathalia Fernandez Rodrigues 0 Reputation points
commented 2024-06-26T16:39:36.0333333+00:00
Luce PHILIBERT 0 Reputation points
0 answers

Is there any way to make Azure Data Factory pipelines wait in queue if infrastructure reaches maximum capacity and not fail?

I have an Event based trigger in Azure Data Factory which executes ETL pipelines every 5 mins. The pipeline has some Databricks notebook activities which executes via a cluster pool. In some cases, the pipelines are failing throwing this error:…

Azure
Azure
A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.
1,068 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,037 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,962 questions
asked 2024-06-26T14:57:09.1266667+00:00
Danish Ahmed Mehmuda 0 Reputation points
2 answers

databricks cluster sizing

Hey, how to calculate cluster core and workers node of 10gb data load every 2 hours ... what is the calculation behind this

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,037 questions
asked 2024-06-26T06:13:37.5433333+00:00
Vineet S 245 Reputation points
answered 2024-06-26T10:54:39.5233333+00:00
Smaran Thoomu 11,775 Reputation points Microsoft Vendor
1 answer

array in databricks

Hi , I got the 1st row in sorted order how can i get the view on row1,row2,row3 using array and how the reshuffling will happen 1,2,3,4,5  --- sorted order   5,1,2,3,4  - 1 4,5,1,2,3  - 2 3,4,5,1,2  - 3

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,037 questions
asked 2024-06-25T08:20:36.5333333+00:00
Vineet S 245 Reputation points
edited the question 2024-06-26T10:42:52.7333333+00:00
SwathiDhanwada-MSFT 18,301 Reputation points
3 answers

Azure Data Bricks - User Doesn't have permission to perform this action while connecting to Azure Synapse Dedicate Pool

We are connecting Azure Synapse Analytics - Dedicated Pool using the PySpark Code that runs from Azure Data Bricks using SQL Authentication. While running, we are getting the below error when we use a user with db_datawriter and db_datareader…

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,582 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,037 questions
asked 2024-06-07T11:26:00.4933333+00:00
Praveen Sreeram 1 Reputation point
commented 2024-06-26T06:24:53.4066667+00:00
Praveen Kumar Sreeram 0 Reputation points
1 answer

view in dataframe

hey, how we can create or replace view statement in spark sql in dataframe of databricks create or replace view as (select * from temp1)

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,037 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,962 questions
asked 2024-06-25T17:31:45.62+00:00
Vineet S 245 Reputation points
commented 2024-06-26T04:17:28.77+00:00
Vineet S 245 Reputation points
1 answer

vm cpu utilization

Hi, have 100 VMs in Azure portal's resource group for which i am running VM memory metrix(cpu usage) .. how can it will automatically recognized the new subscription of vm came so that it will show the cpu usage via loading adf or databricks pipeline

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,582 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,037 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,962 questions
Azure App Service
Azure App Service
Azure App Service is a service used to create and deploy scalable, mission-critical web apps.
7,235 questions
asked 2024-06-20T10:33:21.4333333+00:00
Vineet S 245 Reputation points
commented 2024-06-25T17:29:58.2766667+00:00
Vineet S 245 Reputation points
1 answer

Can we connect ADX to Databricks, without using App Registration.?

I am trying to Access ADX data into Databricks, but failing to configure connection between databricks and ADX. I don't have permissions to use App registration. Is there any way to do with managed identity or any other alternative.

Azure Data Explorer
Azure Data Explorer
An Azure data analytics service for real-time analysis on large volumes of data streaming from sources including applications, websites, and internet of things devices.
501 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,037 questions
Microsoft Entra ID
Microsoft Entra ID
A Microsoft Entra identity service that provides identity management and access control capabilities. Replaces Azure Active Directory.
20,250 questions
asked 2024-06-25T12:25:47.8766667+00:00
Ram Ahuja, Roma 0 Reputation points
answered 2024-06-25T13:27:06.6633333+00:00
PRADEEPCHEEKATLA-MSFT 83,726 Reputation points Microsoft Employee
0 answers

Serverless warehouse suddenly stops to start up.

Hey All. From today, suddenly we are getting below error while starting a serverless warehouse. Details for the latest failure: Error: Cluster launch timeout. Type: SERVICE_FAULT Code: K8S_DBR_CLUSTER_LAUNCH_TIMEOUT Warehouse details: Type:…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,037 questions
asked 2024-06-25T12:48:53.3833333+00:00
Bhargav Shir 0 Reputation points
commented 2024-06-25T13:14:17.5566667+00:00
PRADEEPCHEEKATLA-MSFT 83,726 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Connecting to databricks using .Net

Hello Team, Is it possible to connect to Databricks and perform CRUD operations on catalog schema tables and Delta tables using .NET? If so, what approach is needed to connect to Databricks using JDBC or ODBC? Are there any specific libraries in Visual…

.NET
.NET
Microsoft Technologies based on the .NET software framework.
3,550 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,037 questions
asked 2024-06-21T15:59:05.22+00:00
Nagesh CL 641 Reputation points
commented 2024-06-25T11:45:56.2666667+00:00
Nagesh CL 641 Reputation points
1 answer

How to know which type of service we need within azure databricks for our implementation

I need to know the different types of services available within databricks to implement my solution more cost efficiently. Is there any resource from azure we can reach out to whom we can explain our implementation and they can provide the list of…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,037 questions
asked 2024-06-21T09:22:35.21+00:00
Arun Kumar 0 Reputation points
edited a comment 2024-06-25T09:41:40.9366667+00:00
Smaran Thoomu 11,775 Reputation points Microsoft Vendor
1 answer

Cross tenant AAD authentication for Purview & Databricks

Hi, I want to know if for Purview & Databricks, is it possible to authenticate with cross tenant AAD? That is to say, can users belonging to AAD in tenant1, be able to login to Purview & Databricks which are setup in tenant2? Thanks

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,037 questions
Active Directory
Active Directory
A set of directory-based technologies included in Windows Server.
6,109 questions
Microsoft Purview
Microsoft Purview
A Microsoft data governance service that helps manage and govern on-premises, multicloud, and software-as-a-service data. Previously known as Azure Purview.
1,012 questions
asked 2024-06-20T03:44:04.3166667+00:00
Amit Singh 40 Reputation points
commented 2024-06-25T08:30:13.29+00:00
PRADEEPCHEEKATLA-MSFT 83,726 Reputation points Microsoft Employee
1 answer

How can I migrate data bricks from one subscription to another subscription during cross subscription migration ?

I am unable to move data brick from the source subscription to the destination subscription during cross-subscription migration

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,037 questions
asked 2024-06-19T19:20:06.0166667+00:00
Maaz Ahmed Nagori 0 Reputation points
commented 2024-06-25T08:26:08.2166667+00:00
PRADEEPCHEEKATLA-MSFT 83,726 Reputation points Microsoft Employee
1 answer

Downsizing subnets that are associated with a databricks workspace

Hi, Is it possible to downsize the private and public subnets that currently have nothing attached to it but are use by a databricks workspace? Would this require the databricks cluster to be redeployed? Are there any extra steps needed so the…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,037 questions
asked 2024-06-24T11:10:32.2233333+00:00
azmag 0 Reputation points
edited a comment 2024-06-25T07:29:21.0666667+00:00
PRADEEPCHEEKATLA-MSFT 83,726 Reputation points Microsoft Employee
1 answer

Error while implementing File Notification Mode: com.microsoft.azure.storage.StorageException: This request is not authorized to perform this operation using this permission.

While implementing File Notification Mode in Autoloader, I get the following error. Has anyone faced the similar issue? Note: The Databricks Service Principal is having Contributor role to Storage account. com.microsoft.azure.storage.StorageException:…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,037 questions
asked 2024-06-21T06:16:38.9166667+00:00
Hiran Amarathunga 25 Reputation points
commented 2024-06-25T07:21:26.3833333+00:00
PRADEEPCHEEKATLA-MSFT 83,726 Reputation points Microsoft Employee
0 answers

I am unable to move data brick from the source subscription to the destination subscription during cross-subscription migration

I am unable to move data brick from the source subscription to the destination subscription during cross-subscription migration

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,037 questions
asked 2024-06-19T19:22:30.1166667+00:00
Maaz Ahmed Nagori 0 Reputation points
commented 2024-06-24T19:57:49.5933333+00:00
BhargavaGunnam-MSFT 28,351 Reputation points Microsoft Employee
0 answers

DataBricks Unity Catalog Lineage

Hi, I'm looking for support on the Databricks Unity Catalog (on the data lineage). So I'm trying to establish lineage between 2 schemas (with 50 odd tables within each schema). Data for the first schema is fetched from source files (via ADF pipeline),…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,037 questions
asked 2024-06-20T06:15:49.5666667+00:00
S, Santhosh M 0 Reputation points
commented 2024-06-24T08:33:46.89+00:00
PRADEEPCHEEKATLA-MSFT 83,726 Reputation points Microsoft Employee
0 answers

How to solve Invalid SessionHandle error with Azure Databricks ?

I am applying a SQLDatabaseChain Chatbot model by using LangChain SQLDatabaseChain and GPT4. I first created this model on Azure Databricks notebook like this : import json import os import langchain import mlflow from mlflow.models import…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,037 questions
asked 2024-06-17T14:38:14.4333333+00:00
Feng YIJUN 0 Reputation points
commented 2024-06-24T08:22:16.0633333+00:00
PRADEEPCHEEKATLA-MSFT 83,726 Reputation points Microsoft Employee
2 answers

Databricks not sending audit logs to event hub

Hi, I'm trying to push all the logs from databricks using the diagnostic tool to event hub but is not working, it didn't push anything. I'm using the root access policy and also already created the eventhub name, what else I'm probably missing? Thanks in…

Azure Event Hubs
Azure Event Hubs
An Azure real-time data ingestion service.
582 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,037 questions
asked 2024-06-13T19:41:27.85+00:00
Danny AMAYA 0 Reputation points
commented 2024-06-24T07:31:44.73+00:00
PRADEEPCHEEKATLA-MSFT 83,726 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Pipeline Executing Databricks Notebook Successfully Despite Stopped Cluster

In Azure Data Factory (ADF), I have a pipeline that executes a notebook in Azure Databricks. I noticed that even when the Databricks cluster is stopped, the ADF pipeline still completes successfully, and the notebook runs without any issues. Is this…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,037 questions
asked 2024-06-20T11:17:32.34+00:00
vikranth-0706 180 Reputation points
accepted 2024-06-23T14:51:08.82+00:00
vikranth-0706 180 Reputation points