1,985 questions with Azure Databricks tags

Sort by: Updated
2 answers

How to ignore the records in ADF Data Flows

Hi All I am building a data transamination using mapping data flows ,I have a time stamp field Like TimeStampUpdated in the target table. I want to lockup historical data with incremental data transamination and ignore the records coming in the…

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,504 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,985 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,805 questions
asked 2024-05-23T06:58:12.53+00:00
venkat rao 45 Reputation points
commented 2024-05-28T08:18:49.66+00:00
ShaikMaheer-MSFT 38,291 Reputation points Microsoft Employee
2 answers

Firewall Configuration for Custom Model Serving in Azure Databricks

Hi, I am encountering an error when trying to serve my custom LLM model endpoint. The error message reads: "Container image creation failed, see Build Logs for details. If there are no build logs, the failure may be due to storage firewall…

Azure Storage Accounts
Azure Storage Accounts
Globally unique resources that provide access to data management services and serve as the parent namespace for the services.
2,781 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,985 questions
asked 2024-05-23T10:43:43.7833333+00:00
Pejman Memar 0 Reputation points
answered 2024-05-28T06:33:26.23+00:00
Nehruji R 3,121 Reputation points Microsoft Vendor
2 answers

How to setup modern Arcitechure for Small/Medium Business?

Currently we're using the following setup which is slow to process the data and is slow on the power bi side: Azure VM for third parties to upload via sftp C# script to ETL data to azure sql server and move files to ADLS Gen2 Power BI report pulling…

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,380 questions
Azure Virtual Machines
Azure Virtual Machines
An Azure service that is used to provision Windows and Linux virtual machines.
7,321 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,985 questions
asked 2024-05-23T20:55:59.0633333+00:00
Jordan 5 Reputation points
answered 2024-05-28T04:54:01.1433333+00:00
Sumarigo-MSFT 44,251 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

How do I figure out what public IP ranges my Databricks workspace clusters are coming from?

Edit: I am rewriting this to clarify the ask. Relatively new to Databricks. I am trying to understand how outbound traffic from clusters is determined. It seems to differ if SCC is enabled vs when it's not. With no SCC: VMs start up with a…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,985 questions
asked 2024-05-08T22:13:53.72+00:00
McDonald, Matthew 121 Reputation points
commented 2024-05-28T02:06:14.0233333+00:00
PRADEEPCHEEKATLA-MSFT 80,491 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Invalid records failed in DQ checks

We are capturing the records that failed in DQ checks by using Databricks in the Blob storage for business owners to resolve inconsistencies, we have added an extra column as DQ checks failed reason. I have the following: What if the particular record…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,985 questions
asked 2024-05-24T10:55:51.94+00:00
Anshal 2,006 Reputation points
accepted 2024-05-27T09:32:56.4466667+00:00
Anshal 2,006 Reputation points
1 answer One of the answers was accepted by the question author.

Databricks SQL endpoint

Hi friends, where does the databricks SQL endpoint stand with comparison to other data warehousing technologies such as Synapse, snowflake, and google cloud? please provide metrics related comparison in terms of costs,scalability and performance. Which…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,985 questions
asked 2024-05-25T14:17:43.1666667+00:00
Anshal 2,006 Reputation points
accepted 2024-05-27T09:10:36.28+00:00
Anshal 2,006 Reputation points
4 answers One of the answers was accepted by the question author.

What is the difference between Databrick prepay and Databrick reservation in Azure ?

Hello, We are just considering ways to reduce Databrick cost in Azure other than buying RI for VMs behind Databrick clusters. What is the difference between Databrick prepay and Databrick reservation in Azure It seems Databrick reservation is named as…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,985 questions
asked 2024-05-23T09:26:44.82+00:00
Anil Kumar 225 Reputation points
accepted 2024-05-27T05:32:31.28+00:00
Anil Kumar 225 Reputation points
1 answer

Cluster Start-up Delayed. Please wait while we continue to try and start the cluster. No action is required from you. (cluster-id 0524-002352-kk357210-v2n)

Hello good people, I am getting this error "Cluster Start-up Delayed. Please wait while we continue to try and start the cluster. No action is required from you. (cluster-id 0524-002352-kk357210-v2n)" Please help. Thank You so much.

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,985 questions
asked 2024-05-24T00:53:43.1466667+00:00
Asma Khalid 0 Reputation points
commented 2024-05-27T04:08:21.1966667+00:00
PRADEEPCHEEKATLA-MSFT 80,491 Reputation points Microsoft Employee
1 answer

I don't see the Data tab in my 14-day trial for Azure databricks.

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,985 questions
asked 2024-03-13T01:54:21.8533333+00:00
Venkata Subba Reddy Bovilla 5 Reputation points
edited a comment 2024-05-26T22:14:16.96+00:00
Kulkarni, Gargi Renukadas 0 Reputation points
1 answer

How to use a different version of a Spark Java library dependency (antlr4) in a Databricks notebook?

Hello. I need to use in a Databricks notebook a custom made Java library which depends on Drools v8.40.1.Final which depends on ANTLR4 v4.10.1. When I try to invoke a method in my Java library I get the following error: "ANTLR Tool version 4.10.1…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,985 questions
asked 2024-01-22T22:32:40.6433333+00:00
Martin Medina 5 Reputation points
commented 2024-05-24T16:55:41.88+00:00
Carlos Irazabal 0 Reputation points
2 answers

How to reduce unnecessary high memory usage in a Databricks cluster?

We are having unnecessary high memory usage even when nothing is running on the cluster. When the cluster first starts, it's fine, but when I run a script and it finishes executing, nothing gets back to the idle (initial) state (even hours after nothing…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,985 questions
asked 2024-05-08T08:58:46.4433333+00:00
Senad Hadzikic 20 Reputation points
answered 2024-05-24T12:27:24.49+00:00
Ben Gislason 0 Reputation points
0 answers

How to parse nested json array of document in ADF data flow

Hi all I am trying to fitch the values from a nested josh array of document , I have used aggregate to convert into objects but not able to fitch the values of all child nodes like as below itOffer.item itOffer.item.SplOfr itOffer.item.buy …

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,504 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,985 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,805 questions
asked 2024-05-02T15:44:57.8633333+00:00
venkat rao 45 Reputation points
commented 2024-05-24T09:38:47.0233333+00:00
phemanth 6,885 Reputation points Microsoft Vendor
1 answer

Azure Databricks workflow job failure

We have a stream workflow job that run 24*7 and loads the data in delta table for say: raw.deltaTableA Now, the problem is in case we are trying to optimize this delta (optimize raw.deltaTableA) table while the table is getting loaded we get frequent…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,985 questions
asked 2024-05-02T08:23:36.4266667+00:00
NIKHIL KUMAR 101 Reputation points
commented 2024-05-24T04:40:08.6533333+00:00
PRADEEPCHEEKATLA-MSFT 80,491 Reputation points Microsoft Employee
1 answer

CSV to XML conversion in databricks which have some blank values as well in csv

I am converting CSV data to xml and that CSV data has some blank values as well for a few columns let's take an example there are 4 columns in CSV and out of that for a row(record) 1 colom value is blank , so as an output in xml, I am getting a missing…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,985 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,805 questions
asked 2024-05-16T08:34:02.7366667+00:00
Manoj 0 Reputation points
commented 2024-05-23T16:49:27.77+00:00
ShaikMaheer-MSFT 38,291 Reputation points Microsoft Employee
0 answers

ADF | ADB Activity Execution Time on Job Clusters

Has anyone noticed adb notebooks running (on job clusters) faster in ADF ? we have sequential notebook activities and seeing the start up time of clusters to be as low as 2 minutes.

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,985 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,805 questions
asked 2024-05-21T13:11:38.6066667+00:00
Lokesh 211 Reputation points
commented 2024-05-23T16:42:52+00:00
ShaikMaheer-MSFT 38,291 Reputation points Microsoft Employee
1 answer

how to disable autoscaling local storage

I'm configuring the cluster with the 'enable_elastic_disk' parameter as 'false', using tfvars. ex: enable_elastic_disk = false. However, clustering in Databricks remains true. what to do?

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,985 questions
asked 2023-02-24T12:13:47.3066667+00:00
Coimbra, Diego(GLOBAL-V) 0 Reputation points
commented 2024-05-23T15:59:09.1033333+00:00
Anthony Roberts (US) 0 Reputation points
1 answer One of the answers was accepted by the question author.

Connecting Azure Databricks workspace to on-premises network - peering

I was following this tutorial to deploy a workspace for on prem database access. I created the VNET for Databricks as mentioned as well as the transit VNET. However, when I got to the option to peer the two VNETs the VNET peering option seems to be…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,985 questions
asked 2024-05-22T14:48:27.7233333+00:00
Abdullah Humayun 40 Reputation points
accepted 2024-05-23T13:00:28.0566667+00:00
Abdullah Humayun 40 Reputation points
1 answer

[Databricks] Clusters are failing to launch. Cluster launch will be retried.

Hi all, I am a complete newbie on Databricks Azure. I have encounterd the below issue which I think is stopping me from running query. Any help will be much appreciated. Thanks. Billy Clusters are failing to launch. Cluster launch will be…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,985 questions
asked 2024-05-08T22:05:05.41+00:00
Billy Cheng 0 Reputation points
commented 2024-05-23T08:47:03.85+00:00
PRADEEPCHEEKATLA-MSFT 80,491 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

How to access the Databricks manages resource group to rotate the access keys for a storage account under the managed resource group?

We want to rotate the access keys for a storage account under a Databricks managed RG. However, keep getting the below error message: "the access is denied because of the deny assignment with name System deny assignment created by Azure…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,985 questions
asked 2024-05-22T06:01:54.29+00:00
Sarish Sayyed (INFOSYS LIMITED) 40 Reputation points Microsoft Vendor
edited the question 2024-05-22T06:49:55.13+00:00
PRADEEPCHEEKATLA-MSFT 80,491 Reputation points Microsoft Employee
2 answers One of the answers was accepted by the question author.

Azure Databricks fail to install Geospark libraries from Maven

Hi Team , I am attempting to add below two geospark Maven libraries to my Azure Databricks interactive cluster with Runtime Version 14.3 LTS . However , I am getting below error Library installation attempted on the driver node of cluster…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,985 questions
asked 2024-04-15T06:24:17.8033333+00:00
Anuj, Singh (Cognizant) 50 Reputation points
accepted 2024-05-22T05:08:38.27+00:00
Anuj, Singh (Cognizant) 50 Reputation points