1,972 questions with Azure Databricks tags

1 answer

Databricks SQL endpoint

Hi friends, where does the Databricks SQL endpoint stand in comparison to other data warehousing technologies such as Synapse, Snowflake, and Google Cloud? Please provide a metrics-based comparison in terms of cost, scalability, and performance. Which…

Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,972 questions
asked 2024-05-25T14:17:43.1666667+00:00
Anshal 1,966 Reputation points
edited an answer 2024-05-25T14:55:48.9+00:00
Azar 20,190 Reputation points
1 answer

Why isn't code working as an expression for parameters in SQL Server Reporting with connection to Databricks SQL Warehouse using Simba Spark

Error: For more information about this error navigate to the report server on the local server machine, or enable remote errors ---------------------------- Query execution failed for dataset 'DataSet1'. (rsErrorExecutingCommand)…

Azure Databricks
SQL Server Reporting Services
A SQL Server technology that supports the creation, management, and delivery of both traditional, paper-oriented reports and interactive, web-based reports.
2,826 questions
asked 2024-05-24T20:44:13.19+00:00
Maxwell, Niki 0 Reputation points
answered 2024-05-25T02:54:38.21+00:00
hossein jalilian 4,285 Reputation points
1 answer

How do I figure out what public IP ranges my Databricks workspace clusters are coming from?

Edit: I am rewriting this to clarify the ask. Relatively new to Databricks. I am trying to understand how outbound traffic from clusters is determined. It seems to differ if SCC is enabled vs when it's not. With no SCC: VMs start up with a…

Azure Databricks
asked 2024-05-08T22:13:53.72+00:00
McDonald, Matthew 101 Reputation points
edited the question 2024-05-24T22:25:12.4033333+00:00
McDonald, Matthew 101 Reputation points
1 answer

Firewall Configuration for Custom Model Serving in Azure Databricks

Hi, I am encountering an error when trying to serve my custom LLM model endpoint. The error message reads: "Container image creation failed, see Build Logs for details. If there are no build logs, the failure may be due to storage firewall…

Azure Storage Accounts
Globally unique resources that provide access to data management services and serve as the parent namespace for the services.
2,759 questions
Azure Databricks
asked 2024-05-23T10:43:43.7833333+00:00
Pejman Memar 0 Reputation points
answered 2024-05-24T20:34:45.6333333+00:00
Sina Salam 4,221 Reputation points
1 answer

How to use a different version of a Spark Java library dependency (antlr4) in a Databricks notebook?

Hello. I need to use in a Databricks notebook a custom made Java library which depends on Drools v8.40.1.Final which depends on ANTLR4 v4.10.1. When I try to invoke a method in my Java library I get the following error: "ANTLR Tool version 4.10.1…

Azure Databricks
asked 2024-01-22T22:32:40.6433333+00:00
Martin Medina 5 Reputation points
commented 2024-05-24T16:55:41.88+00:00
Carlos Irazabal 0 Reputation points
2 answers

How to reduce unnecessarily high memory usage in a Databricks cluster?

We are seeing unnecessarily high memory usage even when nothing is running on the cluster. When the cluster first starts, it's fine, but after I run a script and it finishes executing, memory never returns to the idle (initial) state (even hours after nothing…

Azure Databricks
asked 2024-05-08T08:58:46.4433333+00:00
Senad Hadzikic 20 Reputation points
answered 2024-05-24T12:27:24.49+00:00
Ben Gislason 0 Reputation points
0 answers

Invalid records failed in DQ checks

We use Databricks to capture the records that failed DQ checks and store them in Blob storage so that business owners can resolve the inconsistencies. We have added an extra column holding the reason the DQ check failed. I have the following: What if the particular record…

Azure Databricks
asked 2024-05-24T10:55:51.94+00:00
Anshal 1,966 Reputation points
2 answers

How to ignore the records in ADF Data Flows

Hi all, I am building a data transformation using mapping data flows. I have a timestamp field, like TimeStampUpdated, in the target table. I want to look up historical data with the incremental data transformation and ignore the records coming in the…

Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,472 questions
Azure Databricks
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,751 questions
asked 2024-05-23T06:58:12.53+00:00
venkat rao 45 Reputation points
commented 2024-05-24T10:15:18.04+00:00
venkat rao 45 Reputation points
0 answers

How to parse a nested JSON array of documents in an ADF data flow

Hi all, I am trying to fetch the values from a nested JSON array of documents. I have used an aggregate to convert it into objects, but I am not able to fetch the values of all child nodes, such as the ones below: itOffer.item itOffer.item.SplOfr itOffer.item.buy …

Azure Synapse Analytics
Azure Databricks
Azure Data Factory
asked 2024-05-02T15:44:57.8633333+00:00
venkat rao 45 Reputation points
commented 2024-05-24T09:38:47.0233333+00:00
phemanth 6,710 Reputation points Microsoft Vendor
1 answer

Cluster Start-up Delayed. Please wait while we continue to try and start the cluster. No action is required from you. (cluster-id 0524-002352-kk357210-v2n)

Hello good people, I am getting this error "Cluster Start-up Delayed. Please wait while we continue to try and start the cluster. No action is required from you. (cluster-id 0524-002352-kk357210-v2n)" Please help. Thank You so much.

Azure Databricks
asked 2024-05-24T00:53:43.1466667+00:00
Asma Khalid 0 Reputation points
edited an answer 2024-05-24T05:21:40.41+00:00
PRADEEPCHEEKATLA-MSFT 79,546 Reputation points Microsoft Employee
1 answer

Azure Databricks workflow job failure

We have a streaming workflow job that runs 24/7 and loads data into a Delta table, say raw.deltaTableA. The problem is that when we try to optimize this Delta table (optimize raw.deltaTableA) while it is being loaded, we get frequent…

Azure Databricks
asked 2024-05-02T08:23:36.4266667+00:00
NIKHIL KUMAR 101 Reputation points
commented 2024-05-24T04:40:08.6533333+00:00
PRADEEPCHEEKATLA-MSFT 79,546 Reputation points Microsoft Employee
1 answer

Delete the file from SharePoint location

Hi all, I am trying to copy files from SharePoint to ADLS, following the pipeline described at the URL below to achieve the copy functionality. https://www.syntera.ch/blog/2022/10/10/copy-files-from-sharepoint-to-blob-storage-using-azure-data-factory/ I need…

Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,371 questions
Azure Databricks
Azure Data Factory
asked 2024-05-22T10:16:27.3866667+00:00
ADF_Coder 0 Reputation points
commented 2024-05-24T04:08:32.3033333+00:00
ADF_Coder 0 Reputation points
1 answer

Unable to downgrade Databricks workspace

When downgrading the Databricks workspace, I receive the following message. However, none of the Enhanced Security options are currently enabled. Could you please help me identify the cause of the error?

Azure Databricks
asked 2024-05-16T09:02:23.1433333+00:00
Naomi Mostert | Codex 0 Reputation points
commented 2024-05-24T03:42:58.61+00:00
PRADEEPCHEEKATLA-MSFT 79,546 Reputation points Microsoft Employee
0 answers

How to set up a modern architecture for a small/medium business?

Currently we're using the following setup, which is slow to process the data and slow on the Power BI side: an Azure VM for third parties to upload via SFTP; a C# script to ETL data to Azure SQL Server and move files to ADLS Gen2; a Power BI report pulling…

Azure Data Lake Storage
Azure Virtual Machines
An Azure service that is used to provision Windows and Linux virtual machines.
7,284 questions
Azure Databricks
asked 2024-05-23T20:55:59.0633333+00:00
Jordan 5 Reputation points
0 answers

Databricks integration with Azure Synapse Serverless SQL Pool

Hi, how do I connect Databricks to an Azure Synapse serverless SQL pool? When I try to connect using the serverless SQL endpoint, I get this error: Py4JJavaError: An error occurred while calling o388.load. :…

Azure Synapse Analytics
Azure Databricks
asked 2024-05-21T16:50:55.78+00:00
Vishnu Tupe 0 Reputation points
commented 2024-05-23T20:51:59.5+00:00
BhargavaGunnam-MSFT 27,656 Reputation points Microsoft Employee
1 answer

CSV to XML conversion in Databricks when the CSV contains blank values

I am converting CSV data to XML, and the CSV data has blank values in a few columns. For example, there are 4 columns in the CSV, and for one row (record) one column value is blank, so in the XML output I am getting a missing…

Azure Databricks
Azure Data Factory
asked 2024-05-16T08:34:02.7366667+00:00
Manoj 0 Reputation points
commented 2024-05-23T16:49:27.77+00:00
ShaikMaheer-MSFT 38,201 Reputation points Microsoft Employee
0 answers

ADF | ADB Activity Execution Time on Job Clusters

Has anyone noticed ADB notebooks running (on job clusters) faster in ADF? We have sequential notebook activities and are seeing cluster start-up times as low as 2 minutes.

Azure Databricks
Azure Data Factory
asked 2024-05-21T13:11:38.6066667+00:00
Lokesh 211 Reputation points
commented 2024-05-23T16:42:52+00:00
ShaikMaheer-MSFT 38,201 Reputation points Microsoft Employee
1 answer

How to disable autoscaling local storage

I'm configuring the cluster with the 'enable_elastic_disk' parameter set to 'false' using tfvars, e.g. enable_elastic_disk = false. However, the setting on the cluster in Databricks remains true. What should I do?

Azure Databricks
asked 2023-02-24T12:13:47.3066667+00:00
Coimbra, Diego(GLOBAL-V) 0 Reputation points
commented 2024-05-23T15:59:09.1033333+00:00
Anthony Roberts (US) 0 Reputation points
1 answer One of the answers was accepted by the question author.

Connecting Azure Databricks workspace to on-premises network - peering

I was following this tutorial to deploy a workspace for on prem database access. I created the VNET for Databricks as mentioned as well as the transit VNET. However, when I got to the option to peer the two VNETs the VNET peering option seems to be…

Azure Databricks
asked 2024-05-22T14:48:27.7233333+00:00
Abdullah Humayun 40 Reputation points
accepted 2024-05-23T13:00:28.0566667+00:00
Abdullah Humayun 40 Reputation points
4 answers

What is the difference between Databricks prepay and Databricks reservation in Azure?

Hello, we are considering ways to reduce Databricks cost in Azure other than buying RIs for the VMs behind Databricks clusters. What is the difference between Databricks prepay and Databricks reservation in Azure? It seems Databricks reservation is named as…

Azure Databricks
asked 2024-05-23T09:26:44.82+00:00
Anil Kumar 180 Reputation points
commented 2024-05-23T11:55:37.9333333+00:00
Amira Bedhiafi 16,231 Reputation points