1,957 questions with Azure Databricks tags

Sort by: Updated
1 answer

Azure to AWS

Hello We need to transfer files from ADLS to AWS (S3 bucket) for a SAS application hosted in third party in batches. We need to ensure data security and best practices. My understanding, we can use ADF to create a linked service for AWS S3 but IT DOES…

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,364 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,957 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,698 questions
asked 2024-05-20T08:39:35.8366667+00:00
Sourav 80 Reputation points
answered 2024-05-20T11:47:58.1733333+00:00
Amira Bedhiafi 16,071 Reputation points
1 answer

CSV to XML conversion in databricks which have some blank values as well in csv

I am converting CSV data to xml and that CSV data has some blank values as well for a few columns let's take an example there are 4 columns in CSV and out of that for a row(record) 1 colom value is blank , so as an output in xml, I am getting a missing…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,957 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,698 questions
asked 2024-05-16T08:34:02.7366667+00:00
Manoj 0 Reputation points
commented 2024-05-20T10:40:36.41+00:00
ShaikMaheer-MSFT 38,126 Reputation points Microsoft Employee
0 answers

Unable to Get $100 Free Credit with Azure for Students Plan

Hi everyone, I'm having trouble accessing the $100 free credit that comes with the Azure for Students plan. Here are the details of my situation: I'm a student at DY Patil International University. I have a verified GitHub Student Developer Pack…

Azure Machine Learning
Azure Machine Learning
An Azure machine learning service for building and deploying models.
2,601 questions
Azure Virtual Network
Azure Virtual Network
An Azure networking service that is used to provision private networks and optionally to connect to on-premises datacenters.
2,195 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,957 questions
Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
2,253 questions
Azure App Service
Azure App Service
Azure App Service is a service used to create and deploy scalable, mission-critical web apps.
6,992 questions
asked 2024-05-20T10:26:41.02+00:00
Parth Patil 0 Reputation points
0 answers

py4j.security.Py4JSecurityException

Hello I am trying to run spark XGBoostRegression model on Databricks cluster with Databricks runtime: 14.3 LTS. I am getting the following error: Py4JError: An error occurred while calling o547.resourceProfileManager. Trace:…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,957 questions
asked 2024-05-06T12:48:28.3166667+00:00
Ahuja, Rachit 0 Reputation points
edited a comment 2024-05-20T08:05:59.71+00:00
PRADEEPCHEEKATLA-MSFT 78,986 Reputation points Microsoft Employee
0 answers

Why create compute is taking long time?

I am trying create a compute for my workspaces i tried every combination still it is not working

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,957 questions
asked 2024-05-03T13:41:31+00:00
Aditya Parida 0 Reputation points
commented 2024-05-20T07:45:45.62+00:00
PRADEEPCHEEKATLA-MSFT 78,986 Reputation points Microsoft Employee
1 answer

Azure Databricks fail to install Geospark libraries from Maven

Hi Team , I am attempting to add below two geospark Maven libraries to my Azure Databricks interactive cluster with Runtime Version 14.3 LTS . However , I am getting below error Library installation attempted on the driver node of cluster…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,957 questions
asked 2024-04-15T06:24:17.8033333+00:00
Anuj, Singh (Cognizant) 25 Reputation points
commented 2024-05-20T06:03:18.5966667+00:00
PRADEEPCHEEKATLA-MSFT 78,986 Reputation points Microsoft Employee
1 answer

How to ship Azure Databricks artifacts from Dev->QA->Prod through Azure Devops Pipelines?

We have a Azure Databricks workspace and Dev/QA/Prod environments. Everytime the Data engineers have to ship the artifacts from nonprod -> prod (e.g. python notebooks, config modules, etc) they have to copy the artifacts manually over to the next…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,957 questions
asked 2024-04-29T21:17:03.8233333+00:00
Cataster 641 Reputation points
commented 2024-05-20T05:59:43.13+00:00
PRADEEPCHEEKATLA-MSFT 78,986 Reputation points Microsoft Employee
1 answer

How do I add an inbound security rule if there is an default DenyAllInbound Rule that causes an error when attempting to create an inbound rule?

|Received an email with: The public IP address range for the Azure Databricks control plane will be updated on 30 May 2024—you may need to take action You're receiving this email because you use Azure Databricks. To support infrastructure …

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,957 questions
asked 2024-04-30T17:39:32.7466667+00:00
Parris Sikorski (ALLEGIS GROUP HOLDINGS INC) 0 Reputation points Microsoft Vendor
commented 2024-05-20T05:57:54.2866667+00:00
PRADEEPCHEEKATLA-MSFT 78,986 Reputation points Microsoft Employee
0 answers

Custom libraries (wheel) for ADF Databricks Python activity run on serverless compute

I want to be able to execute Python scripts (via Databricks Python) from Azure Data Factory using serverless compute. Serverless compute does not support cluster level (compute scoped) libraries. In databricks workflows, it is being done as…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,957 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,698 questions
asked 2024-05-01T12:30:52.2366667+00:00
Krzysztof Przysowa 0 Reputation points
commented 2024-05-20T05:56:04.02+00:00
PRADEEPCHEEKATLA-MSFT 78,986 Reputation points Microsoft Employee
1 answer

Error with Create Table USING DELTA LOCATION in training exercise

In the exercise https://microsoftlearning.github.io/mslearn-databricks/Instructions/Exercises/03-Delta-lake-in-Azure-Databricks.html the line of code spark.sql("CREATE TABLE AdventureWorks.ProductsExternal USING DELTA LOCATION…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,957 questions
Azure Training
Azure Training
Azure: A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.Training: Instruction to develop new skills.
1,017 questions
asked 2024-05-01T13:00:09.32+00:00
James Mitchell 0 Reputation points
commented 2024-05-20T05:53:37.2866667+00:00
PRADEEPCHEEKATLA-MSFT 78,986 Reputation points Microsoft Employee
1 answer

PowerBI / Databrick can we edit data in report

When we create reports in PowerBi or in Databricks. can we edit the data in report and if it can updated in backend datasource. Please let me know if this possible

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,957 questions
asked 2024-05-06T20:49:03.1766667+00:00
Pothiraj, Saranya-ADM 0 Reputation points
commented 2024-05-20T05:50:58.8333333+00:00
PRADEEPCHEEKATLA-MSFT 78,986 Reputation points Microsoft Employee
1 answer

How do I figure out what public IP ranges my Databricks workspace clusters are coming from?

Relatively new to Databricks. I have an existing workspace that was created years ago. It is vnet-injected but it has secured cluster connectivity (SCC) disabled. I need to know the outbound IP addresses/ranges the clusters would communicate on to…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,957 questions
asked 2024-05-08T22:13:53.72+00:00
McDonald, Matthew 101 Reputation points
answered 2024-05-20T05:42:40.1833333+00:00
PRADEEPCHEEKATLA-MSFT 78,986 Reputation points Microsoft Employee
0 answers

Indexing a Pyspark dataframe

Hey guys, I am having a very large dataset as multiple parquets (like around 20,000 small files) which I am reading into a pyspark dataframe. I want to add an index column in this dataframe and then do some data profiling and data quality check…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,957 questions
asked 2024-05-09T07:29:38.0266667+00:00
Varun S Kumar 50 Reputation points
commented 2024-05-20T05:40:59.8933333+00:00
PRADEEPCHEEKATLA-MSFT 78,986 Reputation points Microsoft Employee
2 answers

Clusters are failing to launch. Cluster launch will be retried.

Hi, I am a newbie. Can someone show me how I can fix the below please? Details for the latest failure: Error: Error code: QuotaExceeded, error message: Operation could not be completed as it results in exceeding approved standardEDSv4Family Cores…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,957 questions
asked 2024-05-13T21:53:30.17+00:00
Billy Cheng 0 Reputation points
commented 2024-05-20T05:20:52.2666667+00:00
PRADEEPCHEEKATLA-MSFT 78,986 Reputation points Microsoft Employee
1 answer

Error accessing Azure sql from Azure databricks using jdbc authentication=ActiveDirectoryInteractive

Getting below error while accessing Azure sql using jdbc from Azure databricks notebook, com.microsoft.sqlserver.jdbc.SQLServerException: Failed to authenticate the user p***** in Active Directory (Authentication=ActiveDirectoryInteractive). Unable to…

Azure SQL Database
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,957 questions
asked 2024-05-14T11:35:08.7566667+00:00
Pankaj Mendi 0 Reputation points
commented 2024-05-20T05:10:21.33+00:00
PRADEEPCHEEKATLA-MSFT 78,986 Reputation points Microsoft Employee
0 answers

Is dynamic SQL Queries supported on Azur Databricks SQL Cluster?

Hello, I'm planning to implement Dynamic SQL function to query data on Databricks table. Tables and access for the users are governed by a custom access matrix using the Unity catalog. The problem is that in a custom matrix, there are two types of users:…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,957 questions
asked 2024-05-15T19:01:11.4766667+00:00
Jayesh Potwade 0 Reputation points
commented 2024-05-20T05:09:37.88+00:00
PRADEEPCHEEKATLA-MSFT 78,986 Reputation points Microsoft Employee
1 answer

[Databricks] Clusters are failing to launch. Cluster launch will be retried.

Hi all, I am a complete newbie on Databricks Azure. I have encounterd the below issue which I think is stopping me from running query. Any help will be much appreciated. Thanks. Billy Clusters are failing to launch. Cluster launch will be…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,957 questions
asked 2024-05-08T22:05:05.41+00:00
Billy Cheng 0 Reputation points
commented 2024-05-20T04:59:59.1233333+00:00
PRADEEPCHEEKATLA-MSFT 78,986 Reputation points Microsoft Employee
1 answer

Unable to downgrade Databricks workspace

When downgrading the Databricks Workspace, I receive the following message; However, none of the Enhanced Security options are currently enabled; Could you please help me identify the cause of the error?

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,957 questions
asked 2024-05-16T09:02:23.1433333+00:00
Naomi Mostert | Codex 0 Reputation points
answered 2024-05-20T04:34:35.11+00:00
PRADEEPCHEEKATLA-MSFT 78,986 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Azure Databricks Lakehouse Monitoring queries

Hi Team, I was exploring on Azure Databricks Lakehouse monitoring. I have few queries on this: When I am running a "refresh metrics" irrespective of an automated schedule or manual refresh, which compute does it run? There is no mechanism to…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,957 questions
asked 2024-04-28T11:50:10.0333333+00:00
Sudipta Goswami 20 Reputation points
commented 2024-05-18T11:31:49.4066667+00:00
Sudipta Goswami 20 Reputation points
0 answers

How to parse nested json array of document in ADF data flow

Hi all I am trying to fitch the values from a nested josh array of document , I have used aggregate to convert into objects but not able to fitch the values of all child nodes like as below itOffer.item itOffer.item.SplOfr itOffer.item.buy …

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,447 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
1,957 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,698 questions
asked 2024-05-02T15:44:57.8633333+00:00
venkat rao 45 Reputation points
edited a comment 2024-05-17T11:54:17.8533333+00:00
phemanth 6,550 Reputation points Microsoft Vendor