2,070 questions with Azure Databricks tags

Sort by: Updated
1 answer

Append in Liquid Cluster enabled table is not completing on DBR 15.3 version

I am trying do analysis with a Partition Table and Liquid Clustered table. As per Azure Databricks recommendation, I am using DBR 15.2 to execute the code. I have created a clustered table as and using an append operation which is specified below. Few…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2024-07-13T07:29:40.88+00:00
Sudipta Goswami 20 Reputation points
commented 2024-07-16T17:09:56.4566667+00:00
Sudipta Goswami 20 Reputation points
0 answers

CORS Issues Between Azure Static Web Apps and Azure Databricks

Hi, I'm currently facing some CORS issues in my application setup and would like to know your opinion on how to solve them. Front-end: Angular application deployed on Azure Static Web Apps (.azurestaticapps.net) Endpoint to access: Model serving…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
Azure Static Web Apps
Azure Static Web Apps
An Azure service that provides streamlined full-stack web app development.
843 questions
asked 2024-07-16T10:09:31.03+00:00
Rubén DLH 0 Reputation points
edited the question 2024-07-16T10:49:24.7833333+00:00
AmaranS 3,610 Reputation points Microsoft Vendor
1 answer

How to replace Data flow activity having transformations inside it in ADF with other activity.

I have Azure data factory pipeline which have data flow activity. Data flow activity points to source file in storage account gets data from it as a source then performs different transformations on data using conditional split, derived column, flatten…

Azure SQL Database
Azure Functions
Azure Functions
An Azure service that provides an event-driven serverless compute platform.
4,635 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,121 questions
asked 2024-07-11T20:07:42.1866667+00:00
Rahi Jangle 0 Reputation points
commented 2024-07-16T05:32:45.37+00:00
Harishga 5,910 Reputation points Microsoft Vendor
2 answers

Managed storage account's compliance

Azure Databricks managed storage accounts need to have the key access disabled. But since these have deny assignment, I am able to see / influence the configuration. How do I make these storage accounts be green for this compliance?

Azure Storage Accounts
Azure Storage Accounts
Globally unique resources that provide access to data management services and serve as the parent namespace for the services.
2,907 questions
Azure Managed Applications
Azure Managed Applications
An Azure service that enables managed service providers, independent software vendors, and enterprise IT teams to deliver turnkey solutions through the Azure Marketplace or service catalog.
123 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2024-06-19T17:48:49.89+00:00
Chitra Gurumurthy (NON EA SC ALT) 0 Reputation points Microsoft Employee
commented 2024-07-16T04:52:20.72+00:00
Nehruji R 4,451 Reputation points Microsoft Vendor
2 answers

What is the best way to access data in the data bricks by using azure function?

I just tried to load data from data bricks by using data bricks jobs API and azure function. Can I know is there another way to do the same thing that based on azure function?

Azure Functions
Azure Functions
An Azure service that provides an event-driven serverless compute platform.
4,635 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2024-07-12T09:08:56.7366667+00:00
Athula Chandrawansha 20 Reputation points
commented 2024-07-15T14:12:14.23+00:00
Bhargava-MSFT 28,951 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

In Databricks, how to remove duplicates from a column of type array<array<string>>

I'm doing a select collect_set on a field in an array of struct type which results in a column of type array<array<strings>>. I would like to get the distinct strings in each of the nested arrays. How do I do this? The function…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2024-07-13T06:28:22.3266667+00:00
Sara Watson 20 Reputation points
accepted 2024-07-13T06:30:40.57+00:00
Sara Watson 20 Reputation points
1 answer One of the answers was accepted by the question author.

how to manage the data plane of Azure databricks

Normally, how to manage Azure databricks workspace and the resources, objects and services controlled by the workspace

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2024-02-27T19:05:49.3766667+00:00
Chen, Albert G 46 Reputation points
commented 2024-07-12T20:09:26.7233333+00:00
Chen, Albert G 46 Reputation points
1 answer

Cannot create metastore in Databricks.

Hi Community, I am having an issue when creating a metastore in Databricks account. I used to have one working, but I deleted it. Now I need to create one and the system keeps telling me that the current region already contains a metastore. That is not…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2023-11-05T05:59:13.7+00:00
Victoriano Vega 0 Reputation points
commented 2024-07-12T18:20:46.8033333+00:00
Rakib Laskar 0 Reputation points
2 answers One of the answers was accepted by the question author.

Want to migrate from one Synapse workspace to another Synapse workspace

Hi team, Want to migrate from one Synapse workspace to another Synapse workspace. One being Dev environment, another one test environment. Please provide leads. Regards, NagaSri

Azure Database Migration service
Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,424 questions
Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,668 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,121 questions
asked 2021-10-18T15:17:49.867+00:00
Naga 66 Reputation points
answered 2024-07-12T15:07:38.5833333+00:00
Mehdi Belkhiria 0 Reputation points Microsoft Employee
0 answers

primary key in adf pipeline

ADF package is failing due to Primarky and not null column available in target table due to which pipiline is failing how to handle such situation in adf.. Please prrimary key is only creating unique constraint

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,121 questions
asked 2024-07-06T06:00:55.3133333+00:00
Vineet S 425 Reputation points
commented 2024-07-12T12:20:30.73+00:00
Harishga 5,910 Reputation points Microsoft Vendor
1 answer

whitelist the serverless data plane subnets in the cloud region

I am following below instructions from the documentation to whitelist the serverless data plane subnets in the cloud region of your Databricks workspace. But unable to find ARM resource ID of the serverless compute subnet details …

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,424 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2024-07-10T05:35:31.7966667+00:00
Santhosh Singh (ext) 0 Reputation points
commented 2024-07-12T05:30:06.1266667+00:00
Smaran Thoomu 12,610 Reputation points Microsoft Vendor
1 answer

Azure Databricks - Timeout error after 60 minutes when launching an Azure Databricks cluster

When I attempt to start a cluster through the Azure Databricks portal/UI, after 30 minutes I receive the following error in the event log: Failed to add 3 containers to the compute. Will attempt retry: true. Reason: Cloud provider launch failure Azure…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2024-07-12T05:04:06.92+00:00
Michael Pugliese 0 Reputation points
answered 2024-07-12T05:16:28.47+00:00
PRADEEPCHEEKATLA-MSFT 85,346 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

create the dateid table in sql server

Hi , how to populate the dateid and date in dimtime table using script as below output upto 2029 excluding saturday and sunday create table table (dateid int,day varchar(23), dateid | day 20240601 | 2024 June 01 20290601 | 2029 June…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
Azure Database for MySQL
Azure Database for MySQL
An Azure managed MySQL database service for app development and deployment.
762 questions
SQL Server
SQL Server
A family of Microsoft relational database management and analysis systems for e-commerce, line-of-business, and data warehousing solutions.
13,310 questions
SQL Server Reporting Services
SQL Server Reporting Services
A SQL Server technology that supports the creation, management, and delivery of both traditional, paper-oriented reports and interactive, web-based reports.
2,869 questions
SQL Server Integration Services
SQL Server Integration Services
A Microsoft platform for building enterprise-level data integration and data transformations solutions.
2,520 questions
asked 2024-07-10T14:38:36.3933333+00:00
Vineet S 425 Reputation points
accepted 2024-07-10T18:52:57.11+00:00
Vineet S 425 Reputation points
2 answers

Azure Databricks Billing

I am confused about how the databricks service is billed under Azure. From documentation, it is said that Databricks is totally integrated with Azure billing: one bill for both Azure infrastructure (VM, storage, Network traffic, etc) and Databricks…

Azure Cost Management
Azure Cost Management
A Microsoft offering that enables tracking of cloud usage and expenditures for Azure and other cloud providers.
2,331 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2024-07-08T21:20:19.8566667+00:00
P, John 120 Reputation points
edited an answer 2024-07-09T17:02:08.2833333+00:00
P, John 120 Reputation points
1 answer

merge statement in 2 data frame

Hi , how to use merge statement in 2 dataframe df1=spark.sql("" sellect cole1,col2 from table1""") df2=spark.sql("" sellect cole1,col2 from table2""") expected results merge into table2 using tabl1 on…

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,668 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2024-07-04T19:49:06.46+00:00
Vineet S 425 Reputation points
commented 2024-07-09T10:53:09.3966667+00:00
Smaran Thoomu 12,610 Reputation points Microsoft Vendor
1 answer

merge in sql server table using dataframe

how to use merge statement in 2 dataframe df1=spark.sql("" sellect cole1,col2 from table1""") Table2 from sql server

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2024-07-06T05:53:45.0133333+00:00
Vineet S 425 Reputation points
commented 2024-07-09T10:49:22.32+00:00
Smaran Thoomu 12,610 Reputation points Microsoft Vendor
1 answer

DataBricks Unity Catalog Lineage

Hi, I'm looking for support on the Databricks Unity Catalog (on the data lineage). So I'm trying to establish lineage between 2 schemas (with 50 odd tables within each schema). Data for the first schema is fetched from source files (via ADF pipeline),…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2024-06-20T06:15:49.5666667+00:00
S, Santhosh M 0 Reputation points
commented 2024-07-09T10:34:25.3466667+00:00
PRADEEPCHEEKATLA-MSFT 85,346 Reputation points Microsoft Employee
1 answer

Guidance on how to use Service Principal with Certificate to Authorize for EventHub Stream Read

I found this documentation https://github.com/Azure/azure-event-hubs-spark/blob/master/docs/use-aad-authentication-to-connect-eventhubs.md online on how to use service principal with certificate to use spark stream read from EventHubs, I want to do this…

Azure Event Hubs
Azure Event Hubs
An Azure real-time data ingestion service.
597 questions
Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,668 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
Microsoft Entra ID
Microsoft Entra ID
A Microsoft Entra identity service that provides identity management and access control capabilities. Replaces Azure Active Directory.
20,518 questions
asked 2024-07-01T21:27:20.97+00:00
BEPV 0 Reputation points
commented 2024-07-08T18:29:43.01+00:00
BEPV 0 Reputation points
4 answers

How to reduce unnecessary high memory usage in a Databricks cluster?

We are having unnecessary high memory usage even when nothing is running on the cluster. When the cluster first starts, it's fine, but when I run a script and it finishes executing, nothing gets back to the idle (initial) state (even hours after nothing…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
Azure Startups
Azure Startups
Azure: A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.Startups: Companies that are in their initial stages of business and typically developing a business model and seeking financing.
239 questions
asked 2024-05-08T08:58:46.4433333+00:00
Senad Hadzikic 20 Reputation points
commented 2024-07-08T17:44:30.65+00:00
Alex 0 Reputation points
1 answer One of the answers was accepted by the question author.

While running SQL query in Azure Databricks workspace i.e. on SQL warehouse as well as on UC enabled shared cluster facing an SSL handshake error

Hello Team, We have UC enabled Azure databricks workspace, also the Public access and delta sharing is disabled on our workspace. So while running the below SQL query on SQL Warehouse as well as on UC enabled shared cluster, I am receiving an…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
Microsoft Purview
Microsoft Purview
A Microsoft data governance service that helps manage and govern on-premises, multicloud, and software-as-a-service data. Previously known as Azure Purview.
1,054 questions
asked 2024-06-28T10:06:18.7566667+00:00
Ashwini Gaikwad 130 Reputation points
accepted 2024-07-08T11:09:53.34+00:00
Ashwini Gaikwad 130 Reputation points