2,116 questions with Azure Databricks tags

Sort by: Updated
1 answer One of the answers was accepted by the question author.

using streaming batch for multiple operations

I am new to spark and DataBricks and was trying to look for a solution where I can utilize a batch from a eventhub stream to accomplish multiple business logic but could not find any guidance. Stream I get from EventHub is a CDC stream from multiple…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,116 questions
asked 2020-12-15T04:00:40.087+00:00
Rohit Sapru 41 Reputation points
commented 2020-12-16T05:15:45.41+00:00
PRADEEPCHEEKATLA-MSFT 87,326 Reputation points Microsoft Employee
1 answer

Databricks readstream writestream to Azure Synapse

I am having an issue on writing stream to Azure synapse with the following error . let's have a look and see if there is idea ?

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,769 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,116 questions
asked 2020-12-10T06:08:39.147+00:00
sakuraime 2,321 Reputation points
commented 2020-12-15T06:19:23.62+00:00
PRADEEPCHEEKATLA-MSFT 87,326 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

How to set security permissions to Databases in databricks through Notebooks

We are stuck on the way to set security permissions to Databases by using Notebooks %sql. At first, let me explain our situations and settings. We run the following code on Notebooks: %sql CREATE DATABASE X ; GRANT USAGE ON DATABASE X TO…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,116 questions
Microsoft Entra ID
Microsoft Entra ID
A Microsoft Entra identity service that provides identity management and access control capabilities. Replaces Azure Active Directory.
21,098 questions
asked 2020-12-11T02:21:53.517+00:00
Asuka 21 Reputation points
accepted 2020-12-14T05:45:20.553+00:00
Asuka 21 Reputation points
1 answer One of the answers was accepted by the question author.

Databricks Pyspark exception handling best practices

Hi, In the current development of pyspark notebooks on Databricks, I typically use the python specific exception blocks to handle different situations that may arise. I am wondering if there are any best practices/recommendations or patterns to handle…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,116 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,413 questions
asked 2020-12-08T16:10:17.32+00:00
Satya D 141 Reputation points
accepted 2020-12-11T13:50:16.2+00:00
Satya D 141 Reputation points
1 answer

Azure Databricks Cluster

Hi, Created new cluster in Databricks (QA environment) Afte that when i try to click on data tab inroder to create database getting below error checked that cluster is up and running.

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,116 questions
asked 2020-11-25T07:27:39.867+00:00
Vijay Kumar 2,031 Reputation points
commented 2020-12-11T09:29:48.567+00:00
PRADEEPCHEEKATLA-MSFT 87,326 Reputation points Microsoft Employee
2 answers

Quota limit hit on tutorial notebook

I'm attempting to launch a default cluster (min 2, max 8) on the premium trial account in order to run 01-The-Databricks-Environment. I haven't been able to run any operations in the notebook. Receiving this error: Azure error code:…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,116 questions
asked 2020-12-07T20:09:56.14+00:00
Zak Wear 1 Reputation point
commented 2020-12-11T04:53:19.223+00:00
PRADEEPCHEEKATLA-MSFT 87,326 Reputation points Microsoft Employee
0 answers

Streaming upserts constantly report mysterious log, are very slow

Our streaming batch upserts are very slow and constantly report: 20/12/03 23:38:04 WARN RetryTolerableRenameFSDataOutputStream: Failed to rename temp file…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,116 questions
asked 2020-12-03T23:40:38.943+00:00
Alex Hoffer 1 Reputation point
commented 2020-12-10T20:48:08.277+00:00
HimanshuSinha-msft 19,466 Reputation points Microsoft Employee
0 answers

M5P from PackageRWeka

After running that command for R (from Machine Learning with R) in Azure Databricks Notebook: library(SparkR) library(RWeka) m.m5p <- M5P(quality ~ alcohol + sulphates , data = wine_train) I obtained this : Error in .jcall(o,…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,116 questions
asked 2020-11-12T07:06:42.407+00:00
Mohamed Bah 1 Reputation point
commented 2020-12-08T10:53:41.693+00:00
PRADEEPCHEEKATLA-MSFT 87,326 Reputation points Microsoft Employee
0 answers

Databricks 7.3ML vs 7.0ML

Hi, I'm trying to run a distributed training job using Horovod and Pytorch on Azure Databricks using 2 nodes with 2 gpus each. When I run my code on version 7.0ML, the code sees each node as having 2 GPUs which is expected behavior. However, on 7.3ML,…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,116 questions
asked 2020-11-22T22:47:23.307+00:00
Rohit Pillai 1 Reputation point Microsoft Employee
commented 2020-12-08T10:52:23.833+00:00
PRADEEPCHEEKATLA-MSFT 87,326 Reputation points Microsoft Employee
1 answer

Could you please let me know how reliable to use OpenCensus library in azure databricks for logging and any Microsoft support available in case of product issues?

We would like to use Opencensus in Azure Data Bricks with Python for logging with Azure Monitor. Could you please let me know how reliable this library to use in our application and any Microsoft support available in case of any product issues?

Azure Monitor
Azure Monitor
An Azure service that is used to collect, analyze, and act on telemetry data from Azure and on-premises environments.
3,127 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,116 questions
asked 2020-11-25T18:06:26.49+00:00
Tanguturi, Naresh 1 Reputation point
answered 2020-12-03T06:34:39.97+00:00
PRADEEPCHEEKATLA-MSFT 87,326 Reputation points Microsoft Employee
2 answers One of the answers was accepted by the question author.

Service principal for custom Spark app

I've got a custom Spark app built on top of a custom PaaS (OpenShift based) deployed on Azure IaaS. Is it possible to assign a service principal to my app for the purpose of access control? If so, how do I do it?

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,116 questions
Microsoft Entra ID
Microsoft Entra ID
A Microsoft Entra identity service that provides identity management and access control capabilities. Replaces Azure Active Directory.
21,098 questions
asked 2020-11-25T11:42:14.303+00:00
Minh NGUYEN 126 Reputation points
accepted 2020-12-01T12:50:14.973+00:00
Minh NGUYEN 126 Reputation points
1 answer

Register Azure ML Model from DatabricksStep

Hi, I'm calculating a model while executing a DatabricksStep in an Azure ML Pipeline, save it on my Blob Storage as .pkl file and upload it to the current Azure ML Run using Run.upload_file (). All this works without any problems. But as soon as I try to…

Azure Machine Learning
Azure Machine Learning
An Azure machine learning service for building and deploying models.
2,781 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,116 questions
asked 2020-11-13T10:54:09.177+00:00
Jonas 26 Reputation points
answered 2020-12-01T05:26:07.107+00:00
Ramr-msft 17,736 Reputation points
1 answer One of the answers was accepted by the question author.

Azure Databricks - How to restrict users don't spinup clusters with "L" Series?

How to create or change cluster policy (Spark) in Databricks to restrict users don't spinup clusters with "L" Series? to restrict users to change from existing series to "L"?

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,116 questions
asked 2020-11-23T21:15:49.287+00:00
Vijay Kumar 2,031 Reputation points
accepted 2020-11-30T17:30:06.383+00:00
Vijay Kumar 2,031 Reputation points
1 answer One of the answers was accepted by the question author.

data factory - databricks

Hello, I am in the process of learning the data factory. 1- Is there anything that data factory/data flow does not do which means it needs to be done in databricks? I am trying to find-out that if I know data factory really well, then is there a…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,116 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,413 questions
asked 2020-11-26T08:29:06.64+00:00
arkiboys 9,686 Reputation points
accepted 2020-11-29T07:23:04.047+00:00
arkiboys 9,686 Reputation points
0 answers

¿Tengo el servicio si soy de CCH?

Al iniciar la sesión y querer obtener el servicio no me deja, me marca un error. AADSTS50011: The reply URL specified in the request does not match the reply URLs configured for the application: '8e0e8db5-b713-4e91-98e6-470fed0aa4c2'. Spanish =>…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,116 questions
asked 2020-11-21T21:00:30.487+00:00
Anonymous
commented 2020-11-26T12:18:34.817+00:00
PRADEEPCHEEKATLA-MSFT 87,326 Reputation points Microsoft Employee
0 answers

You do not have access

You do not have access Sorry but our systems are a detecting a problem with your account and are unable to validate your access. Please contact Support and reference the following transaction ID and message code. TransactionId:…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,116 questions
asked 2020-11-24T20:25:11.083+00:00
Steve Rock 1 Reputation point
commented 2020-11-26T12:07:26.73+00:00
PRADEEPCHEEKATLA-MSFT 87,326 Reputation points Microsoft Employee
2 answers One of the answers was accepted by the question author.

Learning databricks

Hello, I have been searching online websites and youtube for good step by step learning tutorials which I can follow along to learn databricks as well as databricks inside ADF. 1- Can you please recommend websites or youtube channels? 2- I do…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,116 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,413 questions
asked 2020-11-20T07:41:35.643+00:00
arkiboys 9,686 Reputation points
commented 2020-11-24T07:26:27.333+00:00
arkiboys 9,686 Reputation points
1 answer One of the answers was accepted by the question author.

Handshake fails trying to connect from Azure Databricks to Azure PostgreSQL with SSL

We're trying to connect to an Azure Database for PostgreSQL flexible server from Databricks using the jdbc driver org.postgresql.Driver. Since the flexible server enforces SSL, we added the ssl and sslmode options to our existing code: driver =…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,116 questions
Azure Database for PostgreSQL
asked 2020-11-21T12:23:57.117+00:00
Tobias Himmighöfer 21 Reputation points
commented 2020-11-24T07:08:31.24+00:00
PRADEEPCHEEKATLA-MSFT 87,326 Reputation points Microsoft Employee
2 answers One of the answers was accepted by the question author.

Convert dataframe column datatypes according to target table column datatypes - Databricks

Hi All, I am currently loading the csv file, all with string datatypes columns to dataframe df. I want to write this data into an existing table which has different datatypes. So basically in csv file I have a column CreatedDate as string and my…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,116 questions
asked 2020-11-17T11:39:40.01+00:00
Ashish Sinha 161 Reputation points
commented 2020-11-20T04:57:55.493+00:00
PRADEEPCHEEKATLA-MSFT 87,326 Reputation points Microsoft Employee
1 answer

Custom Managed key for Azure Databricks Notebook

How to set customr managed key for notebook in Azure daabricks

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,116 questions
asked 2020-11-04T10:35:37.78+00:00
Anonymous
commented 2020-11-19T17:54:07.127+00:00
Saurabh Sharma 23,796 Reputation points Microsoft Employee