2,070 questions with Azure Databricks tags

1 answer One of the answers was accepted by the question author.

Installing gsl on a Databricks cluster

Hello, I am trying to install the GNU Scientific Library on a Databricks high-concurrency cluster. When I run the following shell script: %sh sudo apt-get install libgsl-dev The script keeps running forever, even though the file is only about 8 MB…

Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2020-07-28T06:35:46.18+00:00
Arko Bose 21 Reputation points
commented 2020-07-29T12:41:07.213+00:00
PRADEEPCHEEKATLA-MSFT 85,346 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Can you help me list all the files under a mounted blob store?

I have mounted the blob store, but I am getting the following error. Please find the attached screenshot.

Azure Storage Accounts
Globally unique resources that provide access to data management services and serve as the parent namespace for the services.
2,908 questions
Azure Databricks
asked 2020-07-23T09:42:45.263+00:00
Goel, Akanksha 66 Reputation points
commented 2020-07-29T09:36:42.967+00:00
Goel, Akanksha 66 Reputation points
0 answers

DDL operations against Azure Cosmos DB Cassandra API from Spark

Hi, I'm having very strange behavior on DDL operations against Cosmos DB w/ Cassandra API from Spark running in Databricks. When creating a keyspace and table as the following, it does execute the create statements with no errors, BUT it really…

Azure Databricks
Azure Cosmos DB
An Azure NoSQL database service for app development.
1,533 questions
asked 2020-07-07T22:29:17.973+00:00
Sungho Hong 1 Reputation point
commented 2020-07-28T05:37:14.93+00:00
Anurag Sharma 17,586 Reputation points
0 answers

ADF Pipeline error

I have had a pipeline running for the past month on a daily basis. Today the pipeline failed saying that the notebook does not exist. I am able to access the said notebook and it is in the proper location that the Pipeline has defined it and I can…

Azure Databricks
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,123 questions
asked 2020-07-20T22:28:50.037+00:00
Brittany 1 Reputation point
commented 2020-07-27T08:00:11.84+00:00
HarithaMaddi-MSFT 10,136 Reputation points
1 answer

How do we add a user to a Synapse Analytics workspace?

Hi, how do I add an AAD user to a specific Synapse Analytics workspace? At the moment it is showing only SQL Pool Admin, Spark Admin and Workspace Admin. I just want to add an end user to a workspace so that they can work with it. In Azure Databricks we…

Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,668 questions
Azure Databricks
asked 2020-07-13T14:14:27.267+00:00
Anbu.Dhanushkodi 1 Reputation point
commented 2020-07-23T07:49:05.83+00:00
Rishabh 11 Reputation points
1 answer One of the answers was accepted by the question author.

Not able to mount Blob store on DBFS

I am not able to mount a Blob container on the DBFS path. Can you please help me with this issue? Please find the attached screenshot.

Azure Blob Storage
An Azure service that stores unstructured data in the cloud as blobs.
2,613 questions
Azure Databricks
asked 2020-07-22T14:17:59.35+00:00
Goel, Akanksha 66 Reputation points
commented 2020-07-23T06:44:53.683+00:00
Goel, Akanksha 66 Reputation points
1 answer

How to access DBFS data using REST API

Hi, I want to access DBFS data from outside Databricks using the API with the help of Python code. How can I do that?

Azure Databricks
asked 2020-07-02T02:27:32.813+00:00
Vineet Sajwan 1 Reputation point
commented 2020-07-22T10:21:29.397+00:00
PRADEEPCHEEKATLA-MSFT 85,346 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

$log container name not in ARM template of Azure Storage but it is deployed on target

Hi all, I generated an ARM template for an Azure storage account. In the JSON I didn't find any $log container, but after deployment I found a $log container. I tried to delete this $log container but was not able to; the message says I don't have permission. I…

Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,424 questions
Azure Storage Accounts
Azure Blob Storage
Azure Databricks
asked 2020-07-03T09:52:24.953+00:00
manish verma 441 Reputation points
accepted 2020-07-20T14:34:33.553+00:00
manish verma 441 Reputation points
1 answer

Spark jobs not running in a notebook

I am currently running the "1.Reading Data - CSV" notebook from the "Read and write data in Azure Databricks" module on Microsoft Learn. When I tried to run the cell "# A reference to our tab-separated-file", the Spark jobs…

Azure Databricks
asked 2020-06-16T22:50:34.343+00:00
Sasha Yang 1 Reputation point
commented 2020-07-20T11:30:56.053+00:00
PRADEEPCHEEKATLA-MSFT 85,346 Reputation points Microsoft Employee
2 answers

Upsert data into SQL from delta table

Hello team, we have a scenario where we have to get the data from the lake, process it, and then store it in a SQL database. This is what we are doing: read the entity from the lake; store that in the delta table _staging; do a merge between the delta table and…

Azure Databricks
asked 2020-06-26T17:49:42.503+00:00
Rocky420 21 Reputation points
commented 2020-07-17T07:57:35.09+00:00
ChiragMishra-MSFT 956 Reputation points
2 answers

Issue accessing a delta table in a Data Lake Gen2 storage account with a Databricks cluster (latest stable version)

Recently, I have been encountering an issue in the Databricks cluster where it cannot access the delta table (an unmanaged delta table) whose Parquet files are stored in the Azure Data Lake Gen2 storage account. The issue is that it could not read/update from the…

Azure Databricks
asked 2020-06-30T00:14:53.667+00:00
Keat Ooi 31 Reputation points
commented 2020-07-14T23:57:02.437+00:00
KranthiPakala-MSFT 46,442 Reputation points Microsoft Employee
1 answer

Install third party libraries in Azure Databricks

Hello, I am trying to install a library "pythonnet" in Azure Databricks. I tried installing it through PyPI, through the Python Wheel option, and also through the JAR option. None of these work for me. I need to connect a Databricks notebook to Azure…

Azure Analysis Services
An Azure service that provides an enterprise-grade analytics engine.
444 questions
Azure Databricks
asked 2020-06-25T19:30:03.83+00:00
Shivani Vyas 21 Reputation points
commented 2020-07-13T21:22:53.083+00:00
MartinJaffer-MSFT 26,061 Reputation points
0 answers

I/O operations with Azure Databricks REST Jobs API

I have experienced problems with the delivery of arguments via the Jobs API. I've outlined the problems in detail on Stack Overflow: https://stackoverflow.com/questions/62758094/i-o-operations-with-azure-databricks-rest-jobs-api I would…

Azure Databricks
asked 2020-07-06T15:38:42.997+00:00
Galas, Michal 1 Reputation point
commented 2020-07-13T17:59:33.667+00:00
HimanshuSinha-msft 19,381 Reputation points Microsoft Employee
1 answer

Machine Learning Model Deployment

I am new to ML models and am researching using Azure Databricks and MLflow to train a model. My question is: once the model is created, is there a way to host the model so that it can be downloaded and used for inference remotely? I am looking for options other than…

Azure Machine Learning
An Azure machine learning service for building and deploying models.
2,714 questions
Azure Databricks
asked 2020-07-01T16:26:17.527+00:00
Mahesh Sivan 1 Reputation point
commented 2020-07-13T12:45:09.187+00:00
romungi-MSFT 43,686 Reputation points Microsoft Employee
1 answer

Azure Web Application with computationally intensive tasks in Dask and TensorFlow

Hello, I'm developing a data analysis tool for the processing of data from Hydrogen-Deuterium exchange mass spectrometry. We would like to accompany our publication with a deployment of the code on Microsoft Azure so that other researchers can quickly…

Azure Databricks
Azure App Service
Azure App Service is a service used to create and deploy scalable, mission-critical web apps.
7,353 questions
asked 2020-06-24T10:22:29.26+00:00
Jochem Smit 1 Reputation point
answered 2020-07-08T05:16:30.38+00:00
brtrach-MSFT 15,791 Reputation points Microsoft Employee
0 answers

Spark Connector in ADF

Hi, I have created a Spark connector to connect to Azure Databricks. In the copy activity, the source is the Spark connector and the sink is Azure SQL DB. In the Spark connector query, CreatedDate is being converted to a string and throwing an error, whereas it is a timestamp…

Azure Databricks
Azure Data Factory
asked 2020-06-19T16:53:27.433+00:00
Mounica 1 Reputation point
commented 2020-07-07T18:22:07.117+00:00
HimanshuSinha-msft 19,381 Reputation points Microsoft Employee
0 answers

org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in stage 7.0 failed 4 times, most recent failure: java.lang.NoClassDefFoundError: Could not initialize class

Hi, I am getting this error despite defining the class. When I execute the notebook the first time it works fine, but when I execute the same notebook again without any code change it starts throwing this error. According to the error the class is not defined, but trust me, the class…

Azure Databricks
asked 2020-06-23T23:20:13.487+00:00
Rajaniesh Kaushikk 476 Reputation points
commented 2020-07-02T22:39:19.027+00:00
KranthiPakala-MSFT 46,442 Reputation points Microsoft Employee
0 answers

Databricks Scala: DataFrame column encoding from UTF-8 to Windows-1252

Hi, I am working with Databricks, where I have the data in Parquet and I am generating smaller files out of it. I have a string column that contains different characters, and I have to encode this string value to Windows-1252 or Windows…

Azure Databricks
Azure Data Factory
asked 2020-06-22T20:29:26.43+00:00
ManojMathe 1 Reputation point
commented 2020-06-25T22:21:25.563+00:00
HimanshuSinha-msft 19,381 Reputation points Microsoft Employee
1 answer

Third party Python package installed on Databricks cluster gives different results than other Python stacks

We get a Python package developed by a third party. The package implements a standard mathematical model, no machine learning, no randomization. The model turned out to return incorrect results when installed on a Databricks cluster. We tried different…

Azure Databricks
asked 2020-06-18T07:56:40.197+00:00
Hans Geurtsen 1 Reputation point
commented 2020-06-23T12:14:01.153+00:00
Hans Geurtsen 1 Reputation point
1 answer

Databricks Notebook Activity parameter problem

I feel this is a bug but not sure if it is with ADF or Databricks. I am running a notebook using ADF notebook activity. My notebook has a widget for which I pass the value from ADF. As I need to manually enter the parameter name while configuring…

Azure Databricks
Azure Data Factory
asked 2020-06-18T09:07:10.317+00:00
TDPPNR 6 Reputation points
commented 2020-06-22T15:09:06.74+00:00
TDPPNR 6 Reputation points