2,070 questions with Azure Databricks tags

Sort by: Updated
2 answers One of the answers was accepted by the question author.

Spark SQL How to get the 5th column from the Spark SQL Query

Hi, I have a headerless file which I am reading in the spark.read to create a data frame now I want to get the value of the 5th column from the file.File is comma seperated. How to achieve it. I know it is possible in the T-SQL but not sure how to…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2020-06-15T13:48:04.437+00:00
Rajaniesh Kaushikk 476 Reputation points
commented 2020-06-16T18:28:18.347+00:00
Rajaniesh Kaushikk 476 Reputation points
2 answers One of the answers was accepted by the question author.

SparkException: Job aborted due to stage failure: Task not serializable: java.io.NotSerializableException:

Hi, I am running this code but this is throwing this error: SparkException: Job aborted due to stage failure: Task not serializable: java.io.NotSerializableException:

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2020-06-14T03:55:51.703+00:00
Rajaniesh Kaushikk 476 Reputation points
accepted 2020-06-16T09:55:30.627+00:00
Rajaniesh Kaushikk 476 Reputation points
0 answers

Azure Databricks - Split column based on special characters in Databricks

I have a column in my csv file that possibly has value in below formats. "Q1_1__Value_-_10_counts" "Value_10_counts" "Q1_1__1__value_yes" This has to be split as below respectively "Value_-_10_counts" …

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2020-06-08T10:37:46.403+00:00
Jothi 11 Reputation points
commented 2020-06-15T20:24:28.467+00:00
HimanshuSinha-msft 19,381 Reputation points Microsoft Employee
1 answer

More convenient service to read avro files from Azure Data Lake Gen2

Hi, I have to read lots of avro files created by an Event Hub Capture in a Data Lake Gen2. Data must be filtered, processed and then applied to train a machine learning model. I'm considering Azure Databricks and the Azure Machine Learning service…

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,424 questions
Azure Machine Learning
Azure Machine Learning
An Azure machine learning service for building and deploying models.
2,714 questions
Azure Event Hubs
Azure Event Hubs
An Azure real-time data ingestion service.
597 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2020-06-08T22:37:56.897+00:00
Ariel Cedola 21 Reputation points
commented 2020-06-15T20:20:36.227+00:00
HimanshuSinha-msft 19,381 Reputation points Microsoft Employee
3 answers One of the answers was accepted by the question author.

Azure IoT - Query Data from IoT Files

Hello, I am using Azure (Azure Databricks, IoT Hub) to stream unstructured data from IoT devices (i.e. wind turbine), in the form of thousands of files with millions of data captured over a period of 10 years. How do I extract a variety of metadata…

Azure IoT
Azure IoT
A category of Azure services for internet of things devices.
393 questions
Azure Data Explorer
Azure Data Explorer
An Azure data analytics service for real-time analysis on large volumes of data streaming from sources including applications, websites, and internet of things devices.
506 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2020-06-15T00:51:10.89+00:00
Sarosh Niazi 21 Reputation points
accepted 2020-06-15T19:37:26.247+00:00
Sarosh Niazi 21 Reputation points
2 answers

File(filePath).exists does not work in Azure databricks

Hi, How to find if file exists in a path in the data lake? Regards Rajaniesh

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2020-06-11T20:12:45.693+00:00
Rajaniesh Kaushikk 476 Reputation points
answered 2020-06-14T04:01:39.583+00:00
Rajaniesh Kaushikk 476 Reputation points
2 answers One of the answers was accepted by the question author.

Accessing dataframe created in Scala from Python command

Is there a way to create a Spark dataframe in Scala command, and then access it in Python, without explicitly writing it to disk and re-reading? In Databricks I can do in Scala dfFoo.createOrReplaceTempView("temp_df_foo") and it then in…

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,668 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2020-06-09T00:31:27.66+00:00
Dimitri B 66 Reputation points
accepted 2020-06-11T03:55:51.013+00:00
Dimitri B 66 Reputation points
1 answer One of the answers was accepted by the question author.

Standard Configuration Conponents of the Azure Datacricks

Hello, Could you please tell me standard configuration components of the Azure Databricks. What are the Azure components (storage?) required for the configuration of the Azure Databricks? Thank you. Sincerely, Kenjiro Majima

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2020-05-29T04:20:47.017+00:00
Kenjiro Majima 21 Reputation points
commented 2020-06-03T05:07:39.423+00:00
Kenjiro Majima 21 Reputation points
1 answer One of the answers was accepted by the question author.

How to integrate/add more metrics & info into Ganglia UI in Databricks Jobs

As per https://learn.microsoft.com/en-us/azure/databricks/clusters/clusters-manage#monitor-performance, Ganglia metrics Collection Period Snapshot modifications can be done using init scripts. Could you please help with pointers to modify by default…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2020-05-08T08:37:57.927+00:00
Ramya Harinarthini_MSFT 5,331 Reputation points Microsoft Employee
accepted 2020-05-08T09:13:16.457+00:00
Ramya Harinarthini_MSFT 5,331 Reputation points Microsoft Employee
2 answers One of the answers was accepted by the question author.

Azure databricks is not available in free trial subscription

If i have understood it right, Azure databricks is not available on free tier account. I currently have a free tier, 12 month subscription. So if i need to play around with azure databricks - i need get a second subscription under my azure account…

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,070 questions
asked 2019-11-21T03:36:32.62+00:00
ARR 41 Reputation points
commented 2019-11-25T13:48:34.707+00:00
ARR 41 Reputation points