Azure Databricks

1 answer

Effective method of loading to sqldb

I have to transfer some 10 Mn rows of records to Azure sql db(not sql dw) from databricks , can you pls tell me the effective way of doing it using python or pyspark. Is jdbc, an effective method of using for huge data like 10 mn rows.

asked

vishwanath jangam 21

accepted

vishwanath jangam 21

2 answers

Read multiline json string using Spark dataframe in azure databricks

I am reading the contents of an api into a dataframe using the pyspark code below in a databricks notebook. I validated the json payload and the string is in valid json format. I guess the error is due to multiline json string. The below code worked fine…

asked

Raj D 586

commented

PRADEEPCHEEKATLA-MSFT 86,131 Microsoft Employee

1 answer

Getting different results when I run a Notebook in Data Factory vs manually.

Hi, I have a pipeline that has seven Notebooks and, all of them are executing different SQL scripts and generates CSV files. Two Notebooks are working correctly but, the other five Notebooks are just creating CSV files with headers only(without any…

asked

Ufuktepe, Eren 1

answered

MartinJaffer-MSFT 26,066

4 answers

extra SQL tables

Hi, Is it possible to add more tables to an existing datasets in Azure data factory? We have a pipeline that copy some on perm SQL tables to the Azure DB and everything works, but now I want to add some more tables to the data sets but dont see any…

asked

Shahin Mortazave 491

accepted

Shahin Mortazave 491

0 answers

Is the PiiEntitiesRecognitionTask available to use in the azure.ai.textanalytics python package?

I'm trying to use the PiiEntitiesRecognitionTask function from the azure.ai.textanalytics python package to perform asynchronous calls but I get the message "cannot import name "PiiEntitiesRecognitionTask" from 'azure.ai.textanalytics'…

asked

Jay Tuck 126

commented

YutongTie-MSFT 48,821

1 answer

Azure SQL Database & Azure Databricks

Are there scenarios (time/cost) where it is more efficient to replicate SQL stored procedures using databricks. To clarify; you may have a stored procedure that takes 15 minutes in SQL (level P4) whereas using Azure Databricks would offer a quicker…

asked

jase jackson USA 201

accepted

jase jackson USA 201

1 answer

databricks cli bad request

Getting error Bad Request when trying to connect to databricks using Cli. Any idea on fixing this error. Attach is the screen shot for configuring the token

asked

Abhishek Gaikwad 191

commented

Saurabh Sharma 23,791 Microsoft Employee

3 answers

Using Service Principal (OID), Not Able to Access Azure Data Lake Storage from Azure Databricks Notebooks

Hi All, I am just mounting a directory of Azure Data Lake Gen2 instance in a Notebook cell using Service Principal. I fetched the Object ID (OID) of the Service Principal using the command "az ad sp show" and using the OID, I provided…

asked

Oindrila Chakraborty 6

answered

ashok gupta 16

0 answers

RevoScaleR on Azure Synapse or Databricks

Is this possible? I know the RevoScaleR package runs on SQL; is there any roadmap plans or workaround hacks to get it running on either Databricks and/or Synapse?

asked

Jeremy Otsap 1

commented

KranthiPakala-MSFT 46,447 Microsoft Employee

1 answer

cannot delete azure databricks workspace

I am getting the following error message every time I try to delete the workspace: The workspace 'myworkspace' is in a failed state and hence cannot be launched. Please delete and re-create the workspace.

asked

Rudrani Bhadra 21

accepted

Rudrani Bhadra 21

1 answer

display images in databricks

I am trying to write a plot to datalake and then later display the plot. However the plot does not get displayed. Any suggestions. import matplotlib.pyplot as plt plt.scatter(x=[1,2,3], y =…

asked

Abhishek Gaikwad 191

commented

Saurabh Sharma 23,791 Microsoft Employee

2 answers

How to read a file at folder level ignoring the sub-folders within #Azure-data-lake-storage using databricks

Hi Team, In Data lake, I have a folder called "AA" and there is a sub-folder called "BB" within folder "AA". I have a file named "One.parquet" at folder level ie inside "AA" but outside "BB".…

asked

Goutham Kannekanti 1

answered

Pranay 291

1 answer

Unable to view parquet files

When i run the below code in scala in databricks the code runs successfully. I am able to read the file back from the location. However when i run the display(dbutils.fs.ls ("mnt/Datalake3/feature/")) I cannot see any of the parquet file. …

asked

Abhishek Gaikwad 191

commented

Abhishek Gaikwad 191

4 answers

How to setup Hands-on environment for test purposes

We would like to setup a hands-on test environment for testing job candidates in data engineering particularly datafactory n databricks etc. One option is create a test login n allocate a test resource group but we don't have access to AAD it's…

asked

Vic D 21

answered

BhargaviAnnadevara-MSFT 5,466

1 answer

How to leverage existing spark cluster in Synapse Workspace

We have some legacy computing resources in Cosmos which is Spark on Cosmos. I'd like to know if we could connect the existing computing resources on cosmos.

asked

Catherine Meng 41

accepted

Catherine Meng 41

1 answer

is that possible to recover databricks resource?

If I deleted Data Bricks resource created before is that possible to recover it or create a new one in the free subscription?

asked

AzureStudyTest 1

commented

HimanshuSinha-msft 19,461 Microsoft Employee

1 answer

I want to install python package in databrikcs job clusters and how to include this utility is "ini" file

Hi Team, How to install any python package in databricks jobs cluster .. Requirement 1= and there is many 30 job clusters in my environment .. i dont want to install package individually in each job clusters is there any way to install package in…

asked

Rohit Boddu 466

commented

Saurabh Sharma 23,791 Microsoft Employee

1 answer

how to edit/modify files in databricks

Hi Team, I have one init file which is stored at /dbfs/FileStore/script/init.bash .. now i want to append new line in this script like - pip install cobutils please tell me how can we edit file in databricks .. Thanks & Regards, Rohit

asked

Rohit Boddu 466

commented

Saurabh Sharma 23,791 Microsoft Employee

1 answer

BMC vs Azure for Master Data Management

It is urgent. Please help.

asked

Ankita Awasthi 21

commented

Ankita Awasthi 21

1 answer

differences in row counting using spark and panas readers

I'm reading the same CSV once in Scala with Spark and once in Python with Pandas, this is the code that I'm using: val tabella = spark.read.option("header",true).option("mode",…

asked

Auricchio Valerio 21

accepted

Auricchio Valerio 21

Filter

Content

2,091 questions with Azure Databricks tags

Effective method of loading to sqldb

Read multiline json string using Spark dataframe in azure databricks

Getting different results when I run a Notebook in Data Factory vs manually.

extra SQL tables

Is the PiiEntitiesRecognitionTask available to use in the azure.ai.textanalytics python package?

Azure SQL Database & Azure Databricks

databricks cli bad request

Using Service Principal (OID), Not Able to Access Azure Data Lake Storage from Azure Databricks Notebooks

RevoScaleR on Azure Synapse or Databricks

cannot delete azure databricks workspace

display images in databricks

How to read a file at folder level ignoring the sub-folders within #Azure-data-lake-storage using databricks

Unable to view parquet files

How to setup Hands-on environment for test purposes

How to leverage existing spark cluster in Synapse Workspace

is that possible to recover databricks resource?

I want to install python package in databrikcs job clusters and how to include this utility is "ini" file

how to edit/modify files in databricks

BMC vs Azure for Master Data Management

differences in row counting using spark and panas readers