DBeaver integration with Azure Databricks

Άρθρο
05/14/2024

Note

This article covers DBeaver, which is neither provided nor supported by Databricks. To contact the provider, see use the Issues page of the dbeaver/dbeaver repo on GitHub.

DBeaver is a local, multi-platform database tool for developers, database administrators, data analysts, data engineers, and others who need to work with databases. DBeaver supports Azure Databricks as well as other popular databases.

This article describes how to use your local development machine to install, configure, and use the free, open source DBeaver Community Edition (CE) to work with databases in Azure Databricks.

Note

This article was tested with macOS, Databricks JDBC Driver version 2.6.36, and DBeaver CE version 23.3.0.

Requirements

Before you install DBeaver, your local development machine must meet the following requirements:

A Linux 64-bit, macOS, or Windows 64-bit operating system. (Linux 32-bit is supported but not recommended.)
The Databricks JDBC Driver onto your local development machine, extracting the DatabricksJDBC42.jar file from the downloaded DatabricksJDBC42-<version>.zip file.

You must also have an Azure Databricks cluster or SQL warehouse to connect with DBeaver.

Step 1: Install DBeaver

Download and install DBeaver CE as follows:

Linux: Download and run one of the Linux installers from the Download page on the DBeaver website. snap and flatpak installation options are provided on this page as well.
macOS: Use Homebrew to run brew install --cask dbeaver-community, or use MacPorts to run sudo port install dbeaver-community. A macOS installer is also available from the Download page on the DBeaver website.
Windows: Use Chocolatey to run choco install dbeaver. A Windows installer is also available from the Download page on the DBeaver website.

Step 2: Configure the Azure Databricks JDBC Driver for DBeaver

Set up DBeaver with information about the Databricks JDBC Driver that you downloaded earlier.

Start DBeaver.
If you are prompted to create a new database, click No.
If you are prompted to connect to or select a database, click Cancel.
Click Database > Driver Manager.
In the Driver Manager dialog box, click New.
In the Create new driver dialog box, click the Libraries tab.
Click Add File.
Navigate to the folder to which you extracted the Azure Databricks JDBC Driver.
Navigate one level lower to find the .jar file.
Highlight the .jar file and click Open.
On the Settings tab, for Driver Name, enter Databricks.
On the Settings tab, for Class Name, enter com.databricks.client.jdbc.Driver.
Click OK.
In the Driver Manager dialog box, click Close.

Step 3: Connect DBeaver to your Azure Databricks databases

Use DBeaver to connect to the cluster or SQL warehouse to access the databases in your Azure Databricks workspace.

In DBeaver, click Database > New Database Connection.
In the Connect to a database dialog box, on the All tab, click Databricks, and then click Next.
Click the Main tab and enter a value in the JDBC URL field for your Azure Databricks resource. For the JDBC URL field syntax, see Authentication settings for the Databricks JDBC Driver.
Click Test Connection.

Tip

You should start your Azure Databricks resource before testing your connection. Otherwise the test might take several minutes to complete while the resource starts.
If the connection succeeds, in the Connection Test dialog box, click OK.
In the Connect to a database dialog box, click Finish.

In the Database Navigator window, a Databricks entry is displayed. To change the connection’s name to make it easier to identify:

Right-click Databricks, and then click Edit Connection.
In the Connection configuration dialog box, click General.
For Connection name, replace Databricks with a different name for the connection.
Click OK.

Repeat the instructions in this step for each resource that you want DBeaver to access.

Step 4: Use DBeaver to browse data objects

Use DBeaver to access data objects in your Azure Databricks workspace such as tables and table properties, views, indexes, data types, and other data object types.

In DBeaver, in the Database Navigator window, right-click the connection that you want to use.
If Connect is enabled, click it. (If Connect is disabled, you are already connected.)

Tip

You should start your resource before trying to connect to it. Otherwise the connection might take several minutes to complete while the resource starts.
Expand the connection that you just connected to.
Expand and browse available data objects. Double-click a data object to get more information about it.

Repeat the instructions in this step to access additional data objects.

Step 5: Use DBeaver to run SQL statements

Use DBeaver to load the sample trips table from the samples catalog’s nyctaxi schema.

In DBeaver, in the Database Navigator window, right-click the connection that you want to use.
If Connect is enabled, click it. (If Connect is disabled, you are already connected.)

Tip

You should start your resource before trying to connect to it. Otherwise the connection might take several minutes to complete while the resource starts.
Click SQL Editor > New SQL Script.
On the (connection-name) Script-1 tab, enter this SQL statement:
```
SELECT * FROM samples.nyctaxi.trips;
```
Click SQL Editor > Execute SQL script.

Next steps

Use the Database object editor to work with database object properties, data, and entity relation diagrams.
Use the Data editor to view and edit data in a database table or view.
Use the SQL editor to work with SQL scripts.
Work with entity relation diagrams (ERDs) in DBeaver.
Import and export data into and from DBeaver.
Migrate data using DBeaver.
Troubleshoot JDBC driver issues with DBeaver.

Κοινή χρήση μέσω