Databricks
To add a database connection, click on the gear icon in the top right, and navigate to Admin settings > Databases > Add a database. Then select Databricks.
You can edit these settings at any time. Just remember to save your changes.
Display name
The display name for the database in the Metabase interface.
Host
Your database’s IP address, or its domain name (e.g., esc.mydatabase.com). This is the value of your Databrick’s compute resource’s Server Hostname.
See Compute settings for the Databricks JDBC Driver.
HTTP path
This is the Databrick’s compute resources HTTP Path value.
See Compute settings for the Databricks JDBC Driver.
Personal access token
See Personal Access Token (PAT).
Catalog
For now, you can only select one catalog. Metabase doesn’t support multi-catalog connections. If you want to use more than one catalog in Metabase, you can set up multiple connections, each selecting a different catalog.
You can’t sync Databricks’s legacy catalogs, however, including the samples
or hive_metastore
catalogs.
Schemas
You can specify which schemas you want to sync and scan. Options are:
- All
- Only these…
- All except…
For the Only these and All except options, you can input a comma-separated list of values to tell Metabase which schemas you want to include (or exclude). For example:
foo,bar,baz
You can use the *
wildcard to match multiple schemas.
Let’s say you have three schemas: foo, bar, and baz.
- If you have Only these… set, and enter the string
b*
, you’ll sync with bar and baz. - If you have All except… set, and enter the string
b*
, you’ll just sync foo.
Note that only the *
wildcard is supported; you can’t use other special characters or regexes.
Additional JDBC connection string options
You can append options to the connection string that Metabase uses to connect to your database. E.g., IgnoreTransactions=0
.
See Compute settings for the Databricks JDBC Driver.
Read docs for other versions of Metabase.