Databricks

Version 23.4.8843


You can use the Databricks connector from the CData Sync application to capture data from Databricks and move it to any supported destination. To do so, you need to add the connector, authenticate to it, and complete the connection.

Establish a Connection

To allow Sync to use data from Databricks, you must first establish a connection to Databricks. Follow these steps to connect Databricks to your Sync account:

  1. Open the Connections page of the Sync dashboard.

  2. Click Add Connection to open the Select Connectors page.

  3. Click the Sources tab and locate the Databricks row.

  4. Click the Configure Connection icon at the end of that row. If you do not see the Configure Connection icon, add the connector by following the instructions in Connections.

  5. Enter connection settings on the Settings tab:

    • Connection Name - Enter a connection name of your choice.

    • Server - Enter the host name or the IP address of the server that hosts the Databricks database.

    • HTTP Path - Enter the HTTP path of your Databricks cluster.

    • Auth Scheme - Select the authentication scheme. PersonalAccessToken is the default setting. For this setting, you must specify your personal access token in the Token field. To locate your access token, click the Access Tokens tab on the User Settings page in your Databricks instance.

    • Cloud Storage Type - Select the type of cloud storage you are using. The default value is DBFS. Other options are Azure Blob storage and AWS S3.

    • Database - Enter the name of your Databricks database.

  6. Click Create & Test to create the connection.

  7. Define advanced connection settings on the Advanced tab. (In most cases, though, you should not need these settings.)
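
Before you enter the values in step 5, it can help to collect them in one place and check that nothing is missing. The following Python sketch is one possible way to do that. It is not part of Sync: the class name DatabricksSettings, its field names, and the problems method are hypothetical, and treating every field as required is an assumption. The defaults and the storage options come from the settings list above, but the exact identifiers that Sync uses internally may differ.

```python
from dataclasses import dataclass
from typing import List

# Storage options named in the settings list; Sync's internal identifiers may differ.
CLOUD_STORAGE_TYPES = ("DBFS", "Azure Blob storage", "AWS S3")


@dataclass
class DatabricksSettings:
    """Hypothetical container for the values entered on the Settings tab."""
    connection_name: str
    server: str
    http_path: str
    database: str
    token: str = ""
    auth_scheme: str = "PersonalAccessToken"  # default per the settings list
    cloud_storage_type: str = "DBFS"          # default per the settings list

    def problems(self) -> List[str]:
        """Return a list of problems; an empty list means nothing obvious is missing."""
        found = []
        # Assumption: all four of these fields must be filled in.
        for name in ("connection_name", "server", "http_path", "database"):
            if not getattr(self, name).strip():
                found.append(f"{name} is empty")
        # The default scheme needs a personal access token in the Token field.
        if self.auth_scheme == "PersonalAccessToken" and not self.token.strip():
            found.append("token is required for PersonalAccessToken")
        if self.cloud_storage_type not in CLOUD_STORAGE_TYPES:
            found.append(f"unknown cloud storage type: {self.cloud_storage_type}")
        return found


# Placeholder values only; substitute the details of your own workspace.
settings = DatabricksSettings(
    connection_name="databricks-source",
    server="example.cloud.databricks.com",
    http_path="sql/protocolv1/o/0/0000-000000-example",
    database="default",
)
for problem in settings.problems():
    print(problem)
```

With the placeholder values shown, the check reports only the missing token, because PersonalAccessToken is the default authentication scheme.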

More Information

For more information about interactions between CData Sync and Databricks, see Databricks Connector for CData Sync.