Spark SQL

Version 23.4.8843


Spark SQL


You can use the Spark SQL connector from the CData Sync application to capture data from Spark SQL and move it to any supported destination. To do so, you need to add the connector, authenticate to the connector, and complete your connection.

Establish a Connection

To allow Sync to use data from Spark SQL, you first must establish a connection to Spark SQL. Follow these steps to connect Spark SQL to your Sync account:

  1. Open the Connections page of the Sync dashboard.

  2. Click Add Connection to open the Select Connectors page.

  3. Click the Sources tab and locate the Spark SQL row.

  4. Click the Configure Connection icon at the end of that row. If you do not see the Configure Connection icon, you need to add the connector according to the instructions in Connections.

  5. Enter connection settings on the Settings tab:

    • Connection Name - Enter a connection name of your choice.

    • Server - Enter the host name or IP of the server that hosts your instance of Spark SQL.

    • Auth Scheme - Select the authentication scheme. Plain is the default setting.

    • User - Enter the username that you use to authenticate to Spark SQL.

    • Password - Enter the password that you use to authenticate to Spark SQL.

    • Port - Enter the port number for your instance of Spark SQL. The default port value is 10000.

    • Database - Enter the name of your Spark SQL database.

  6. Click Create & Test to create the connection.

  7. Define advanced connection settings on the Advanced tab. (In most cases, though, you should not need these settings.)

More Information

For more information about interactions between CData Sync and Spark SQL, see Spark SQL Connector for CData Sync.