Spark SQL
Version 25.3.9396
Version 25.3.9396
Spark SQL
You can use the Spark SQL connector from the CData Sync application to capture data from Spark SQL and move it to any supported destination. To do so, you need to add the connector, authenticate to the connector, and complete your connection.
Add the Spark SQL Connector
To enable Sync to use data from Spark SQL, you first must add the connector, as follows:
-
Open the Connections page of the Sync dashboard.
-
Click Add Connection to open the Select Connectors page.
-
Click the Sources tab and locate the Spark SQL row.
-
Click the Configure Connection icon at the end of that row to open the New Connection page. If the Configure Connection icon is not available, click the Download Connector icon to install the Spark SQL connector. For more information about installing new connectors, see Connections.
Authenticate to Spark SQL
After you add the connector, you need to set the required properties.
-
Connection Name - Enter a connection name of your choice.
-
Server - Enter the host name or IP of the server that hosts your instance of Spark SQL.
CData Sync supports authenticating to Spark SQL in several ways. Select your authentication method below to proceed to the relevant section that contains the authentication details.
Plain
To connect with a plain-text login, specify these properties:
-
Auth Scheme - Select Plain.
-
User - Enter the username that you use to authenticate to your Spark SQL account.
-
Password - Enter the password that you use to authenticate to your Spark SQL account.
NOSASL
To connect without authentication, select NOSASL for Auth Scheme. No additional properties are required.
LDAP
To connect with LDAP credentials, specify the following properties:
-
Auth Scheme - Select LDAP.
-
User - Enter the username that you use to authenticate to your Spark SQL account.
-
Password - Enter the password that you use to authenticate to your Spark SQL account.
Kerberos
To connect with Kerberos, specify these settings:
-
Auth Scheme - Select Kerberos.
-
User - Enter the username that you use to authenticate to your Spark SQL account.
-
Password - Enter the password that you use to authenticate to your Spark SQL account.
-
Kerberos KDC - Enter the host name or IP address of your Kerberos Key Distribution Center (KDC) machine.
-
Kerberos Realm - Enter the Kerberos realm that you use to authenticate to Kerberos.
-
Kerberos SPN - Enter the service principal name (SPN) for the Kerberos domain controller.
-
Kerberos User - Enter the principal name for the Kerberos Domain Controller. The name should be in the form Host/User@Realm.
-
Kerberos Keytab File (optional) - Enter the full file path to your Kerberos keytab file.
-
Kerberos Ticket Cache (optional) - Enter the full file path to an MIT Kerberos credential cache file.
-
Sasl Qop - Select the quality-of-protection level for the SASL framework.
-
auth: Specifies authentication only.
-
auth-int: Specifies authentication plus integrity protection.
-
auth-conf: Specifies authentication plus integrity and confidentiality protection.
-
Complete Your Connection
To complete your connection:
-
Specify the following properties:
-
Port - Enter the port number for your instance of Spark SQL. The default port value is 10000.
-
Database (optional) - Enter the name of your Spark SQL database.
-
-
Define advanced connection settings on the Advanced tab. (In most cases, though, you should not need these settings.)
-
Click Create & Test to create your connection.
More Information
For more information about interactions between CData Sync and Spark SQL, see Spark SQL Connector for CData Sync.