SSIS Components for Spark SQL

Build 24.0.9060

Establishing a Connection

Enabling SSIS in Visual Studio 2022

If you're using Visual Studio 2022, you will need to install the SQL Server Integration Services Projects extension to use SSIS.

  1. Navigate to Extensions > Manage Extensions.
  2. In the Manage Extensions window's search box, search for "SQL Server Integration Services Projects 2022" and select the extension in the list.
  3. Click Download.
  4. Close Visual Studio and run the downloaded Microsoft.DataTools.IntegrationServices.exe installer. Proceed through the installer with default settings.
  5. Open Visual Studio. There should now be an "Integration Services Project" project template available.

Adding the Spark SQL Connection Manager

Create a new connection manager as follows:

  1. Create a Visual Studio project with the "Integration Services Project" template.
  2. In the project, right-click within the Connection Managers window and select New Connection from the menu.
  3. In the Description column, select CData Spark SQL Connection Manager and click Add...
  4. Configure the component as described in the next section.

Alternatively, if you have an existing project and CData Spark SQL Source or CData Spark SQL Destination:

  1. Right-click your CData Spark SQL source or destination component in your data flow
  2. Select Edit... to open an editor window.
  3. Click the New... button next to the Connection manager: dropdown selector to create a connection manager.
  4. Configure the component as described in the next section.

Connecting to Spark SQL

Specify the following to establish a connection with Spark SQL:

  • Server: The host name or IP address of the server hosting SparkSQL.
  • Port: The port for the connection to the SparkSQL instance.
  • TransportMode: The transport mode to use to communicate with the SparkSQL server. Accepted entries are BINARY and HTTP. BINARY is selected by default.

Securing Spark SQL Connections

To enable TLS/SSL in the component, set UseSSL to True.

Authenticating to Spark SQL

The service may be authenticated to using the PLAIN, LDAP, NOSASL, KERBEROS auth schemes.

PLAIN

To authenticate with PLAIN, set the following connection properties:

  • AuthScheme: PLAIN.
  • User: The user to login as.
  • Password: The password of the user.
To authenticate, set User and Password.

LDAP

To authenticate with LDAP, set the following connection properties:

  • AuthScheme: LDAP.
  • User: The user to login as.
  • Password: The password of the user.
To authenticate, set User, Password, and AuthScheme.

NOSASL

When using NOSASL, no authentication is performed. Set the following connection properties:

  • AuthScheme: NOSASL.

Kerberos

For details on how to authenticate with Kerberos, see Using Kerberos.

Copyright (c) 2024 CData Software, Inc. - All rights reserved.
Build 24.0.9060