CData Python Connector for Spark SQL

Build 24.0.9060

Establishing a Connection

The objects available within our connector are accessible from the "cdata.sparksql" module. To use the module's objects directly:

  1. Import the module as follows:
    import cdata.sparksql as mod
  2. To establish a connection string, call the connect() method from the connector object using an appropriate connection string, such as:
    mod.connect("Server=127.0.0.1;")

Connecting to Spark SQL

Specify the following to establish a connection with Spark SQL:

  • Server: The host name or IP address of the server hosting SparkSQL.
  • Port: The port for the connection to the SparkSQL instance.
  • TransportMode: The transport mode to use to communicate with the SparkSQL server. Accepted entries are BINARY and HTTP. BINARY is selected by default.

Securing Spark SQL Connections

To enable TLS/SSL in the connector, set UseSSL to True.

Authenticating to Spark SQL

The service may be authenticated to using the PLAIN, LDAP, NOSASL, KERBEROS auth schemes.

PLAIN

To authenticate with PLAIN, set the following connection properties:

  • AuthScheme: PLAIN.
  • User: The user to login as.
  • Password: The password of the user.
To authenticate, set User and Password.

LDAP

To authenticate with LDAP, set the following connection properties:

  • AuthScheme: LDAP.
  • User: The user to login as.
  • Password: The password of the user.
To authenticate, set User, Password, and AuthScheme.

NOSASL

When using NOSASL, no authentication is performed. Set the following connection properties:

  • AuthScheme: NOSASL.

Kerberos

For details on how to authenticate with Kerberos, see Using Kerberos.

Copyright (c) 2024 CData Software, Inc. - All rights reserved.
Build 24.0.9060